
I created a Delta table in Databricks using SQL:

%sql
create table nx_bronze_raw
(
  `Device` string
)
USING DELTA LOCATION '/mnt/Databricks/bronze/devices/';

Then I ingest data (the Device column) into this table using:

bronze_path = '/mnt/Databricks/bronze/devices/'
df.select('Device').write.format("delta").mode("append").save(bronze_path)

The underlying storage is Azure Blob Storage, and the Databricks runtime is 12.1.

The problem is that when I query this table it returns 0 records:

df_read = spark.read.format("delta").load("/mnt/Databricks/bronze/devices/")
display(df_read)

Query returned no results

However, when I look inside the storage account, the Delta files have been created with the expected size (screenshot omitted).

What went wrong in this scenario, especially since no error is returned? And why can't I retrieve the data?


1 Answer


The following are possible reasons for getting empty results.

  • The DataFrame you wrote was empty (a quick check is sketched below).

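A minimal sketch of that check, using the df and bronze_path variables from the question (count() is a standard PySpark action):

# Verify the source DataFrame actually contains rows before writing.
row_count = df.select("Device").count()
print(f"Rows about to be written: {row_count}")

if row_count > 0:
    df.select("Device").write.format("delta").mode("append").save(bronze_path)
else:
    print("Source DataFrame is empty - nothing will land in the Delta table")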

  • The table was truncated between writing and reading it.

(Screenshots of the table contents before and after truncation omitted.)

In this case, you can check the table's history with DESCRIBE HISTORY and read the data of a specific version, as shown in the sketch below.

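A minimal sketch of inspecting the history, assuming the same example path as the read below (DESCRIBE HISTORY is the standard Delta Lake command for listing table versions):

# List the table's versions and operations to find one that still has the data.
history = spark.sql("DESCRIBE HISTORY delta.`/mnt/Databricks/bronze/devices2/`")
display(history.select("version", "timestamp", "operation"))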

Here, we select the version before the table was truncated:

df_read = spark.read.format("delta").option("versionAsOf", 3).load("/mnt/Databricks/bronze/devices2/")
display(df_read)


  • There is a chance that the data has been written to the Delta files but hasn't been flushed to the table yet. To ensure that all changes are visible, you can try running OPTIMIZE (a Python alternative is sketched after the SQL below).

code:

%sql
optimize raw;

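If you prefer to run OPTIMIZE from a Python cell, a minimal sketch (assuming the table path from the question) is:

# Same OPTIMIZE, issued from Python against the path-based Delta table.
spark.sql("OPTIMIZE delta.`/mnt/Databricks/bronze/devices/`")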
