Hi, I am trying to delete records from a Delta table, but the statement intermittently fails with a broadcast timeout error. Can someone please help with this? The statement is:
spark.sql(s"""DELETE FROM stg.bl WHERE concat(key, ':', revision) IN
(SELECT DISTINCT concat(bl.key, ':', bl.revision) FROM stg.bl bl LEFT SEMI JOIN
tgt.bl tgt ON bl.key = tgt.key AND bl.revision = tgt.revision)""")
The error is:
org.apache.spark.SparkException: Could not execute broadcast in 300 secs. You can increase the timeout for broadcasts via spark.sql.broadcastTimeout or disable broadcast join by setting spark.sql.autoBroadcastJoinThreshold to -1
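For reference, the two settings named in the error message can be applied per session before running the DELETE. This is only a minimal sketch, assuming an existing SparkSession (as in a notebook). The 1200-second value is an arbitrary illustration, not a recommendation, and whether either setting actually resolves the timeout for this query is untested.

```scala
import org.apache.spark.sql.SparkSession

// Assumption: a session already exists; getOrCreate() simply reuses it.
val spark = SparkSession.builder().getOrCreate()

// Option 1: raise the broadcast timeout (in seconds; the error shows 300 in effect).
// 1200 is an arbitrary illustrative value.
spark.conf.set("spark.sql.broadcastTimeout", "1200")

// Option 2: disable automatic broadcast joins, so the planner typically
// falls back to a non-broadcast strategy such as a sort-merge join.
spark.conf.set("spark.sql.autoBroadcastJoinThreshold", "-1")
```

Both are session-level settings, so they only affect statements run through this `spark` session after the calls. Usually only one of the two would be needed.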