I have recently been introduced to SparkSQL
. We use Spark 2.4
. I recently found out that SparkSQL
query supports the following hints for its Join strategies:
- BROADCAST hint
- MERGE hint
- SHUFFLE_HASH hint
Unfortunately, I have not found any online materials which elaborately discuss these hints and their application scenarios. I wish to learn some tips regarding when to use these hints in a query Join for improving query performance.
Can anyone explain with some examples. Any help is appreciated. Thanks