We use broadcast as one of the joining optimize solution in spark. Could you please help me to understand below things
1) Always broadcast table size should be less than driver memory .
In this case suppose my broadcast table size is 4 GB but driver memory is 3GB , Can i increase the driver memory to 6 GB and broadcast 4 GB table
2) What could be the maximum driver memory can i provide is there any limit ?
I think , It totally depends on what we are bringing to driver ( broadcast, collect etc)
3) I heard we can broadcast upto only 2GB data , because java serialisation has support till 2GB data only , is it true ?