I have two input files; let's call them master and slave. I want to join them on a common key as a full outer join (keep records from both sides, with null where the other side has no match), and I want the result in key order.
Basically, the end result should be any kind of Java RDD whose elements look like
<master record, slave record>
with null on whichever side has no matching record.
I don't want to use expensive operations like sort, zip by key, or join itself. I am looking for a custom reader I can write that reads both files together, so that I don't have to read them separately and join them later. Any ideas on how to write one?
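To make the target concrete, here is a minimal plain-Java sketch (no Spark involved) of the pairing I am after. It assumes both files are already sorted by key in the same order and that keys are unique within each file; neither is guaranteed by anything above. The class name `MergeJoinSketch`, the `key,payload` line layout, and the helper `keyOf` are all hypothetical, chosen only for illustration.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

final class MergeJoinSketch {

    // Hypothetical record layout: the key is everything before the first comma.
    static String keyOf(String line) {
        int comma = line.indexOf(',');
        return comma < 0 ? line : line.substring(0, comma);
    }

    // Single-pass full outer merge.
    // Assumptions: both lists are sorted by key (String.compareTo order)
    // and each key appears at most once per list.
    static List<Map.Entry<String, String>> fullOuterMerge(List<String> master,
                                                          List<String> slave) {
        List<Map.Entry<String, String>> out = new ArrayList<>();
        int i = 0;
        int j = 0;
        while (i < master.size() || j < slave.size()) {
            if (j >= slave.size()) {
                // Slave exhausted: remaining master records have no partner.
                out.add(new SimpleEntry<String, String>(master.get(i++), null));
            } else if (i >= master.size()) {
                // Master exhausted: remaining slave records have no partner.
                out.add(new SimpleEntry<String, String>(null, slave.get(j++)));
            } else {
                int cmp = keyOf(master.get(i)).compareTo(keyOf(slave.get(j)));
                if (cmp == 0) {
                    // Same key on both sides: emit the matched pair.
                    out.add(new SimpleEntry<String, String>(master.get(i++), slave.get(j++)));
                } else if (cmp < 0) {
                    // Master key is smaller: it has no match in slave.
                    out.add(new SimpleEntry<String, String>(master.get(i++), null));
                } else {
                    // Slave key is smaller: it has no match in master.
                    out.add(new SimpleEntry<String, String>(null, slave.get(j++)));
                }
            }
        }
        return out;
    }
}
```

A custom reader would presumably do the same thing with two open file streams instead of two lists, emitting one pair per step.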
Thanks!
edit: I am not looking for ready-made code. Even a rough guideline or outline would help.