I am working with the data files and index files produced by Spark's shuffle mechanism, and I have a question about them: can two shuffle data files (and their two index files) from two different Spark jobs, run on the same RDDs, be merged or combined in Apache Spark?
Any help? Thanks in advance!