My question is more related to the actual code. I am working on Hadoop 2.7.7.
I seem to understand that after an application has been submitted by the client, the Resource Manager has to assign one application master to process the application.
At this point, when the Resource Manager communicates with the Name Node to get the meta data of the files required for processing, which package and class is this communication located at?
Edit: Currently I am looking at FifoScheduler.java in the package org.apache.hadoop.yarn.server.resourcemanager.scheduler.fifo. The method private void assignContainers(FiCaSchedulerNode node) choses the application from a fifo list of applications that have a request to launch containers for a node. So logically I believe that by the time the fifoscheduler is looking to chose the application from the fifo list, the RM already has the information about the files/file locations required for ay application on the list. I looked at the all the methods in the call hierarchy for the aforementioned assignContainers() method in FifoScheduler.java and could find no clues as to where/when the RM receives information about the files (or metadata of files) required for an application.
This is the best I could describe where I am looking in the code. If you need any additional information I would gladly provide that. I apologize if this is unclear.