Since some data are shared among all the map()
functions, I can't generate them in the setup(), since each setup()
corresponds to each map()
function, and what I want to do is to pre-produce some data and store it somewhere achievable, then use it in each map()
.
How can I possibly do it? Say I am doing KNN with Mapreduce, and I want to use all of the test data by each map()
. Where should I store those test data and then use them in the mapper?
Thank you so much.