0

How to retrieve TBs of data from HDFS using Rhdfs package because data is stored on multiple machines and R runs on single machine.

How this much much amount of data is stored in R dataframe on a single system. If so, how can that huge data is stored in single hardware, basically which conflicts Big Data Storage concept.

Prabhat Jain
  • 321
  • 1
  • 4
  • 9
  • Have you seen SparkR - https://spark.apache.org/docs/latest/sparkr.html – Binary Nerd Jul 28 '16 at 10:18
  • Yeah. I had worked on SparkR. I want to know. How R retrives this much data on Its intrrface. Because I didn't use distributed environment while running R job with Spark and Hadoop. – Prabhat Jain Jul 28 '16 at 10:23

0 Answers0