2

is OpenMPI supported on distributed file systems like HDFS(Hadoop Distributed File System)?

rishi007bansod
  • 1,283
  • 2
  • 19
  • 45

1 Answers1

2

Hadoop is not implemented using MPI because

  1. MPI is a message passing interface whereas MapReduce is a tool initially developed by Google for running Big Data applications.
  2. One of the main feature available in Hadoop/Mapreduce is fault tolerance. But it is not supported in MPI.

Refer
Why isn't Hadoop implemented using MPI?
MPI and MapReduce

Let we come to your question,

An article says that,

The prospect of running MPI jobs under YARN is an enticing proposition. As mentioned, the vast stores of data now living in HDFS can be accessed by non-MapReduce applications like those written in MPI. (Please note, however, that the intimate interface of MapReduce to HDFS is part of the MapReduce framework and is not automatic in other frameworks and must be managed by the user’s application.)

The prospect of running Open MPI under YARN has been investigated by Ralph H. Castain of the Open MPI team.

you can refer the below link for more information,

The New Hadoop

Community
  • 1
  • 1
Kumar
  • 3,782
  • 4
  • 39
  • 87