Questions tagged [mapr]

MapR is a commercial data platform that offers a HDFS compatible distributed file system, a database that allows to store data in BigTable or JSON and a streaming platform for messaging. MapR leverages APIs from open source tools such as Hadoop, Kafka, HBase and provides a proprietary implementation written in C optimised for improved performance.

MapR is a complete enterprise-grade distribution for Apache Hadoop. The MapR Converged Data Platform has been engineered to improve Hadoop’s reliability, performance, and ease of use.

The MapR distribution provides a full Hadoop stack that includes the MapR File System (MapR-FS), the MapR-DB NoSQL database management system, MapR Streams, the MapR Control System (MCS) user interface, and a full family of Hadoop ecosystem projects. You can use MapR with Apache Hadoop, HDFS, and MapReduce APIs.

MapR supports the Hadoop 2.x architecture and YARN (Yet Another Resource Negotiator). Hadoop 2.x and YARN make up a resource management and scheduling framework that distributes resource management and job management duties.

enter image description here

There are three MapR editions.

  • MapR Community Edition (formerly M3)
    • Free community edition.
  • MapR Enterprise Edition (formerly M5)
    • Adds high availability and data protection, including multi-node NFS.
  • MapR Enterprise Database Edition (formerly M7)
    • Adds structured table data natively in the storage layer and provides a flexible NoSQL database.

MapR can be installed on many versions of Red Hat Enterprise linux, CentOS, Ubuntu, Oracle Linux, and SUSE. A full matrix of supported Linux operating systems can be found here.

To install MapR the following requirements are needed.

  • A 64-bit CPU.
  • One of the above mentioned operating systems. (Red Hat Enterprise linux, CentOS, Ubuntu, Oracle Linux, or SUSE)
  • A minimum of 8GB of RAM.
  • At least one single unformatted disk.
  • A Resolvable hostname.
  • A common user on each server you wish to install MapR on.
  • Java 1.7.0 or higher.
  • Other
    • NTP, Syslog, PAM



Try MapR

Download the MapR Sandbox for VMware or Virtualbox for free.

OR

Install MapR on your own. Check to see if the installer is supported for your OS

You will have to meet the prerequisites for a successful installation

Get the mapr-setup sctipt from the MapR repository.

wget http://package.mapr.com/releases/installer/mapr-setup.sh

Run the mapr-setup script to start the installation.

bash ./mapr-setup.sh -y

Open the web UI with the following URL

https://<Installer node hostname/IPaddress>:9443

Following the prompts and you will be on your way to installing MapR.

There is also manual installation available. Full instructions can be viewed here.

Extensive documentation can be found on MapR's documentation site. http://maprdocs.mapr.com/home/



The Stackoverflow tag [mapr] can be used for questions about issues you have with the MapR platform.

381 questions
-1
votes
2 answers

Cannot access hdfs file system running in mapr sandbox VM

I have just installed the MapR sandbox virtual machine running in Virtualbox. The VM is set up using "NAT" network mode and ports are forwarded to my Mac. Since the ports are forwarded I am guessing that I should be able to access the hdfs on…
user152468
  • 3,202
  • 6
  • 27
  • 57
-1
votes
1 answer

Submitting pig jobs (in MRv1 mode) to Mapr yarn framework

How do I submit pig jobs to the Mapr yarn framework. Any properties file changes required? Do I have to alter Pig scripts running in MRv1 mode?
-2
votes
1 answer

Which distribution of hadoop is better?

I am working with massive data, my input data is about 100 GB.I want to choose one of the hadoop distributions, but i don't know to choose mapr cluster or cloudera cluster. i want to use free versions(mapr M3 and cloudera CDH4 that uses hadoop…
csperson
  • 901
  • 3
  • 12
  • 17
-3
votes
2 answers

What is the difference between MapR and Map Reduce?

Is MapReduce the underlying algorithm for MapR or MapR FS? Are the two terms related at all? If yes, please throw some light on the relation. Is MapR related to Big Data? Please be as detailed and as clear as possible in the answer. MapR FS -…
Praveen Nvs
  • 331
  • 3
  • 14
-3
votes
3 answers

How to split rows of a Spark RDD by Deliminator

I am trying to split data in Spark into the form of an RDD of Array[String]. Currently I have loaded the file into an RDD of String. > val csvFile = textFile("/input/spam.csv") I would like to split on a a , deliminator.
Mayank
  • 165
  • 1
  • 5
  • 20
-5
votes
1 answer

Begenner at spark Big data programming (spark code)

i'm learning spark for distributed systemes. i runned this code and it's worked. but i know that it's count word in input files but i have probleme undestanding how Methods are written and what the us of JavaRDD public class JavaWordCount { public…
Naoufal Abde
  • 21
  • 1
  • 2
1 2 3
25
26