Questions tagged [mapr]

MapR is a commercial data platform that offers a HDFS compatible distributed file system, a database that allows to store data in BigTable or JSON and a streaming platform for messaging. MapR leverages APIs from open source tools such as Hadoop, Kafka, HBase and provides a proprietary implementation written in C optimised for improved performance.

MapR is a complete enterprise-grade distribution for Apache Hadoop. The MapR Converged Data Platform has been engineered to improve Hadoop’s reliability, performance, and ease of use.

The MapR distribution provides a full Hadoop stack that includes the MapR File System (MapR-FS), the MapR-DB NoSQL database management system, MapR Streams, the MapR Control System (MCS) user interface, and a full family of Hadoop ecosystem projects. You can use MapR with Apache Hadoop, HDFS, and MapReduce APIs.

MapR supports the Hadoop 2.x architecture and YARN (Yet Another Resource Negotiator). Hadoop 2.x and YARN make up a resource management and scheduling framework that distributes resource management and job management duties.

enter image description here

There are three MapR editions.

  • MapR Community Edition (formerly M3)
    • Free community edition.
  • MapR Enterprise Edition (formerly M5)
    • Adds high availability and data protection, including multi-node NFS.
  • MapR Enterprise Database Edition (formerly M7)
    • Adds structured table data natively in the storage layer and provides a flexible NoSQL database.

MapR can be installed on many versions of Red Hat Enterprise linux, CentOS, Ubuntu, Oracle Linux, and SUSE. A full matrix of supported Linux operating systems can be found here.

To install MapR the following requirements are needed.

  • A 64-bit CPU.
  • One of the above mentioned operating systems. (Red Hat Enterprise linux, CentOS, Ubuntu, Oracle Linux, or SUSE)
  • A minimum of 8GB of RAM.
  • At least one single unformatted disk.
  • A Resolvable hostname.
  • A common user on each server you wish to install MapR on.
  • Java 1.7.0 or higher.
  • Other
    • NTP, Syslog, PAM



Try MapR

Download the MapR Sandbox for VMware or Virtualbox for free.

OR

Install MapR on your own. Check to see if the installer is supported for your OS

You will have to meet the prerequisites for a successful installation

Get the mapr-setup sctipt from the MapR repository.

wget http://package.mapr.com/releases/installer/mapr-setup.sh

Run the mapr-setup script to start the installation.

bash ./mapr-setup.sh -y

Open the web UI with the following URL

https://<Installer node hostname/IPaddress>:9443

Following the prompts and you will be on your way to installing MapR.

There is also manual installation available. Full instructions can be viewed here.

Extensive documentation can be found on MapR's documentation site. http://maprdocs.mapr.com/home/



The Stackoverflow tag [mapr] can be used for questions about issues you have with the MapR platform.

381 questions
0
votes
1 answer

Why is MapR giving me a null pointer when reading files?

I get the following exception when reading files from a mapr directory: java.lang.NullPointerException at com.mapr.fs.MapRFsInStream.read(MapRFsInStream.java:150) at java.io.DataInputStream.read(DataInputStream.java:83) at…
user3335040
  • 649
  • 1
  • 7
  • 17
0
votes
2 answers

How to enable additional logging when running `hadoop fs` with MAPRFS?

When I run this command: hadoop fs -copyFromLocal /tmp/1GB.img 'maprfs://maprfs.example.com/tmp/1GB.img' I get the following errors. 2014-11-05 01:21:08,7669 ERROR Client fs/client/fileclient/cc/writebuf.cc:154 Thread: 240 FlushWrite failed: File…
coderfi
  • 378
  • 3
  • 7
0
votes
1 answer

MAPR M7 hbase cluster is not populating .META. table

I was finding the reason that why MAPR M7 EMR cluster is not populating HBASE .META. table. I tried to fix using hbck command also but still got no success. Any help will be highly appriciated
Subhradip Bose
  • 3,065
  • 2
  • 13
  • 17
0
votes
1 answer

what is Container Architecture used in MapR distribution?

MapR comes with a new architecture (Container Architecture) which is different from the HDFS approach . what is the differnet ? how this stores data ? what are the advantages and disadvantages ?
Nagendra kumar
  • 215
  • 1
  • 2
  • 14
0
votes
1 answer

How to schedule Hadoop jobs conditionally?

I am pretty new to Hadoop, and particularly to Hadoop Job Scheduling. Here is what I am trying to do. I have 2 flows, each having a Hadoop job. I have freedom to put these flows either in the same project or in different ones. I don't want the…
Bhushan
  • 18,329
  • 31
  • 104
  • 137
0
votes
1 answer

Chef to manage Hadoop COnfiguration / services Cluster

I am trying to develop Chef cookbooks to Deploy and Manage Hadoop Cluster (specifically MapR). As per my understanding Chef is a good tool to install hadoop cluster in an automated way. But I am not sure how Chef can help to manage/change…
Pradeep Bhadani
  • 4,435
  • 6
  • 29
  • 48
0
votes
4 answers

Installing openssh on centos

I am trying to install sshpass on centos, as I want to install MapR on our systems. I am following guidelines given here: http://doc.mapr.com/display/MapR/Quick+Installation+Guide I get following issue: sudo yum install sshpass Loaded plugins:…
techele
  • 165
  • 3
  • 8
0
votes
0 answers

Loading extremely long lines with TextLine in Cascading

I'm using TextLine in Cascading to load files with very large lines in Cascading. The lines are very long - around 30Mb on average, some much longer. When I run the job locally to test it it runs fine, but when I run it on the cluster it fails after…
Savage Reader
  • 387
  • 1
  • 4
  • 16
0
votes
2 answers

Reverse pagination with HBase

I am working on hbase table pagination. functionality i need to implement is, UI pagination table should have next/forward and back/backward feature. Forward option i implemented using PageFilter but for backward, i need to get the rows(20) from end…
Rahul Sharma
  • 5,614
  • 10
  • 57
  • 91
0
votes
2 answers

How to list all HBase M7 tables from java client

i am able to list HbaseM7 tables using command- hbase>list '/mapr/cluster/tables/' But, when I am trying to list the tables using HBaseManager it doesn't workout.. code snippet Configuration conf =//set all m7 cluster; HBaseAdmin admin = new…
Rahul Sharma
  • 5,614
  • 10
  • 57
  • 91
0
votes
1 answer

Classpath to use for MapR/Hadoop/Hive

I'm trying to compile some java code for hadoop and need to know what classpath I need to specify. For cloudera I use this below but what do I use for a MapR installation? Surprisingly I could only find how to set the classpath in google, not what…
MikeKulls
  • 873
  • 1
  • 10
  • 22
0
votes
1 answer

Error in installing Mapr

hi I am trying to install MapR on ubuntu 12.04 precise but facing some isuues. I followd below link for installation http://doc.mapr.com/display/MapR/Quick+Installation+Guide I followed all the steps mentioned in the link Below is the error i am…
Chhaya Vishwakarma
  • 1,407
  • 9
  • 44
  • 72
0
votes
1 answer

how to avoid filling up hadoop logs on nodes?

When our Cascading jobs encounter an error in data, they throw various exceptions… These end up in the logs, and if the logs fill up, the cluster stops working. do we have any config file to be edited/configured to avoid such scenarios? we are using…
0
votes
2 answers

Connecting to Hbase Mapr using Java

Hi I am running the hbase VMWare sandbox MapR-Sandbox-For-Hadoop-3.1.0_VM. I have made no changes to the sandbox. I can access the control panel via browser. On another linux VM I have eclipse and the hbase client installed. I have a java program…
Rob McFeely
  • 2,823
  • 8
  • 33
  • 50
0
votes
1 answer

MapR - How to Install Sqoop on a Client/Edge Node?

I'm familiar with the Cloudera architecture but not MapR. I have a RHEL vm and previously installed the MapR client software using this documentation. I am able to submit mapreduce jobs and query HDFS as expected. I followed this documentation…
Matthew Moisen
  • 16,701
  • 27
  • 128
  • 231