Questions tagged [mapr]

MapR is a commercial data platform that offers a HDFS compatible distributed file system, a database that allows to store data in BigTable or JSON and a streaming platform for messaging. MapR leverages APIs from open source tools such as Hadoop, Kafka, HBase and provides a proprietary implementation written in C optimised for improved performance.

MapR is a complete enterprise-grade distribution for Apache Hadoop. The MapR Converged Data Platform has been engineered to improve Hadoop’s reliability, performance, and ease of use.

The MapR distribution provides a full Hadoop stack that includes the MapR File System (MapR-FS), the MapR-DB NoSQL database management system, MapR Streams, the MapR Control System (MCS) user interface, and a full family of Hadoop ecosystem projects. You can use MapR with Apache Hadoop, HDFS, and MapReduce APIs.

MapR supports the Hadoop 2.x architecture and YARN (Yet Another Resource Negotiator). Hadoop 2.x and YARN make up a resource management and scheduling framework that distributes resource management and job management duties.

enter image description here

There are three MapR editions.

  • MapR Community Edition (formerly M3)
    • Free community edition.
  • MapR Enterprise Edition (formerly M5)
    • Adds high availability and data protection, including multi-node NFS.
  • MapR Enterprise Database Edition (formerly M7)
    • Adds structured table data natively in the storage layer and provides a flexible NoSQL database.

MapR can be installed on many versions of Red Hat Enterprise linux, CentOS, Ubuntu, Oracle Linux, and SUSE. A full matrix of supported Linux operating systems can be found here.

To install MapR the following requirements are needed.

  • A 64-bit CPU.
  • One of the above mentioned operating systems. (Red Hat Enterprise linux, CentOS, Ubuntu, Oracle Linux, or SUSE)
  • A minimum of 8GB of RAM.
  • At least one single unformatted disk.
  • A Resolvable hostname.
  • A common user on each server you wish to install MapR on.
  • Java 1.7.0 or higher.
  • Other
    • NTP, Syslog, PAM



Try MapR

Download the MapR Sandbox for VMware or Virtualbox for free.

OR

Install MapR on your own. Check to see if the installer is supported for your OS

You will have to meet the prerequisites for a successful installation

Get the mapr-setup sctipt from the MapR repository.

wget http://package.mapr.com/releases/installer/mapr-setup.sh

Run the mapr-setup script to start the installation.

bash ./mapr-setup.sh -y

Open the web UI with the following URL

https://<Installer node hostname/IPaddress>:9443

Following the prompts and you will be on your way to installing MapR.

There is also manual installation available. Full instructions can be viewed here.

Extensive documentation can be found on MapR's documentation site. http://maprdocs.mapr.com/home/



The Stackoverflow tag [mapr] can be used for questions about issues you have with the MapR platform.

381 questions
2
votes
0 answers

pip install pandas couldn't find any downloads that satisfy the requirement pandas

while I'm trying to install pandas , getting below error . can you please suggest me to get solved ? [mapr@csdssqwqasw22 ~]$ pip install pandas Downloading/unpacking pandas Cannot fetch index base URL https://pypi.python.org/simple/ Could not…
MapReddy Usthili
  • 288
  • 1
  • 7
  • 23
2
votes
2 answers

Hadoop Hive query optimisation

I have a table that is partitioned by date in the format yyyyMMdd. If I do a simple query like this: SELECT COUNT(*) FROM MyTable WHERE Date >= '20140924' then it will scan 3 days of data (today is the 26th). However I would like my query to always…
MikeKulls
  • 873
  • 1
  • 10
  • 22
2
votes
1 answer

How to use .jar in a pig file

I have two input files smt.txt and smo.txt. The jar file reads the text files and split the data according to some rule which is described in java file. And the pig file takes these data put into output files with doing mapreduce. register…
erbileren
  • 152
  • 2
  • 9
2
votes
1 answer

Joining with a hbase table in pig error: No FileSystem for scheme: hbase

I am trying to joining a hbase table with some information from an hdfs file. I am relatively new to hbase and pig. set hbase.zookeeper.quorum 'mprhdp2-dev.local' set hbase.zookeeper.property.clientPort 5100 A = LOAD '/raw/JoinTest2.txt' USING…
2
votes
1 answer

Getting Amazon EMR to use S3 for input and output

How would I get Amazon EMR (0.20.205 MapR) to use S3 buckets for input and output? I tried adding the following to the core configuration xml file (through bootstrap actions): fs.default.name
jldupont
  • 93,734
  • 56
  • 203
  • 318
1
vote
1 answer

How to save a file in MapR HDFS using Ruby

Is there a way to save a file in HDFS using MapR distribution of Hadoop from Ruby? Apparently, there's a Thrift API called thriftfs that makes it possible to communicate with HDFS from clients but looks like it is not bundled with MapR.
Behrang
  • 46,888
  • 25
  • 118
  • 160
1
vote
1 answer

Map function over nested columns

I was wondering if I could get some help with the code below. I would like to create a new list column with the results of my peaksizefunction. I think the error may be miss using the map function? library(pracma)##for findpeaks…
1
vote
1 answer

hive tez mapr distribution query have issues with tez engine

Query fails on tez but fine on MR. We have tried many different permutations and combinations of set ... paarmeters but unable to run this query successfully in tez. Query in MR runs in about 20 mins but tez engine, we saw that query was consuming…
Sanjay
  • 63
  • 7
1
vote
1 answer

How can I read files from a MapR cluster using Go?

I have a Go application running in a Kubernetes cluster which needs to read files from a large MapR cluster. The two clusters are separate and the Kubernetes cluster does not permit us to use the CSI driver. All I can do is run userspace apps in…
Mihai Todor
  • 8,014
  • 9
  • 49
  • 86
1
vote
1 answer

Volume creation MAPR

I am pretty new to MapR and I have a task about creating a MapR volume. I used this command maprcli volume create -name test-volume -path /MyCluster/apps/application_logs/node1 to create the volume but I get this error: ERROR (2) - Failed to mount…
1
vote
2 answers

How to determine which Apache Drill drillbit was selected when connecting via zookeeper?

I'm using a zookeeper connection string to get a connection to a drill cluster. I would like to log which drillbit (hostname or IP address) I successfully connected to, but I can't find any property or method on the java.sql.Connection class to get…
Mark Rajcok
  • 362,217
  • 114
  • 495
  • 492
1
vote
0 answers

Authentication to Apache Drill is temporary failing

I'm running a 5 Nodes Mapr Drill cluster, and everything is working fine, except that sometimes (can be multiple time during the day, sometimes once in a few days, no specific pattern), when I try to connect to one of the drillbits (Via Drill Web-UI…
kfy
  • 11
  • 1
1
vote
0 answers

Error copying large files from HDFS to local

Recently, our hadoop cluster running mapr 5.2 has been throwing an error that only seems to affect large files in HDFS. When the primary account (that is running all the mapr services) interacts with a large file, it works fine. When any other…
user268859
  • 13
  • 2
1
vote
1 answer

Apache Spark: Cluster with nodes of different configuration

I have a production box, where it has 14 nodes. Out of them 14, 12 nodes are of same configuration and 2 of them with higher configuration (almost 3times), so 1> will it impact the over all resource utilization of spark. 2> how can I make use of…
anurag1007
  • 107
  • 1
  • 10
1
vote
2 answers

Uploading file with square brackets in its name to Hadoop via hadoop fs -put

I have a file that has a square bracket in its name. This file needs to be uploaded to Hadoop via hadoop fs -put. I am using MapR 6. The following variants lead to a put: unexpected URISyntaxException hadoop fs -put aaa[bbb.txt /destination hadoop…
Stefan Papp
  • 2,199
  • 1
  • 28
  • 54