Questions tagged [mapr]

MapR is a commercial data platform that offers a HDFS compatible distributed file system, a database that allows to store data in BigTable or JSON and a streaming platform for messaging. MapR leverages APIs from open source tools such as Hadoop, Kafka, HBase and provides a proprietary implementation written in C optimised for improved performance.

MapR is a complete enterprise-grade distribution for Apache Hadoop. The MapR Converged Data Platform has been engineered to improve Hadoop’s reliability, performance, and ease of use.

The MapR distribution provides a full Hadoop stack that includes the MapR File System (MapR-FS), the MapR-DB NoSQL database management system, MapR Streams, the MapR Control System (MCS) user interface, and a full family of Hadoop ecosystem projects. You can use MapR with Apache Hadoop, HDFS, and MapReduce APIs.

MapR supports the Hadoop 2.x architecture and YARN (Yet Another Resource Negotiator). Hadoop 2.x and YARN make up a resource management and scheduling framework that distributes resource management and job management duties.

enter image description here

There are three MapR editions.

  • MapR Community Edition (formerly M3)
    • Free community edition.
  • MapR Enterprise Edition (formerly M5)
    • Adds high availability and data protection, including multi-node NFS.
  • MapR Enterprise Database Edition (formerly M7)
    • Adds structured table data natively in the storage layer and provides a flexible NoSQL database.

MapR can be installed on many versions of Red Hat Enterprise linux, CentOS, Ubuntu, Oracle Linux, and SUSE. A full matrix of supported Linux operating systems can be found here.

To install MapR the following requirements are needed.

  • A 64-bit CPU.
  • One of the above mentioned operating systems. (Red Hat Enterprise linux, CentOS, Ubuntu, Oracle Linux, or SUSE)
  • A minimum of 8GB of RAM.
  • At least one single unformatted disk.
  • A Resolvable hostname.
  • A common user on each server you wish to install MapR on.
  • Java 1.7.0 or higher.
  • Other
    • NTP, Syslog, PAM



Try MapR

Download the MapR Sandbox for VMware or Virtualbox for free.

OR

Install MapR on your own. Check to see if the installer is supported for your OS

You will have to meet the prerequisites for a successful installation

Get the mapr-setup sctipt from the MapR repository.

wget http://package.mapr.com/releases/installer/mapr-setup.sh

Run the mapr-setup script to start the installation.

bash ./mapr-setup.sh -y

Open the web UI with the following URL

https://<Installer node hostname/IPaddress>:9443

Following the prompts and you will be on your way to installing MapR.

There is also manual installation available. Full instructions can be viewed here.

Extensive documentation can be found on MapR's documentation site. http://maprdocs.mapr.com/home/



The Stackoverflow tag [mapr] can be used for questions about issues you have with the MapR platform.

381 questions
3
votes
1 answer

How to Install Sqoop in Windows system and what are all necessary things it requires?

I tried to install sqoop for that I was referring to some installation guide that says HBase and mapr to be installed as prerequisite. I am not understanding how to proceed. So can some one guide me how to Install Sqoop in Windows system and what…
Shreyuth
  • 116
  • 3
  • 13
3
votes
1 answer

MapR client not executing hadoop - Windows

I have an Amazon Windows VM where i did install MapR-Client 2.1.2, and another MapR cluster waiting for the jobs to be executed. I set up MAPR_HOME in C:\opt\mapr, and when I execute hadoop fs -ls / from C:\opt\mapr\hadoop\hadoop-0.20.2\bin I…
cybertextron
  • 10,547
  • 28
  • 104
  • 208
3
votes
1 answer

rhive.connect() issues with MapR distribution

After loading and initializing RHive (using rhive.init()) I get the following errors with rhive.connect(): java.lang.UnsatisfiedLinkError: no MapRClient in java.library.path at java.lang.ClassLoader.loadLibrary(ClassLoader.java:1734) …
2
votes
1 answer

where is core-default.xml file?

I'm interested in value of fs.s3a.connection.ssl.enabled parameter in my mapr cluster. I know the value is set in core-default.xml (if not overwritten by core-site.xml) but I cannot find core-default.xml file. Any suggestions where it can be? Is…
psmith
  • 1,769
  • 5
  • 35
  • 60
2
votes
1 answer

Java Version Conflct with Protobuf

I am creating a GRPC endpoint that will accept data and pass the data on to MapR Streams (port of Kafka). I have created the GPRC endpoint and that is working fine when dumping the data to console. When I add the code to push the messages to…
MikeKulls
  • 873
  • 1
  • 10
  • 22
2
votes
1 answer

HIVE : Insert Query Fails with error "java.lang.OutOfMemoryError: GC overhead limit exceeded"

My Hive Insert Query is getting failed with the below error : java.lang.OutOfMemoryError: GC overhead limit exceeded Data in table2 = 1.7tb Query : set hive.exec.dynamic.partition.mode= nonstrict;set hive.exec.dynamic.partition=true;set…
2
votes
1 answer

Querying MapR with db-shell - where clause

The following works: mapr dbshell 'find /sourcepath --q {"$select":"source","$where":{"$and":[{"$eq":{"status":"file_LOAD_START"}}]}}' The following works too: mapr dbshell 'find /sourcepath --q…
Stefan Papp
  • 2,199
  • 1
  • 28
  • 54
2
votes
3 answers

Difference b/w Mapr Vs Cloudera?

Cloudera is free edition and enterprise edition but MapR is almost enterprise edition why? is there any major difference between them?
user6772304
2
votes
2 answers

Apache Drill - Getting error on KVGEN method

I have a huge dataset where each record have json data similar to below…
Lokesh Jain
  • 396
  • 3
  • 11
2
votes
1 answer

Apache Drill: Hash join does not support schema changes Error on extra column

When querying a table in Apache Drill, i get this error: SYSTEM ERROR: SchemaChangeException: Hash join does not support schema Changes. The query is like below: Select abc, xyz, proxyip from D_UNIV_TABLE where SUB_NUM in (select SUB_NUM from…
Srihari Karanth
  • 2,067
  • 2
  • 24
  • 34
2
votes
1 answer

MapR maprcli command not found

I have installed mapr-client,mapr-httpfs and also I have run the command below: ./configure.sh -N my.cluster.com -c -secure -C 192.168.145.121:7222 -HS 192.168.145.122 and the result is: create /opt/mapr/conf/conf.old Configuring Hadoop-2.7.0 at…
郑善宇
  • 41
  • 1
  • 4
2
votes
1 answer

pyspark streaming with kafka error

I am using spark 2.1.0 version with kafka 0.9 in MapR environment.I am trying to read from Kafka topic into spark streaming. However i am facing error as below when i am running Kafkautils createDirectStream command. py4j.protocol.Py4JError: An…
scn3089
  • 31
  • 7
2
votes
1 answer

DNS incosistent

I am trying to create a MAPR cluster, nut I have DNS inconsistent warning. I have edited my /etc/hosts file as 10.0.0.10 master.aptus.com 10.0.0.20 slave1.aptus.com 10.0.0.30 slave2.aptus.com These system contribute to establishing cluster. When i…
2
votes
2 answers

Establishing connection to drill using pyspark

I am trying to fetch the data from MapR DB into a dataframe using drill to connect in pyspark shell. Here is what I do in my pyspark shell : `dataframe_mysql = sqlContext.read.format("jdbc").option("url",…
AYa
  • 421
  • 3
  • 9
  • 21
2
votes
2 answers

Unable to import SparkContext

I'm working on CentOS, I've setup $SPARK_HOME and also added path to bin in $PATH. I can run pyspark from anywhere. But when I try to create python file and uses this statement; from pyspark import SparkConf, SparkContext it throws following…
Mubin
  • 4,325
  • 5
  • 33
  • 55
1 2
3
25 26