Questions tagged [cloudera]

Cloudera Inc. is a Palo Alto-based enterprise software company which provides Apache Hadoop-based software and services.

Cloudera, the commercial Hadoop company, develops and distributes Hadoop, the open source software that powers the data processing engines of the world’s largest and most popular websites.

Cloudera's Distribution including Apache Hadoop (CDH) is a free package built from the powerful, flexible, scalable Apache Hadoop software. To help you learn about Hadoop and how to use it, Cloudera offers public and private training, certification and online courseware.

Useful Links

Related Tags

2533 questions
16
votes
5 answers

Is there a way to add nodes to a running Hadoop cluster?

I have been playing with Cloudera and I define the number of clusters before I start my job then use the cloudera manager to make sure everything is running. I’m working on a new project that instead of using hadoop is using message queues to…
user1735075
  • 3,221
  • 4
  • 16
  • 16
16
votes
7 answers

Repair HBase table (unassigned region in transition)

I'm a bit stuck repairing a faulty table (on Hbase 0.92.1-cdh4.0.0, Hadoop 2.0.0-cdh4.0.0) There is a region in transition that doesn't finish: Region State bf2025f4bc154914b5942af4e72ea063…
Mario
  • 1,801
  • 3
  • 20
  • 32
15
votes
6 answers

Cloudera Manager fails to add hosts

I've followed the installation procedure from here and when I reach the Inspect Role Assignments stage I only see one managed host: localhost.localdomain. Any subsequent attempts to add other hosts have the same outcome: each cluster host…
ene.andrei
  • 187
  • 1
  • 2
  • 9
15
votes
3 answers

Spark : multiple spark-submit in parallel

I have a generic question about Apache Spark : We have some spark streaming scripts that consume Kafka messages. Problem : they are failing randomly without a specific error... Some script does nothing while they are working when I run them…
Taoma_k
  • 303
  • 2
  • 3
  • 9
15
votes
8 answers

Why does "hadoop fs -mkdir" fail with Permission Denied?

I am using Cloudera on a VM machine that I am playing around with. Unfortunately I am having issues copying data to the HDFS, I am getting the following: [cloudera@localhost ~]$ hadoop fs -mkdir input mkdir: Permission denied: user=cloudera,…
AAA
  • 2,388
  • 9
  • 32
  • 47
15
votes
4 answers

Install Hue without Cloudera

Has anyone tried/succeeded in installing Hue on Hadoop without Cloudera? I have gotten to a point where I can reliably reproduce a hadoop cluster with hbase and hive and can set it all up in about 15 minutes. I'd love to have Hue along with all this…
Aman Chawla
  • 704
  • 2
  • 8
  • 25
14
votes
2 answers

Using Phoenix with Cloudera Hbase (installed from repo)

I can get Phoenix working on a standalone Apache Hbase (note, all this is for Hbase 1.0.0 on RHEL6.5) For the Cloudera flavour of Hbase however I never get it working without it throwing Exceptions. (even tried RHEL7 minimal as en OS) The same thing…
Havnar
  • 2,558
  • 7
  • 33
  • 62
14
votes
7 answers

beeline not able to connect to hiveserver2

I have a CDH 5.3 instance. I start the hive-server2 by first starting the hive-metastore and then the hive-server from command line. After this I use beeline to connect to my hive-server2 but apparently it is not able to so. Could not open…
tinus91
  • 247
  • 1
  • 6
  • 22
14
votes
1 answer

Spark executor logs on YARN

I'm launching a distributed Spark application in YARN client mode, on a Cloudera cluster. After some time I see some errors on Cloudera Manager. Some executors get disconnected and this happens systematically. I would like to debug the issue but the…
Nicola Ferraro
  • 4,051
  • 5
  • 28
  • 60
13
votes
7 answers

HBase Error - assignment of -ROOT- failure

I've just installed hadoop and hbase from cloudera (3) but when I try to go to http://localhost:60010 it just sits there continually loading. I can get to the regionserver fine - http://localhost:60030... Looking at the master hbase server logs I…
NightWolf
  • 7,694
  • 9
  • 74
  • 121
13
votes
4 answers

No partition predicate found for Alias even when the partition predicate in present in the query

I have a table pos.pos_inv in hdfs which is partitioned by yyyymm. Below is the query: select DATE_ADD(to_date(from_unixtime(unix_timestamp(Inv.actvydt, 'MM/dd/yyyy'))),5), to_date(from_unixtime(unix_timestamp(Inv.actvydt,…
jeff
  • 157
  • 1
  • 2
  • 8
13
votes
4 answers

Installation failed. Failed to receive heartbeat from agent

I got this error Installation failed. Failed to receive heartbeat from agent. when I was installing cloudera on single node. This is what is in my /etc/hosts file: 127.0.0.1 localhost 192.168.2.131 ubuntu This is what is in my /etc/hostname…
user3613796
  • 161
  • 1
  • 1
  • 4
13
votes
4 answers

Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/hadoop/util/PlatformName

I know there have been many posts regarding this exception, but I am not able to fix this issue. Classpath has to be edited I think to resolve it. I am trying to run a program called DistMap in hadoop infrastructure. This is the error I am…
Ashwin
  • 577
  • 3
  • 6
  • 15
12
votes
1 answer

Import data from HDFS to HBase (cdh3u2)

I have Installed hadoop and hbase cdh3u2. In hadoop i have a file at the path /home/file.txt. it has the data like one,1 two,2 three,3 I want to import this file into hbase. in that, the first field should parsed as String, and 2nd field parsed as…
Nageswaran
  • 7,481
  • 14
  • 55
  • 74
12
votes
2 answers

Hadoop JobConf class is deprecated , need updated example

I am writing hadoop programs , and i really dont want to play with deprecated classes . Anywhere online i am not able to find programs with updated org.apache.hadoop.conf.Configuration class insted of org.apache.hadoop.mapred.JobConf class. …
CodeBanger
  • 201
  • 1
  • 3
  • 9