Questions tagged [cloudera-cdh]

For questions specifically about Cloudera's Distribution of Apache Hadoop (CDH). Please look at https://community.cloudera.com/ before posting questions.

From cloudera.com - CDH Components:

CDH is Cloudera’s 100% open source platform distribution, including Apache Hadoop and built specifically to meet enterprise demands. CDH delivers everything you need for enterprise use right out of the box. By integrating Hadoop with more than a dozen other critical open source projects, Cloudera has created a functionally advanced system that helps you perform end-to-end Big Data workflows.

Key Projects:

  • Apache Hadoop (Core)
  • Apache Accumulo
  • Apache Flume
  • Apache HBase
  • Apache Hive
  • Hue
  • Apache Impala (incubating)
  • Apache Kafka
  • Apache Pig
  • Apache Sentry
  • Cloudera Search
  • Apache Spark
  • Apache Sqoop

Resources:

  • CDH5 - archives - CDH5 packages and parcels
  • Documentation - official documentation
  • Blogs - engineering blogs with useful tutorials and in-depth explanations of Hadoop functionality
  • Community Forums - questions and answers from the CDH community

1018 questions
0
votes
1 answer

Retry Oozie workflow action in Hue (CDH4)

How do I set up retries for failed Oozie workflow actions in Hue (CDH 4.6)? I didn't see any option to do that in the workflow editor. Oozie log: 2014-05-15 14:30:01,784 INFO org.apache.oozie.command.wf.ActionStartXCommand: USER[test] GROUP[-] TOKEN[] APP[Test]…
user2800129
  • 63
  • 1
  • 8
0
votes
1 answer

Writing MapReduce job to concurrently download files?

I'm not sure if this is a suitable use case for MapReduce: part of the Oozie workflow I'm trying to implement is to download a series of files named with sequential numbers (e.g. 1 through 20). I wanted those files to be downloaded simultaneously (5…
aiman86
  • 71
  • 8
0
votes
1 answer

Unable to connect to database on host 'http://namenode.tarun.com:7432' from host 'namenode.tarun.com

I have installed Cloudera Manager Server and got an error while configuring the embedded database setup. The error is given below: Unable to connect to database on host 'http://namenode.tarun.com:7432' from host 'namenode.tarun.com' using the…
tarun verma
  • 221
  • 3
  • 5
  • 14
0
votes
1 answer

What performance can I expect from Hive and Hadoop?

I'm trying to implement a solution with Hadoop using Hive on CDH 5.0 with YARN. My architecture is 1 NameNode and 3 DataNodes. I'm querying ~123 million rows with 21 columns. My nodes are virtualized with 2 vCPUs @2.27 and 8 GB RAM. So I tried…
Junayy
  • 1,130
  • 7
  • 13
0
votes
2 answers

Oozie web console is very slow

In my cluster the Oozie web console is very slow. When I go to the web console, it takes 10-15 seconds to pass the request to the Oozie server and another 10-15 seconds to display the result returned by the Oozie server. The same works fine in Hue. Has any…
vishnu
  • 451
  • 4
  • 18
0
votes
1 answer

Which version of CDH automatically installs JDK 1.7 when using Cloudera Manager?

I am using Cloudera Manager with CDH 4.2.2 for my 3+1 cluster. On starting the installation with Cloudera Manager, it automatically downloads and installs JDK 1.6. I want to use JDK 1.7 with CDH for my convenience. Is it possible or is there any…
Debashisenator
  • 1,621
  • 4
  • 17
  • 16
0
votes
2 answers

Sqoop import from Couchbase to Hadoop

On Ubuntu, using Couchbase 2.5.1, Cloudera CDH4, the Hadoop plugin for Couchbase and Oracle JDK 6. Everything installs fine (seemingly), and I can work with Hadoop and Couchbase independently with no problems, but when I try to use the plugin as…
Philip O'Brien
  • 4,146
  • 10
  • 46
  • 96
0
votes
1 answer

R 2.15.1; Rhipe 0.73; CDH 5 - java.io.IOException: No FileSystem for scheme: hdfs

I installed CDH5 on Debian 7. I have a working Hadoop with MapReduce1 in single-node pseudo-distributed mode. I would like to run Rhipe inside R. After downloading it, I installed it: sudo R CMD INSTALL Rhipe_0.73.1.tar.gz Then I exported…
Damian Melniczuk
  • 393
  • 6
  • 18
0
votes
2 answers

Querying Impala from Tomcat throws Invalid URL exception

I am trying to connect to Impala and run a query from my web application. Here is my jsp code: <%@page contentType="text/html" pageEncoding="UTF-8"%> <%@ page import="java.sql.SQLException" %> <%@ page import="java.sql.Connection" %> <%@ page…
Ramtin
  • 135
  • 1
  • 2
  • 9
0
votes
1 answer

CDH4 - Mixing 64-bit and 32-bit machines

I have a small cluster of six 64-bit machines installed with Cloudera Manager (CM), which works perfectly. I want to add a set of 32-bit machines as DataNodes to this cluster. According to the CM doc it is not possible to install 32-bit machines…
lizzie
  • 1,506
  • 1
  • 18
  • 31
0
votes
2 answers

Hive table created, however map task is failing with exception

I have created a tweets table in Hive (hive-0.10.0 on CDH 4.6), but I am stuck with the error below when I try to execute a query: hive> select * from tweets where id='442990122795929600'; Total MapReduce jobs = 1 Launching Job 1 out of 1 Number of reduce tasks is…
0
votes
1 answer

trying to deploy 5 nodes on virtual

I'm trying to install Hadoop on 5 nodes with VirtualBox. I have been through many websites, but none of them worked properly. I also used this website…
0
votes
1 answer

Running a simple MR job on CDH4

I'm trying to run a simple MR job using CDH4. I'm getting the strangest error and I don't know why. Basically my program reads a file, uses an identity mapper, and then the reducer simply emits a key and a string for the value. I don't understand…
Tucker
  • 7,017
  • 9
  • 37
  • 55
0
votes
1 answer

Cloudera distribution of Hadoop not getting installed

I am installing CDH4 by following the Cloudera installation manual, but I am not able to install the JobTracker and TaskTracker using apt-get. Can someone help me out? hduser@Dell:~$ sudo apt-get install…
AjayLohani
  • 872
  • 1
  • 6
  • 26
0
votes
1 answer

Cloudera CDH4 installation

I see the step below in the CDH4 MRv1 installation instructions at: http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH4/4.2.0/CDH4-Quick-Start/cdh4qs_topic_3_2.html Step 4: Create the MapReduce system directories: sudo -u hdfs hadoop fs…
sab
  • 9,767
  • 13
  • 44
  • 51