Questions tagged [cloudera-cdh]

For questions specifically about Cloudera's Distribution of Apache Hadoop (CDH). Please look at https://community.cloudera.com/ before posting questions.

From cloudera.com - CDH Components:

CDH is Cloudera’s 100% open source platform distribution, including Apache Hadoop and built specifically to meet enterprise demands. CDH delivers everything you need for enterprise use right out of the box. By integrating Hadoop with more than a dozen other critical open source projects, Cloudera has created a functionally advanced system that helps you perform end-to-end Big Data workflows.

Key Projects:

  • Apache Hadoop (Core)
  • Apache Accumulo
  • Apache Flume
  • Apache HBase
  • Apache Hive
  • Hue
  • Apache Impala (incubating)
  • Apache Kafka
  • Apache Pig
  • Apache Sentry
  • Cloudera Search
  • Apache Spark
  • Apache Sqoop

RESOURCES

  • CDH5 - archives - CDH5 packages and parcels
  • Documentation - official documentation
  • Blogs - engineering blogs with useful tutorials and in-depth explanations of Hadoop functionality
  • Community Forums - questions and answers from the CDH community

Related Tags

1018 questions
8
votes
3 answers

hadoop namenode port in use

This is actually a standby HA namenode. It was configured with the same settings as the primary and hdfs namenode -bootstrapStandby was successfully run. It begins coming up on the standard HTTP port 50070 as defined in the config…
Bill Warner
  • 639
  • 6
  • 18
8
votes
1 answer

Deleting files from HDFS does not free up disk space

After upgrading our small Cloudera Hadoop cluster to CDH 5, deleting files no longer frees up available storage space. Even though we delete more data than we add, the file system keeps filling up. Cluster setup We are running a four node cluster on…
knutn
  • 113
  • 1
  • 1
  • 6
7
votes
2 answers

Impala: Show tables like query

I am working with Impala and fetching the list of tables from the database with some pattern like below. Assume i have a Database bank, and tables under this database are like…
Manindar
  • 999
  • 2
  • 14
  • 30
7
votes
1 answer

Oozie and Job History Server configuration problems

Problem I'm trying to install psuedo-distributed CDH without the use of CDM. Everything "works" via the console. However, the second I begin using Hue, I receive an error when trying to work with Pig. The error shown in Hue is: JA017: Could not…
coatless
  • 20,011
  • 13
  • 69
  • 84
7
votes
1 answer

Is it possible to concat a string field after group by in Hive

I am evaluating Hive and need to do some string field concatenation after group by. I found a function named "concat_ws" but it looks like I have to explicitly list all the values to be concatenated. I am wondering if I can do something like this…
kee
  • 10,969
  • 24
  • 107
  • 168
7
votes
4 answers

How to change the version of Java that CDH uses

I currently have CDH 5 installed on CentOS 6.5 with java jdk1.7 and I am trying to get CDH to use jdk1.8. I do know that Java 1.8 is not a recommended version of CDH, but it is only a test cluster, so it isn't a big deal. I have installed both…
milk3422
  • 660
  • 6
  • 10
7
votes
4 answers

Namenode HA (UnknownHostException: nameservice1)

We enable Namenode High Availability through Cloudera Manager, using Cloudera Manager >> HDFS >> Action > Enable High Availability >> Selected Stand By Namenode & Journal Nodes Then nameservice1 Once the whole process completed then Deployed…
roy
  • 6,344
  • 24
  • 92
  • 174
6
votes
1 answer

Which Distribution CDH Vs HDP

I happened to work on CDH longtime back ( around 1 year) and am planning to start again.Now we had CDH , HDP and Hortonwork acquired by Cloudera . Is HDP being developed actively ? Or Is CDH being developed actively ? Which distribution I should…
6
votes
2 answers

hdfs moveFromLocal does not distribute replica blocks across data nodes

I recently upgraded my Cloudera environment from 5.8.x (hadoop 2.6.0, hdfs-1) to 6.3.x (hadoop 3.0.0, hdfs-1) and after some days of data loads with moveFromLocal, i just realized that the DFS Used% of datanode server on which i execute…
belce
  • 203
  • 2
  • 5
6
votes
0 answers

Lookup Hbase Tbl from UDF (Beeline , Hbase,Delegation Tokens)

I have a requirement to write Custom UDF for data lookup from Hbase Table . NOTE : I have done Unit Testing with HIVE . It seems to be working . But when I use the same UDF Beeline, Its failed . By default Cloudera restricts impersonation and…
Govind
  • 419
  • 8
  • 25
6
votes
7 answers

java.lang.UnsatisfiedLinkError: org.apache.hadoop.io.nativeio.NativeIO$Windows.createDirectoryWithMode0

I cannot solve this exception, I've read the hadoop docu and all related stackoverflow questions that I could find. My fileSystem.mkdirs(***) throws: Exception in thread "main" java.lang.UnsatisfiedLinkError:…
jasie
  • 2,192
  • 10
  • 39
  • 54
6
votes
1 answer

pig is not visible inside hue

I have a hadoop cluster. Pig is installed: But the pig editor is not visible inside hue (3.7): How can I fix it?
rom
  • 3,592
  • 7
  • 41
  • 71
6
votes
2 answers

Immediate evaluation of CTE

I am trying to optimize a very long and complex impala query which contains multiple CTE. Each CTE is used multiple times. My expectation is that once a CTE is created, I should be able to direct impala that results of this CTE should be re-used in…
AYK
  • 3,312
  • 1
  • 17
  • 30
6
votes
2 answers

Running from a local IDE against a remote Spark cluster

We have a kerberized cluster with Spark running on Yarn. At the moment, we write our Spark code in Scala locally, then build a fat JAR which we copy over to the cluster and then run spark-submit. I would instead like to write Spark code on my local…
Vishakh
  • 1,168
  • 1
  • 11
  • 20
6
votes
1 answer

How to check whether the file exist in HDFS location, using oozie?

How to check whether a file in HDFS location is exist or not, using Oozie? In my HDFS location I will get a file like this test_08_01_2016.csv at 11PM , on a daily basis. I want check whether this file exist after 11.15 PM. I can schedule the…
Sai
  • 1,075
  • 5
  • 31
  • 58
1
2
3
67 68