Questions tagged [cloudera-manager]

Cloudera's end-to-end administration tool for Apache Hadoop, its ecosystem components, and related applications from Cloudera

From Cloudera Manager 5 Overview:

Cloudera Manager is an end-to-end application for managing CDH clusters. Cloudera Manager sets the standard for enterprise deployment by delivering granular visibility into and control over every part of the CDH cluster—empowering operators to improve performance, enhance quality of service, increase compliance and reduce administrative costs. With Cloudera Manager, you can easily deploy and centrally operate the complete CDH stack and other managed services. The application automates the installation process, reducing deployment time from weeks to minutes; gives you a cluster-wide, real-time view of hosts and services running; provides a single, central console to enact configuration changes across your cluster; and incorporates a full range of reporting and diagnostic tools to help you optimize performance and utilization.

Additional Information

Related Tags

380 questions
1
vote
0 answers

tasks distribution among nodes

how map tasks are distributed among nodes. Can i set nodes to run in parallel. for example if datanodes are 2, and map tasks are 4 which of them take the task? i see from the manager sometimes datanode takes the task and sometimes the other…
std
  • 13
  • 4
1
vote
1 answer

CDH Security using Kerberos and Microsoft Active Directory

I'm trying to secure my CDH cluster using Kerberos but I want to use the user info that I have in my Microsoft Active Directory. Is there a way to use MS AD as Kerberos user lookup source?
1
vote
2 answers

Cloudera-scm-server.log showing error at two places

I am trying to run Cloudera-Manager and it's giving me error given in following screenshots and marked with red pen. Can anybody help me resolve those error ??
Ankita Dhawale
  • 211
  • 1
  • 6
  • 17
1
vote
1 answer

How to install cloudera on top of apache hadoop 2.7.1

I am currently working on apache hadoop2.7.1, cluster includes 1 name node and 3 data nodes. Is it possible to install cloudera manager on existing apache hadoop 2.7.1 cluster. If yes, could you please suggest me how it can be done. Thanks in…
Khadhar Koneti
  • 189
  • 3
  • 14
1
vote
0 answers

How to connect Cloudera Manager to existing Spark cluster

I have following requirement: I need to provision both Cloudera Manager and Spark Cluster via Puppet but in a way that I need minimal (or none) configuration through Cloudera Manager UI afterwards. Ideal scenario that I'm looking for is…
Bakir Jusufbegovic
  • 2,806
  • 4
  • 32
  • 48
1
vote
0 answers

Host Inspector Failed

I am trying to install CDH5 on EC2 instances (Ubuntu). Every time I hit the page where host inspector starts to execute, it fails on the worker nodes saying IOException thrown while collecting data from host: Connection timed out. Further digging…
1
vote
1 answer

Cloudera Manager Health Issue: NameNode Connectivity, Web Server Status

Below is a snapshot of the health issues reported on CM. The datanodes in the list keep changing. Some errors from the datanode logs : 3:59:31.859 PM ERROR org.apache.hadoop.hdfs.server.datanode.DataNode …
scott
  • 235
  • 4
  • 12
1
vote
1 answer

New Installation using CM 5.5.1 urlopen error [Errno 22] Invalid argument

I am trying a brand new installation on a 3 node cluster running CentOS 6.6 using Cloudera Manager 5.5.1,however on step # 7 named "Installing Selected Parcels" , I get an error that says -…
user1965449
  • 2,849
  • 6
  • 34
  • 51
1
vote
1 answer

Installing cloudera manager in ubuntu 14.04/64b

I am installing Cloudera Manager in my system(14.04/64b). While installing at the final step, before finish installation I got some ERRORs in validation as shown below, errors in above page are, ERROR 1 Individual hosts resolved their own…
A J
  • 492
  • 1
  • 7
  • 24
1
vote
1 answer

cloudera manager vs Apache

cloudera distribution is using cloudera manager for its automated installation and maintenance activities in a cluster like that Horton works distribution is using Ambari for same purpose in windows,can anyone tell what is the tool that Apache…
user2631600
  • 759
  • 1
  • 11
  • 18
1
vote
0 answers

Number of Executors is less than what is assigned for a Spark job

I am having two hadoop clusters containing 15(big) and 3(small) nodes respectively. Both are managed by cloudera manager. I am running a Spark job using yarn setting --num-executors to 6. The Spark UI of the big cluster is showing the 6 executors,…
Chandan
  • 764
  • 2
  • 8
  • 21
1
vote
1 answer

How to auto configure Cloudera Manager?

I have created a few scripts that allow me to automate a vast majority of installing CDH5 & Cloudera Manager v5. The issue is that each time the cluster is spun up, I have to configure cloudera manager via the GUI wizard. How would one configure…
1
vote
0 answers

Cloudera Manager add services failed with ArithmeticException

I've installed Cloudera Manager in a three-machine cluster. While I added CDH5 services (including Hive,Spark,Yarn...) to cluster, it then failed with "ArithmeticException: / by zero". version: Cloudera Enterprise Data Hub Edition trail 5.4.5 (#5…
Joson_prog
  • 11
  • 3
1
vote
2 answers

Cloudera 5.4.4 Cluster - Getting aggregate usage metrics

I would like to collect aggregate usage metrics from a Cloudera 5.4.4 Hadoop cluster. Some of the metrics in my mind are as below: Average CPU utilization of the cluster per day/ per week Top n longest running jobs/queries on Hadoop Top n users who…
activelearner
  • 7,055
  • 20
  • 53
  • 94
1
vote
0 answers

HDFS DataNode - Decomissioned node marked as live

I have a 8 node hadoop cluster (CDH 5.2.0) of which i'm using all the nodes as datanodes. I observed this rather peculiar behavior where no writes are happening on one of the data nodes and the disk activity is non existent for the past few weeks,…
ManojVenkat
  • 474
  • 7
  • 13