Questions tagged [cloudera]

Cloudera Inc. is a Palo Alto-based enterprise software company which provides Apache Hadoop-based software and services.

Cloudera, the commercial Hadoop company, develops and distributes Hadoop, the open source software that powers the data processing engines of the world’s largest and most popular websites.

Cloudera's Distribution including Apache Hadoop (CDH) is a free package built from the powerful, flexible, scalable Apache Hadoop software. To help you learn about Hadoop and how to use it, Cloudera offers public and private training, certification and online courseware.

Useful Links

Related Tags

2533 questions
0
votes
0 answers

How to grant nifi access to write to directory

Am trying to use NiFi's putFile processor, but it keeps thrwoing AccessDeninedException how can I grant Nifi access to the targeted directory ?
Exorcismus
  • 2,243
  • 1
  • 35
  • 68
0
votes
0 answers

HBase Table.batch() doesn't respond when running in a reducer

I am using CDH5.16.1 version of hbase-client. I have a MapReduce job set up to read some avro files and batch write their content into HBase. My setup includes a Zookeeper quorum, HBase Thrift server, one HBase master and a bunch of…
return 0
  • 4,226
  • 6
  • 47
  • 72
0
votes
0 answers

How to get total cluster space quota in Cloudera distribution

How to get total cluster space quota in Cloudera distribution. One point here is, some of the existing directories didn't set space quota.
0
votes
0 answers

Cloudera Namervice proxy setup with existing nameservice with HA

Repurpose the existing nameservice for the NameNode Proxies, and create a new nameservice to represent the NameNodes of the previous nameservice.
nesinor
  • 1,514
  • 1
  • 10
  • 20
0
votes
2 answers

How to configure ‘‑‑enable_orc_scanner’ to ture in cloudera manager 6.3, my impala version is 3.2

In cdh6.3.0, the version of impala is 3.2. Impala can support files in ORC format, but you need to set ‑enable_orc_scanner to true. How do I set this property in the Cloudera manager console?
liuli
  • 3
  • 2
0
votes
1 answer

Nifi + Hortonworks Schema Registry + Kerberos: 401 Authentication required

I am ussing Apache Nifi 1.7 and I'm trying to use a RecordWritter that uses Hortonwors Schema Registry service controller to read schema metadata. However, this controller service doesn't have any KErberos configuration properties like "KErberos…
dhalfageme
  • 1,444
  • 4
  • 21
  • 42
0
votes
2 answers

Hive [Error 10025]: Expression not in GROUP BY key name

I'm trying to select records defined by key-terms in a variable within a group. name is a string with key terms interested. groups are defined by the combo of id1 and id2. I'm interested in extract records by groups that contain the key…
lydias
  • 841
  • 1
  • 14
  • 32
0
votes
1 answer

Transpose dataset in Hive

I'm trying to transpose a variable in Hive such as: Id1 Id2 Event 1 1 7 2 2 3 2 2 7 to Id1 Id2 Event_7 Event_3 1 1 1 2 2 1 1 Following is what I have so far: create temporary table event_trans as …
lydias
  • 841
  • 1
  • 14
  • 32
0
votes
2 answers

HBase MasterProcWALs issue

I noticed that due to some ongoing bug, the Hbase MasterProcWALs folder has filled up my Hdfs. I wanted to know if removing the files under the MasterProcWALs folder will remove any of the data in Hbase?
etp
  • 23
  • 5
0
votes
1 answer

Submitting pyspark job programmatically without using submit pyspark, in python

I want to submit spark job from my local system, to remote server where cloudera is installed(Spark on YARN). Have tried all the possibilities. Trying to create SparkSession and SparkContext The following is the…
sridar1992
  • 21
  • 2
0
votes
0 answers

Hbase reading freezes for a few records when reading with partial rowkey

I am reading data from HBase through spark. The code runs fine when reading the data using a prefix filter with a complete rowkey or using GET, but it freezes if I use a partial prefixed rowkey. The rowkey structure is md5OfAkey_Akey_txDate_someKey.…
0
votes
1 answer

HDP 2.6.5 on Windows 10 Hyper-v

I am trying to run HDP 2.6.5 on Windows 10 laptop using Hyper V Manager. I downloaded the VirtualBox version and extracted the .vmdk and converted it to .vhd file using VirtualBox software. & 'C:\Program Files\Oracle\VirtualBox\VBoxManage.exe'…
user691197
  • 927
  • 6
  • 20
  • 38
0
votes
0 answers

Java Consumer for Kafka in Cloudera Quickstart not working

I have a cloudera Quickstart VM. i have installed Kafka parcels using Cloudera Manager and its working fine inside the VM using console based consumer and producer. But when i try to use java based consumer it does not produce or consume messages.…
0
votes
1 answer

yarn application accepted but not running cloudera despite resource allocation

I am using a Cloudera quickstart VM 5.13.0.0 to run Spark applications in yarn-client mode. I have allocated 10GB and 3 cores to my Cloudera VM. When I submit the application, the application is ACCEPTED but never moves on to RUNNING. When I try to…
0
votes
1 answer

Tags could not be loaded in table details of Cloudera Hue

I'm using Hue to access Cloudera Impala. When I open the table browser and view details of an arbitrary table, I get the error "Tags could not be loaded." How can I get the tags to load? Alternatively, is there a way to query the metadata and…