Questions tagged [cloudera-quickstart-vm]

Cloudera QuickStart VM contains a single-node Apache Hadoop cluster including Cloudera Manager, example data, queries, and scripts.

Cloudera QuickStart VM contains a single-node Apache Hadoop cluster including Cloudera Manager, example data, queries, and scripts. It is free software developed by Cloudera.

195 questions
1
vote
2 answers

SparkSQL-Scala with POM

I have some problem with Cloudera VM and Spark. First of all, I'm completely new on Spark, and my boss asked to me to run Spark on Scala in a Virtual Machine for some test. I have downloaded the Virtual Machine on Virtual Box environment, so I open…
1
vote
1 answer

Error running Kafka on Cloudera quickstart: assembly-package-dependency not valid

I have downloaded Kafka from apache and extracted it to its own folder. Following the quickstart, I also installed sbt, but at the third line in the sbt commands (I am launching the terminal from INSIDE the kafka folder, I get: [error] Not a valid…
Vale
  • 1,104
  • 1
  • 10
  • 29
1
vote
0 answers

Hadoop- could not compile java classes

I modified a file named compile.sh, I changed exactly the paths of .jar files. This is the compile.sh original file: rm dmtl.jar javac -classpath…
jozef
  • 11
  • 3
1
vote
2 answers

create table in hive is stuck using cloudera quickstart vm

I am trying to create a table in hive using hive CLI of cloudera quickstart vm. The command is stuck and even after a long time does not happen anything. Am i missing anything. Appreciate if anyone can help on this please. [cloudera@quickstart ~]$…
Naga
  • 51
  • 3
1
vote
1 answer

cloudera - Failed to execute command Start on service Accumulo 1.6

I've been pulling my hair out trying to install accumulo on a cloudera quickstart VM (i've found quickstart to be anything but). I'm attempting to install it via cloudera manager (which I thought would have been a lot more plug and play) however at…
jhole89
  • 718
  • 9
  • 28
1
vote
1 answer

How to update MLLIB version in PySpark

I have installed Cloudera VM and hence it has PySpark with MLLIB library, but the ML library MLLIB is too old, I just wanted to upgrade it with latest version of MLLIB, Already updated the python from 2.6 to 2.7, but unable to find any documentation…
1
vote
1 answer

Not able to access HDFS

I installed cloudera vm and started trying some basic stuff. First I just wanted to ls the hdfs directoires. so I issued the below command. [cloudera@quickstart ~]$ hadoop fs -ls / ls: Failed on local exception: java.net.SocketException: Network is…
Pa1
  • 861
  • 1
  • 7
  • 6
1
vote
1 answer

Oozie job expiring on Java action when writing to HDFS

I have an Oozie coordinator that runs a workflow every hour. The workflow is composed of two sequential actions: a shell action and a Java action. When I run the coordinator, the shell action seems to execute successfully, however, when it's time…
oikonomiyaki
  • 7,691
  • 15
  • 62
  • 101
1
vote
1 answer

Hipi Average no of pixels program fails due to yarn containers error in cloudera quickstart vm

I am newbie to HIPI/Hadoop so I choose cloudera quickstart vm (5.4.2). I am following getting started tutorial to do that. While running I find that I have files in hib file as shown below: [cloudera@quickstart tools]$ ./hibInfo.sh…
RUKNA
  • 21
  • 6
1
vote
2 answers

Cloudera Quickstart VM illegalArguementException: Wrong FS: hdfs: expected: file:

I have a simple java code to copy a text file from my local to the hdfs. I am using cloudera's quickstart virtual machine. Configuration conf = new Configuration(); conf.addResource(new Path("/etc/hadoop/conf/core-site.xml")); conf.addResource(new…
Free Man
  • 195
  • 3
  • 13
1
vote
4 answers

Oozie job won't run if using PySpark in SparkAction

I've encountered several examples of SparkAction jobs in Oozie, and most of them are in Java. I edit a little and run the example in Cloudera CDH Quickstart 5.4.0 (with Spark version 1.4.0). workflow.xml
oikonomiyaki
  • 7,691
  • 15
  • 62
  • 101
1
vote
1 answer

Adding worker to Spark Standalone CDH5.3

I have a cloudera cdh5.3 quickstart running on a VM. I am having problems with running Spark. I have gone through those steps http://www.cloudera.com/content/cloudera/en/documentation/core/latest/topics/cdh_ig_spark_configure.... and run the word…
0
votes
0 answers

What is my problem with join mapper code?

I'm trying to practice joining data using mapreduce, but when i run this line cat join1_File*.txt | ./join1_mapper.py | sort | ./join1_reducer.py it displays this erorr : Traceback (most recent call last): File "./join1_mapper.py", line 24, in…
0
votes
0 answers

getting error while loading data in a table using hive

Am getting error as "analysisException: Syntax error in line 1: undefined: LOAD DATA LOCAL INPATH'/home/cloudera...Encounttered:IDENTIFIER Expected:INPATH CAUSED BY:Exception:Syntax error" My code is LOAD DATA LOCAL INPATH…
0
votes
1 answer

cloudera sandbox - docker run error mac m1

I am unable to run the docker container on mac m1 Please note i was not able to run the below docker container Reference https://hub.docker.com/r/cloudera/sandbox-cdh my system configuration ram 16 gb sdd - 250 gb and free space is 150 gb command…
Data2explore
  • 452
  • 6
  • 16