Questions tagged [cloudera-quickstart-vm]

Cloudera QuickStart VM contains a single-node Apache Hadoop cluster including Cloudera Manager, example data, queries, and scripts.

Cloudera QuickStart VM contains a single-node Apache Hadoop cluster including Cloudera Manager, example data, queries, and scripts. It is free software developed by Cloudera.

195 questions
0
votes
1 answer

Running spark examples on Cloudera VM 5.7 and

I am learning hadoop, machine learning and spark. I have downloaded Cloudera 5.7 Quick Start VM. I have also downloaded the examples from https://github.com/apache/spark as a zip file and copied them to the Cloudera VM. I have a challenge running…
0
votes
1 answer

Accessing RStudio server on Cloudera VM running on Ubuntu Host

What I would like to do: Access RStudio WebGUI, running on a Cloudera Quickstart VM on an Ubuntu Host from a OSX Browser. This is what works: 1. Ubuntu host is running Cloudera Quickstart VM 2. Cloudera VM has R and RStudio Server installed and…
0
votes
0 answers

How to read the file stored in cloudera home folder

I am new to python and spark and i have installed cloudera quickstart and I am trying to read a file which i created in cloudera's home folder which is present in the cloudera distribution desktop, how can i read the file? I tried using the below…
0
votes
2 answers

Error installing RODBC package

Iam trying to install RODBC on my Cloudera server but Iam getting an error when downloading the package... is there anyone that can tell me what Iam doing wrong? Below my logs: http://pastebin.com/s2b1y0Xr
M Beers
  • 71
  • 1
  • 7
0
votes
0 answers

I cannot access programmatically a file within a CDH 5.7 image running in vmware

I have a vmware cloudera image, cdh-5.7 running with centos6.8, i am using OS X as my development machine, and the cdh image to run the code. UPDATE This is the build.sbt that i am currently using, i just have updated spark version from official…
aironman
  • 837
  • 5
  • 26
  • 55
0
votes
2 answers

Exceptions when reading tutorial CSV file in the Cloudera VM

I'm trying to do a Spark tutorial that comes with the Cloudera Virtual Machine. But even though I'm using the correct line-ending encoding, I can not execute the scripts, because I get tons of errors. The tutorial is part of the Coursera…
Matthias
  • 5,574
  • 8
  • 61
  • 121
0
votes
1 answer

Copying data from cloudera cluster to google cloud hdfs cluster using distcp

I'm using cloudera quickstart vm. I started playing around with google cloud platform yesterday. I'm trying to copy data in cloudera hdfs to 1. google cloud storage (gs://bucket_name/) 2. google cloud hdfs cluster (using…
0
votes
0 answers

beeline and hive queries in metastore tables

I have cloudera quickstart 5.5 on linux red hat. I try create metastore tables by hue, but these tables were created in hive1 instead of hive2 too. So when I do queries by hive line, queries and show tables works, but in beeline (hive2) is empty…
0
votes
1 answer

Hive Server in Cloudera quickstart

I am trying to create a form and using servlets connect with hive´s tables. But I have some doubts: Is Hive server installed in cloudera quickstart? It is necessary other server like Tomcat for the servlet? Must I have the libraries in the IDE? Or…
0
votes
1 answer

where to set config values in cloudera hive setup?

I am new to Cloudera quickstart. As per the requirement, we need to partition the data of large hive tables. there is cap of 100 dynamic partition in hive. We need to increase number of dynamic partitions in the configurations. I don't want to set…
107
  • 552
  • 3
  • 26
0
votes
1 answer

How to get the exact query generated by the ODBC driver

We are connecting Hadoop cloudera CDH distribution through ODBC driver. Queries are generated from SSRS. Few queries are working fine with parameters augmented through ? placeholder. Few other queries with parameters augmented through ? are not…
107
  • 552
  • 3
  • 26
0
votes
0 answers

Additional Spark' installation's access to HDFS and Hive

I have Cloudera Quickstart VM 5.5.0 installed and it has Spark v1.5.0 bundled. When I launch spark-shell scala> val data = sc.textFile("/hdfs/path/file.csv") Spark was able to read from HDFS (proven by using data.first), even without the…
oikonomiyaki
  • 7,691
  • 15
  • 62
  • 101
0
votes
1 answer

trouble in adding spark-csv package in Cloudera VM

I am using Cloudera quickstart VM to test out some pyspark work. For one task, I need to add spark-csv package. And here is what I did: PYSPARK_DRIVER_PYTHON=ipython pyspark -- packages com.databricks:spark-csv_2.10:1.3.0 pyspark started up fine,…
0
votes
1 answer

Programming Sqoop: v1.4.x or v1.99.x

I'm using Cloudera Quickstart VM CDH 5.5.0 and I'm trying to do some custom Java code for Sqoop in a Maven project. However, I'm a bit confused about the two versions of Maven dependencies: org.apache.sqoop
oikonomiyaki
  • 7,691
  • 15
  • 62
  • 101
0
votes
1 answer

Oozie Shared Lib: where to place jars

I have installed Cloudera CDH QuickStart VM 5.5, and I'm running a Sqoop action in my Oozie workflow. I encountered an error that says MySQL JDBC driver is missing and I came across to a SO answer here that says the mysql-connector-java.jar should…
oikonomiyaki
  • 7,691
  • 15
  • 62
  • 101
1 2 3
12
13