Questions tagged [apache-spark-1.2]

Use for questions specific to Apache Spark 1.2 For general questions related to Apache Spark use the tag [apache-spark].

5 questions
6
votes
3 answers

How to encode categorical features in Apache Spark

I have a set of data based on which I want to create a classification model. Each row has the following form: user1,class1,product1 user1,class1,product2 user1,class1,product5 user2,class1,product2 user2,class1,product5 user3,class2,product1 There…
user706838
  • 5,132
  • 14
  • 54
  • 78
5
votes
4 answers

Cluster hangs in 'ssh-ready' state using Spark 1.2.0 EC2 launch script

I'm trying to launch a standalone Spark cluster using its pre-packaged EC2 scripts, but it just indefinitely hangs in an 'ssh-ready' state: ubuntu@machine:~/spark-1.2.0-bin-hadoop2.4$ ./ec2/spark-ec2 -k -i .pem -r us-west-2…
nmurthy
  • 1,337
  • 1
  • 12
  • 24
2
votes
2 answers

How to connect Spark-Notebook to Hive metastore?

This is a cluster with Hadoop 2.5.0, Spark 1.2.0, Scala 2.10, provided by CDH 5.3.2. I used a compiled spark-notebook distro It seems Spark-Notebook cannot find the Hive metastore by default. How to specify the location of hive-site.xml for…
Rex
  • 2,097
  • 5
  • 16
  • 18
1
vote
1 answer

Spark 1.2.1 fails to compile the assembly project

Just downloaded Spark 1.2.1 and it fails to compile in the assembly project with the following error: The requested profile "hadoop-2.6" could not be activated because it does not exist. [ERROR] Failed to execute goal on project spark-assembly_2.10:…
bsam
  • 1,838
  • 3
  • 20
  • 26
0
votes
1 answer

SPARK 1.2.0 SQLContext need conditional value like case condition in oracle

I am using spark 1.2.0 with python. My problem is that in a sql query if the value of a field is zero , i need to replace it by some other value. I have tried case /coalese which works for 1.4.0 but not for 1.2.0 case when COALESCE("+fld+",0)=0 then…
P RAY
  • 353
  • 1
  • 2
  • 12