Use for questions specific to Apache Spark 1.2 For general questions related to Apache Spark use the tag [apache-spark].
Questions tagged [apache-spark-1.2]
5 questions
6
votes
3 answers
How to encode categorical features in Apache Spark
I have a set of data based on which I want to create a classification model. Each row has the following form:
user1,class1,product1
user1,class1,product2
user1,class1,product5
user2,class1,product2
user2,class1,product5
user3,class2,product1
There…

user706838
- 5,132
- 14
- 54
- 78
5
votes
4 answers
Cluster hangs in 'ssh-ready' state using Spark 1.2.0 EC2 launch script
I'm trying to launch a standalone Spark cluster using its pre-packaged EC2 scripts, but it just indefinitely hangs in an 'ssh-ready' state:
ubuntu@machine:~/spark-1.2.0-bin-hadoop2.4$ ./ec2/spark-ec2 -k -i .pem -r us-west-2…

nmurthy
- 1,337
- 1
- 12
- 24
2
votes
2 answers
How to connect Spark-Notebook to Hive metastore?
This is a cluster with Hadoop 2.5.0, Spark 1.2.0, Scala 2.10, provided by CDH 5.3.2. I used a compiled spark-notebook distro
It seems Spark-Notebook cannot find the Hive metastore by default.
How to specify the location of hive-site.xml for…

Rex
- 2,097
- 5
- 16
- 18
1
vote
1 answer
Spark 1.2.1 fails to compile the assembly project
Just downloaded Spark 1.2.1 and it fails to compile in the assembly project with the following error:
The requested profile "hadoop-2.6" could not be activated because it does not exist.
[ERROR] Failed to execute goal on project spark-assembly_2.10:…

bsam
- 1,838
- 3
- 20
- 26
0
votes
1 answer
SPARK 1.2.0 SQLContext need conditional value like case condition in oracle
I am using spark 1.2.0 with python.
My problem is that in a sql query if the value of a field is zero , i need to replace it by some other value.
I have tried case /coalese which works for 1.4.0 but not for 1.2.0
case when COALESCE("+fld+",0)=0 then…

P RAY
- 353
- 1
- 2
- 12