Questions tagged [spark-jobserver]

spark-jobserver provides a RESTful interface for submitting and managing Apache Spark jobs, jars, and job contexts.

Reference: https://github.com/spark-jobserver/spark-jobserver

RealTime Example: https://nishutayaltech.blogspot.com/2016/05/how-to-run-spark-job-server-and-spark.html

165 questions
2
votes
1 answer

Spark Jobserver max-jobs-per-context

How would you determine a safe max threshold value for the max-jobs-per-context setting, which controls the number of concurrent Spark jobs that are running on a context? What would happen if you go too high? The default is set to 8 (see link…
Lawrence
  • 21
  • 2
2
votes
1 answer

Deploying spark-jobserver to BlueMix Spark Node

I'm trying to deploy and connect instance of spark-jobserver in a docker container to BlueMix Spark service. Locally, container start perfectly with a command docker -d -p 8090:8090 {image-name}, but it looks like BlueMix ice -p command works…
Samuell Gretkins
  • 197
  • 1
  • 11
2
votes
0 answers

How do I fix this Exception when starting spark-job-server?

I am trying to start the spark-job-server on my linux machine. I did the following: Installed the cloudera distriution CDH(5.x) and gotten it up and running Downloaded spark-job-server from the above mentioned github Extracted the project into some…
Pradeep Poojari
  • 155
  • 1
  • 1
  • 9
2
votes
1 answer

how to cache data in apache spark that can be used by other spark job

I have a simple spark code in which I read a file using SparkContext.textFile() and then doing some operations on that data, and I am using spark-jobserver for getting output. In code I am caching the data but after job ends and I execute that…
Gourav
  • 1,245
  • 2
  • 10
  • 12
2
votes
1 answer

How to handle cassandra connections in spark job?

I am doing stress test on my spark application which uses spark cassandra connector as well as cassandra driver. In my application , I am using cassandra driver to select the most recent value from the C* table. This is working fine as long as the…
2
votes
2 answers

Cassandra Connection Issue with Spark-jobserver

I am new to Spark world and Job Server My Code : package spark.jobserver import java.nio.ByteBuffer import scala.collection.JavaConversions._ import scala.collection.mutable.ListBuffer import scala.collection.immutable.Map import…
2
votes
3 answers

spark jobserver ERROR classnotfoundexception

I have been trying spark using spark-shell. All my data is in sql. I used to include external jars using the --jars flag like /bin/spark-shell --jars /path/to/mysql-connector-java-5.1.23-bin.jar --master spark://sparkmaster.com:7077 I have…
1
vote
0 answers

How to find which input image/data is processed on which worker node in spark?

I have huge data (images) that uses machine learning model (CNN) to process image and gives results. As part of spark job performance, I'm trying to see internal spark (YARN) job flow. Spark UI shows list of Jobs, Stages - DAG, Executors and worker…
1
vote
1 answer

Unable to build Spark application with multiple main classes for Databricks job

I have a spark application that contains multiple spark jobs to be run on Azure data bricks. I want to build and package the application into a fat jar. The application is able to compile successfully. While I am trying to package (command: sbt…
1
vote
0 answers

Job submitted via Spark job server fails with error

I am using Spark Job Server to submit spark jobs in cluster .The application I am trying to test is a spark program based on Sansa query and Sansa stack . Sansa is used for scalable processing of huge amounts of RDF data and Sansa query is one of…
dileepVikram
  • 890
  • 4
  • 14
  • 30
1
vote
0 answers

Spark in-memory cache for metadata & block locations

In sake of Spark low latency jobs, Spark Job Server provides a Persistent Context option. But I'm not sure, does persistent context contains metadata, block locations & any other information required for query planning?. By default Spark should read…
VB_
  • 45,112
  • 42
  • 145
  • 293
1
vote
0 answers

sparkJobServer - running spark sql

It looks like spark.jobserver.context.SQLContextFactory is deprecated. Could somebody help me with the example on how to run Spark SQL with the latest ( 0.8) version of SparkJobServer. Thank you.
1
vote
0 answers

spark-jobServer 0.8.x SQL examples

I have installed spark-observer v 0.8.0 with Scala 2.11. I am able to run examples from job-server-tests. However Im unable to find any examples related to running SQL ( load some file/create temporary table and then run SQL against this table). Do…
1
vote
0 answers

Can we have spark-jobserver in windows?

Is it possible to run spark-jobserver in windows without using any emulator like cygwin ? I have tried gitbash as well, as I thought it supports .sh files but I didn't get any luck. Note : I have tried building the source code of spark-jobserver…
TechnocratSid
  • 2,245
  • 1
  • 16
  • 26
1
vote
0 answers

Getting java.lang.OutOfMemoryError: Java heap space in Spark job server

Getting java.lang.OutOfMemoryError: Java heap space in Spark job server logs and the job server goes down : [2017-06-01 19:09:26,708] ERROR akka.actor.ActorSystemImpl [] [ActorSystem(JobServer)] - Uncaught error from thread…
User_qwerty
  • 375
  • 1
  • 2
  • 10
1 2
3
10 11