Questions tagged [spark-jobserver]

spark-jobserver provides a RESTful interface for submitting and managing Apache Spark jobs, jars, and job contexts.

Reference: https://github.com/spark-jobserver/spark-jobserver

RealTime Example: https://nishutayaltech.blogspot.com/2016/05/how-to-run-spark-job-server-and-spark.html

165 questions
0
votes
1 answer

mkdir: cannot create directory ‘/mnt/var/log/spark-jobserver\r’: Permission denied

I was trying to deploy spark-jobserver on a EMR cluster, as per this documentation "https://github.com/spark-jobserver/spark-jobserver/blob/master/doc/EMR.md#configure-master-box" Was able to install the job-server on emr, but while starting the…
0
votes
1 answer

Run both batch and real time jobs on Spark with jobserver

I have a spark job that runs every day as part of a pipeline and perform simple batch processing - let's say, adding a column to DF with other column's value squared. (old DF: x, new DF: x,x^2). I also have a front app that consumes these 2…
0
votes
1 answer

Spark Job server context user change at run time

How can I change the user of the context created in Spark Job Server? I want to change the user which I am getting on sparkSession.sparkContext.sparkUser();
0
votes
1 answer

Spark Performance Monitoring

I have got a requirement to show the management/ Client that the executor-memory, number of cores, default parallelism, number of shuffle partitions and other configuration properties for running the spark job are not excessive or more than…
Amit
  • 9
  • 2
0
votes
1 answer

Spark JobServer can use Cassandra as SharedDb

I have been doing a research about Configuring Spark JobServer Backend (SharedDb) with Cassandra. And I saw in the SJS documentation that they cited Cassandra as one of the Shared DBs that can be used. Here is the documentation part: Spark…
0
votes
1 answer

How to import spark.jobserver.SparkSessionJob

I have added job-server 0.9.0 dependencies in build.sbt by add scalaVersion := "2.11.0" resolvers += "Job Server Bintray" at "https://dl.bintray.com/spark-jobserver/maven" libraryDependencies ++= Seq( "spark.jobserver" %% "job-server-api" %…
Ben
  • 1
  • 2
0
votes
1 answer

CONTEXT_ID added in SJS 0.9.0 is being set as null in the table

Im trying to catch up with the new SJS 0.9.0 in my application. Once after the context is created , I am trying to submit a job -> this happens 19/04/10 21:45:06 ERROR JobDAOActor: About to restart actor due to…
H.Aadhithya
  • 205
  • 1
  • 12
0
votes
1 answer

why spark always try to connect to localhost:9042?

When I build context I used following parameters: spark.cassandra.connection.host=somehosts&spark.cassandra.auth.username=app&spark.cassandra.auth.password=app more detail as…
user10864296
0
votes
0 answers

Spark job-server scala.MatchError while submitting WordCountExample

When I submit a job to the Spark job-server, I can see that the Spark context is created. However, there is an error in WebApi.getJobManagerForContext method: [2018-06-11 07:05:24,495] INFO ocalContextSupervisorActor [] [] - SparkContext…
Sameer
  • 4,379
  • 1
  • 23
  • 23
0
votes
0 answers

Node JS integration with spark job server

I'm trying to call spark job server API from node js. The API which is the python egg file does provide the count of nulls from the file. So once I call the API from the node, it is reaching the SJS server and the job starts which triggers…
0
votes
0 answers

Failed to create database "metastore_db" using spark-jobserver 0.8.1

I am upgrading my server to spark 2.3.0 and job-server 0.8.1-SNAPSHOT from spark 2.1.1 and job-server 0.8.0 (which were working fine). I am using the JobSqlDao with MySql and am using the SessionContextFactory to create a sqlContext. In local.conf,…
user1933178
  • 340
  • 3
  • 12
0
votes
1 answer

What setup is needed to use the Spark Cassandra Connector with Spark Job Server

I am working with Spark and Cassandra and in general things are straight forward and working as intended; in particular the spark-shell and running .scala processes to get results. I'm now looking at utilisation of the Spark Job Server; I have the…
ChrisHDog
  • 4,473
  • 8
  • 51
  • 77
0
votes
2 answers

spark job server api dependency not found. what can i do?

Im putting the following in my pom.xml file: spark.jobserver job-server-api_2.11 0.8.0 I tried to use other versions too but maven don't seem to…
user2014377
  • 141
  • 1
  • 1
  • 5
0
votes
1 answer

How to get percentage of job completion for a Spark Job?

I have been looking for a way to get percentage of Job completed for the corresponding job id. Right now, the Spark JobServer UI shows the corresponding status for a running job: { "duration": "Job not done yet", "classPath":…
Ani
  • 1
0
votes
1 answer

Why there is no support for sparkSession with namedObject in spark job server?

I am trying to build an application with spark job server API(for spark 2.2.0). But I found that there is no support for namedObject with sparkSession. my looks like: import com.typesafe.config.Config import org.apache.spark.sql.SparkSession import…
arglee
  • 1,374
  • 4
  • 17
  • 30