Questions tagged [spark-operator]

39 questions
2
votes
1 answer

Helm install spark-operator tries to download a version that does not exist and cannot be forced to use the correct one

I am trying to install ANY working version of spark-operator https://console.cloud.google.com/gcr/images/spark-operator/GLOBAL/spark-operator?tag=v1beta2-1.3.1-3.1.1 on my local kubernetes. However, spark pod is stuck on ImagePullBackOff trying to…
2
votes
2 answers

Argo Workflow + Spark operator + App logs not generated

I am in the very early stages of exploring Argo with the Spark operator to run Spark samples on a minikube setup on my EC2 instance. The resource details follow; I am not sure why I am not able to see the Spark app logs. WORKFLOW.YAML kind:…
Gurupraveen
  • 181
  • 1
  • 13
2
votes
1 answer

spark-operator logging in k8s

I am using spark-operator for Kubernetes. https://github.com/GoogleCloudPlatform/spark-on-k8s-operator I am able to run the jobs successfully, but monitoring and troubleshooting them is difficult because the pods are dynamic. I want…
Sumit G
  • 436
  • 8
  • 21
1
vote
0 answers

spark streaming from kafka on spark operator (Kubernetes)

I have a spark structured streaming job in scala, reading from kafka and writing to S3 as hudi tables. Now I am trying to move this job to spark operator on EKS. When I give the option in the yaml file. spark.jars.packages:…
1
vote
0 answers

JMX exporter & spark-on-k8s-operator

I'm trying to submit a spark application using spark operator and to expose metrics using JMX exporter. I'm using Spark 3.1.1 & spark operator v1beta2-1.3.3-3.1.1. Here is a snippet from the configuration. monitoring: exposeDriverMetrics: true …
1
vote
2 answers

How to deploy a Beam Application on an already deployed Spark application on Kubernetes?

I have already deployed Spark on Kubernetes, below is the deployment.yaml, apiVersion: "sparkoperator.k8s.io/v1beta2" kind: SparkApplication metadata: name: pyspark-pi namespace: default spec: type: Python pythonVersion: "3" mode: cluster …
user19930511
  • 299
  • 2
  • 15
1
vote
0 answers

Failed to connect to spark-master:7077

I am trying to deploy my spark application on Kubernetes. I followed the below steps: Installed spark-kubernetes-operator: helm repo add spark-operator https://googlecloudplatform.github.io/spark-on-k8s-operator helm install gcp-spark-operator…
1
vote
0 answers

Spark Streaming on Kubernetes - Executor Pods not Restarted/Rescheduled

We are running DStream applications on a Kubernetes cluster using Spark Operator (Spark 2.4.7). Sometimes, for various reasons (OOMs, Kubernetes node restarts), executor pods are lost, and while many times Spark sees this and reschedules a…
eugen-fried
  • 2,111
  • 3
  • 27
  • 48
1
vote
1 answer

aws libraries compatible with spark 3.1.1

I have a spark operator with sparkVersion: "3.1.1" and would like to use it for structured streaming to/from minIO. However, I have not been able to find a compatible combination of libraries for anything newer than hadoop 2.7.0 (which does not…
1
vote
0 answers

spark streaming - waiting for a dead executor

I have a spark streaming application running inside a k8s cluster (using spark-operator). I have 1 executor, reading batches every 5s from a Kinesis stream. The Kinesis stream has 12 shards, and the executor creates 1 receiver per shard. I gave it…
Malki
  • 2,335
  • 8
  • 31
  • 61
1
vote
1 answer

executor.CoarseGrainedExecutorBackend (Logging.scala:logInfo(57)) - Driver commanded a shutdown - how I can debug on driver side?

I'm getting these logs from the executor (beginning at the bottom): 2021-11-30 21:44:42 2021-11-30 18:44:42,911 INFO [shutdown-hook-0] util.ShutdownHookManager (Logging.scala:logInfo(57)) - Deleting directory…
Eljah
  • 4,188
  • 4
  • 41
  • 85
1
vote
1 answer

Spark-operator on EKS Apache spark failed to create temp directory

I am trying to deploy a simple spark-pi.yaml onto AWS EKS using spark-operator. I have successfully deployed spark-operator (refer to the deployment YAML here: spark-operator example). I am getting the following error when I do helm install Events: Type …
1
vote
0 answers

Spark on K8s: UnknownHostException when spark app is trying to resolve DNS of another pod in a different namespace on the same cluster

I am able to execute SparkPi in k8s and have deployed it in GKE as well. But when I try to broadcast the PI value to my microservice, which is at toys-broadcast-svc.toys.svc.cluster.local, I am unable to resolve DNS (getting UnknownHostException). Can…
1
vote
0 answers

Kubernetes Spark Operator: Configuring Automatic Application Restart and Failure Handling

On Kubernetes, a container restart policy can be configured with the optional field .spec.restartPolicy, which can be set to type: OnFailure. I read here that there is a cap of 300 secs (5 mins) on the exponential back-off delay before a failed pod is…
Azeem
  • 33
  • 7
1
vote
0 answers

Spark on Kubernetes troubleshooting

I have a Kubernetes cluster where I try to run a spark example application (spark-pi). I have set up a service account = spark. Scenario: When I do a spark-submit from the command line like below, I am getting a container error. bin/spark-submit…
user3370144
  • 99
  • 1
  • 5