
When I run cells in my Jupyter Notebook that contain Spark commands (e.g., DataFrame.show() calls or spark.sql select statements on DataFrames of about 6 million rows), I get the following sequence of error messages:

Py4JJavaError: An error occurred while calling xxxx.showString.
SparkException: Job aborted due to stage failure.
Caused by: org.apache.spark.SparkException: Python worker failed to connect back.
Caused by: java.net.SocketTimeoutException: Accept timed out.

How should I interpret these errors?
I am working in a local Spark session with 8g of memory.
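
For reference, the session and the failing calls look roughly like this (a minimal sketch; the config key, file path, and view name are assumptions, since only the 8g figure is stated above):

    from pyspark.sql import SparkSession

    # Local session; "8g" is the only figure given in the question, and the
    # config key, data source, and view name below are illustrative assumptions.
    spark = (
        SparkSession.builder
        .master("local[*]")
        .appName("notebook")
        .config("spark.driver.memory", "8g")
        .getOrCreate()
    )

    df = spark.read.parquet("data.parquet")  # hypothetical ~6 million row source
    df.show()                                # raises the Py4JJavaError above

    df.createOrReplaceTempView("my_table")   # hypothetical view name
    spark.sql("SELECT * FROM my_table LIMIT 20").show()  # fails the same way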

