Which of the many Spark/Scala kernels for Jupyter/IPython to choose?

Question

There are a lot of Scala/Spark kernels for IPython/Jupyter:

Does anybody know wich of them is most compatible with IPython/Jupyter and most comfortable to use with:

Scala
Spark(Scala)

The IPython wiki has a list of many kernels (including other languages besides scala). Thought I would add it here: https://github.com/ipython/ipython/wiki/IPython-kernels-for-other-languages — Luciano, Jan 01 '17 at 16:24
Useful to comment if these come as source, binary or both. And the ease of installation, both on Win10/Linux/MacOS. Also, how do they compare to each other on CPU and memory performance? security? patches? magic commands? — smci, Oct 14 '17 at 17:27

score 15 · Accepted Answer · answered Oct 01 '15 at 11:53

15

I can't speak for all of them, but I use Spark Kernel and it works very well for using both Scala and Spark.

I found IScala and Jupyter Scala less stable and less polished. Jupyter Scala always prints every variable value after I execute a cell; I don't want to see this 99% of the time.

Spark Kernel is my favourite. Both for Spark and plain old Scala.

answered Oct 01 '15 at 11:53

Al M

557
4
10

How is difficult to run it? – Lunigorn Oct 01 '15 at 15:26
Can it drow plot to IPython? – Lunigorn Oct 01 '15 at 15:31
I haven't tried drawing plots with it, but i see no reason why it would not work. They were all very easy to run once they are installed. – Al M Oct 02 '15 at 16:13

score 5 · Answer 2 · answered Oct 07 '16 at 18:40

5

Spark Kernel has been accepted into Apache Incubator and has moved all development to Apache Toree.

answered Oct 07 '16 at 18:40

artyomboyko

2,781
5
40
54

Are you recommending it or just commenting? How does it compare on CPU and memory performance, install size, ease of install, etc? – smci Oct 14 '17 at 17:24

score 4 · Answer 3 · answered Jan 07 '16 at 10:22

I have been using spark-kernel (your option #4) and quite satisfied.

You can find a nice how-to installation (CDH 5.5 on CentOS 7) here (I have used it myself to install it in a Single node in pseudo-distributed mode).

http://www.davidgreco.me/blog/2015/12/24/how-to-use-jupyter-with-spark-kernel-and-cloudera-hadoop-slash-spark/

Which of the many Spark/Scala kernels for Jupyter/IPython to choose?

3 Answers3