Questions tagged [apache-toree]

Toree provides applications with a mechanism to interactively and remotely access Apache Spark.

About

Toree provides applications with a mechanism to interactively and remotely access Apache Spark.

Links

65 questions
9
votes
6 answers

Jupyter pyspark : no module named pyspark

Google is literally littered with solutions to this problem, but unfortunately even after trying out all the possibilities, am unable to get it working, so please bear with me and see if something strikes you. OS: MAC Spark : 1.6.3 (2.10) Jupyter…
Saurabh Mishra
  • 1,703
  • 3
  • 17
  • 27
9
votes
2 answers

Work with Jupyter on Windows and Apache Toree Kernel for Spark compatibility

I´m trying to install Apache Toree kernel for spark compatibility and I´m running into a strange environmental message. This is the process I followed: Install last Anaconda Version with Jupyter 4.1.0 Run: pip install --pre toree Run: jupyter toree…
Luis
  • 159
  • 4
  • 12
8
votes
0 answers

Jupyter + Apache Toree Kernel error on Windows 10

I get a kernel error when I am creating a Jupyter notebook with the Apache Toree - Scala kernel. Here is the stack trace: Traceback (most recent call last): File "C:\Users\darie\Anaconda3\lib\site-packages\notebook\base\handlers.py", line 516,…
8
votes
2 answers

Apache Toree to connect to a remote spark cluster

Is there a way to connect Apache Toree to a remote spark cluster? I see the common command is jupyter toree install --spark_home=/usr/local/bin/apache-spark/ How can I go about using spark on a remote server without having to install locally?
yunli.tang
  • 115
  • 1
  • 6
6
votes
1 answer

Apache Toree and Spark Scala Not Working in Jupyter

I'm having problems running Scala Spark on Jupyter. Below is my error message when I load Apache Toree - Scala notebook in jupyter. root@ubuntu-2gb-sgp1-01:~# jupyter notebook --ip 0.0.0.0 --port 8888 [I 03:14:54.281 NotebookApp] Serving notebooks…
Ming
  • 332
  • 4
  • 17
6
votes
1 answer

Apache Toree Syntax Highlighting for Scala

The syntax highlighting for "Apache Toree - Scala" doesn't work correctly. When I write some code in the cell, Jupyter doesn't highlight it. However, when I refresh the notebook, it highlights everything: I have already configured kernel.json…
Ashkan
  • 1,643
  • 5
  • 23
  • 45
6
votes
1 answer

How to get iPython inbuild magic command to work in Jupyter notebook Pyspark kernel?

I am using PySpark kernel installed through Apache Toree in Jupyter Notebook using Anaconda v4.0.0 (Python 2.7.11). After getting a table from Hive, use matplotlib/panda to plot some graph in Jupyter notebook, following the tutorial as…
Angletear
  • 71
  • 4
6
votes
2 answers

How to install Apache Toree for Spark Kernel in Jupyter in (ana)conda environment?

I am trying to install Jupyter-support for Spark in a conda environment (which I set up using http://conda.pydata.org/docs/test-drive.html) of the anaconda distribution. I am trying to use apache toree as Jupyter Kernel for this. Here is what I did…
Make42
  • 12,236
  • 24
  • 79
  • 155
5
votes
3 answers

Installing Scala kernel (or Spark/Toree) for Jupyter (Anaconda)

I'm running RHEL 6.7, and have Anaconda set up. (anaconda 4.10). Jupyter is working OOTB, and it by default has the Python kernel. Everything is dandy so I can select "python notebook" in Jupyter. I'm now looking to get Scala set up with Jupyter as…
robromo
  • 321
  • 1
  • 4
  • 15
4
votes
1 answer

How to add external jar to Scala in Jupyter kernel

I would like to add the jar files from Stanford's CoreNLP into my Scala project. The part I'm struggling with in doing this in the context of a Scala kernel for Jupyter notebooks. I'm using the Apachee Toree distribution for the kernel. There may be…
Aus_10
  • 670
  • 7
  • 15
4
votes
2 answers

Running Spark + Scala + Jupyter on Dataproc

I haven't yet managed to get Spark, Scala, and Jupyter to co-operate. Does anyone have a simple recipe? Which version of each component did you use?
4
votes
3 answers

install spark packages in toree

I usually start my spark-shell with: ./bin/spark-shell --packages com.databricks:spark-csv_2.10:1.2.0,graphframes:graphframes:0.1.0-spark1.6,com.databricks:spark-avro_2.10:2.0.1 I'm trying to use Apache Toree now, any idea of how should I load…
elelias
  • 4,552
  • 5
  • 30
  • 45
3
votes
0 answers

How to make Apache Toree work with Jupyter

I am trying to work with Apache Three and jupyter, in Mac OS High Sierra. While I am installing everything appropriately with: pip3 install --user toree when I am trying to install anything with jupyter toree install I end up with a message as…
Noowada
  • 53
  • 1
  • 9
3
votes
0 answers

Error connecting to Spark from Jupyter using Apache Toree SparkR kernel

I am trying to connect to Spark 2.1.0 from Jupyter using Apache Toree SparkR kernel. The kernel loads correctly but when I try to execute a cell an error appears and repeat infinitely. Connecting to Spark using Scala and Python kernels work…
3
votes
3 answers

Toree Installation Issue

I wanted to pip install Toree package, but I ended up with the following error msg: Could not find a version that satisfies the requirement toree (from versions: ) No matching distribution found for toree I couldn't find any documentation on…
MLhacker
  • 1,382
  • 4
  • 20
  • 35
1
2 3 4 5