Apache Zeppelin is a web-based notebook that enables data-driven interactive data analytics. You can make beautiful data-driven, interactive and collaborative documents with SQL, Python, Scala and more. It also supports Markdown syntax.
Questions tagged [apache-zeppelin]
1460 questions
7
votes
2 answers
com.fasterxml.jackson.databind.JsonMappingException: Jackson version is too old 2.5.3
My OS is OS X 10.11.6.
I'm running Spark 2.0, Zeppelin 0.6, Scala 2.11
When I run this code in Zeppelin I get an exception from Jackson.
When I run this code in spark-shell - no exception.
val filestream =…

dbl001
- 2,259
- 8
- 39
- 53
7
votes
2 answers
How to put a variable into z ZeppelinContext in javascript in Zeppelin?
In Scala and Python it's:
z.put("varname", variable)
But in javascript I get (in the console)
Uncaught ReferenceError: z is not defined
What I really want to do is access a javascript variable from Scala code using z.angular("varname") in…

samthebest
- 30,803
- 25
- 102
- 142
7
votes
4 answers
Zeppelin - Cannot query with %sql a table I registered with pyspark
I am new to spark/zeppelin and I wanted to complete a simple exercise, where I will transform a csv file from pandas to Spark data frame and then register the table to query it with sql and visualise it using Zeppelin.
But I seem to be failing in…

StefanK
- 2,030
- 1
- 21
- 26
7
votes
2 answers
org.apache.thrift.transport.TTransportException error while Reading large JSON file in zeppelin scala
I am trying to read a large JSON file (1.5 GB) using Zeppelin and Scala.
Zeppelin is working on SPARK in local mode installed on Ubuntu OS on a VM with 10 GB RAM. I have alloted 8GB to the spark.executor.memory
My Code is as below
val…

Kiran Shashi
- 71
- 1
- 1
- 3
7
votes
2 answers
How to run zeppelin notebook from command line (automatically)
How do we run the notebook from command line?
Further to 1, how would I pass command line arguments into the notebook? I.e. access the command line args from within the notebook code?

thousif ahmed
- 71
- 1
- 3
7
votes
2 answers
Zeppelin SqlContext registerTempTable issue
I am trying to access some json data using sqlContext.jsonFile in zeppelin...
following code execute without any error:
import sys.process._
val sqlCon = new org.apache.spark.sql.SQLContext(sc)
val jfile =…

Nihal Bhagchandani
- 239
- 5
- 9
6
votes
1 answer
Apache Zeppelin not working with https for maven repo
I'm running Apache Zeppelin 0.8.0 in Amazon EMR. Recently the spark interpreter started to fail to pull down library dependencies. This was because the zeppelin.interpreter.dep.mvnRepo configuration parameter was set to…

Marcus
- 73
- 5
6
votes
1 answer
py4JJava Error - error while using select statement
I am using pspark in Zeppelin notebook and trying to fetch data using SELECT statement. I am simply trying to query a table but getting weird error for the following command:
%pyspark
spark.sql('select * from default.abc').show()
Here is the error…

user1584253
- 975
- 2
- 18
- 55
6
votes
0 answers
Create a Spark pool by user by default on Zeppelin Notebook
I am working with Spark inside Zeppelin in a collaborative environment. So we have only one interpreter and many users are using this interpreter. For this reason, I defined it using instantiation per user in scoped mode.
With this configuration, a…

Igor Uchôa
- 329
- 1
- 9
6
votes
1 answer
Rerun Scala code with -deprecation using Apache Zeppelin
I am using a Zeppelin notebook to create a Spark script in Scala.
Code:
def test: DataFrame= {
//code
}
I am getting the following warning:
warning: there was one deprecation warning; re-run with -deprecation for details.
Any suggestions of…

data_person
- 4,194
- 7
- 40
- 75
6
votes
1 answer
[AWS Glue]: org.apache.thrift.TApplicationException: Internal error processing createInterpreter
I'm trying to use zeppelin-0.8.0 to connect to AWS Glue Development endpoint and when executing a cell below error occurs.
And there is no helpful message to understand what could be the problem. Any leads appreciated
172318_1906434757 is finished,…

Somasundaram Sekar
- 5,244
- 6
- 43
- 85
6
votes
1 answer
Container killed by YARN for exceeding memory limits
I am creating a cluster in google dataproc with the following characteristics:
Master Standard (1 master, N workers)
Machine n1-highmem-2 (2 vCPU, 13.0 GB memory)
Primary disk 250 GB
Worker nodes 2
Machine type n1-highmem-2 (2…

Mpizos Dimitris
- 4,819
- 12
- 58
- 100
6
votes
1 answer
Accessing hdfs from docker-hadoop-spark--workbench via zeppelin
I have installed https://github.com/big-data-europe/docker-hadoop-spark-workbench
Then started it up with docker-compose up . I navigated to the various urls mentioned in the git readme and all appears to be up.
I then started a local apache…

Jas
- 14,493
- 27
- 97
- 148
6
votes
1 answer
What is the equivalent of jupyter's Restart Kernel in Zeppelin/Spark
I'm just starting my journey on learning Apache Spark and I'm using Zeppelin to play with it interactively. So far so good, but once in a while I want to start clean and want to restart the kernel. I coulnd't find such function in Zeppelin....…

fatdragon
- 2,211
- 4
- 26
- 43
6
votes
1 answer
How to auto update %spark.sql result in zeppelin for structured streaming query
I'm running structured streaming in (spark 2.1.0 with zeppelin 0.7) for data coming from kafka and I'm trying to visualize the streaming result with spark.sql
as below :
%spark2
val spark = SparkSession
.builder()
.appName("Spark structured…

Ibrahim Mousa
- 61
- 4