Questions tagged [apache-zeppelin]

Apache Zeppelin is a web-based notebook that enables data-driven interactive data analytics. You can make beautiful data-driven, interactive and collaborative documents with SQL, Python, Scala and more. It also supports Markdown syntax.

Apache Zeppelin home page

1460 questions
8
votes
1 answer

Saving/Exporting the results of a Spark SQL Zeppelin query

We're using apache zeppelin to analyse our datasets. We have some queries that we would like to run that have a large number of results that come back from them and would like to run the query in zeppelin but save the results (display is limited to…
vcetinick
  • 1,957
  • 1
  • 19
  • 41
8
votes
1 answer

zeppelin with sparkr is not displaying dataframe as table

The zeppelin R interpreter documentation states: If you return a data.frame, Zeppelin will attempt to display it using Zeppelin's built-in visualizations. This can be seen in the documentation example: However, when I attempt to run the same R…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
8
votes
1 answer

Moving Spark DataFrame from Python to Scala whithn Zeppelin

I created a spark DataFrame in a Python paragraph in Zeppelin. sqlCtx = SQLContext(sc) spDf = sqlCtx.createDataFrame(df) and df is a pandas dataframe print(type(df)) what I want to do is moving spDf from one…
MTT
  • 5,113
  • 7
  • 35
  • 61
8
votes
1 answer

How to use Plotly with Zeppelin

I've seen zeppelin-plotly but it seems too complicated. The other things that worries me is that it involves modifying zeppelin's .war file and I don't want to break things by error. Is there another way to use Plotly with Zeppelin?
Cristian Garcia
  • 9,630
  • 6
  • 54
  • 75
8
votes
1 answer

Display multiple data points in single chart

scala : val df = sc.parallelize(Seq( ("Jan" , "1", "1","3"), ("Feb" , "2", "5","2"), ("Mar" , "2", "3","1"))) .toDF("time" , "f1", "f2", "f3") df.registerTempTable("inout") sql : %sql select time , f1 , f2 , f3 from inout…
blue-sky
  • 51,962
  • 152
  • 427
  • 752
8
votes
2 answers

Register UDF to SqlContext from Scala to use in PySpark

Is it possible to register a UDF (or function) written in Scala to use in PySpark ? E.g.: val mytable = sc.parallelize(1 to 2).toDF("spam") mytable.registerTempTable("mytable") def addOne(m: Integer): Integer = m + 1 // Spam: 1, 2 In Scala, the…
8
votes
2 answers

Apache zeppelin tutorial, error "sql interpreter not found"

In the "Zeppelin tutorial" notebook, I can't use the %sql interpreter. It will output "sql interpreter not found". But the spark commands work fine, as well as %md and %sh. Here's the log : ERROR [2015-10-20 10:13:35,045] ({qtp885851948-51}…
thomas legrand
  • 493
  • 1
  • 5
  • 16
8
votes
4 answers

Zeppelin change port already in use by Spark Master

When I start zeppelin on AWS, It starts on port 8080, but there is Spark Master so it says port already in use... I tried changing port in zeppelin in config files, in "zeppelin-site.xml.template" zeppelin.server.port
Leandro Latorre
  • 161
  • 1
  • 1
  • 6
7
votes
1 answer

Job 65 cancelled because SparkContext was shut down

I'm working on a shared Apache Zeppelin server. Almost every day, I try to run a command and get this error: Job 65 cancelled because SparkContext was shut down I would love to learn more about what causes the SparkContext to shut down. My…
Cauder
  • 2,157
  • 4
  • 30
  • 69
7
votes
1 answer

Zeppelin does not display stack trace

I installed Zeppelin 0.8.0 on Windows 10. When I write in a notebook with spark interpreter, correct code works fine, outputs are displayed, but stack trace is not displayed in case of error. It's like in is written in white, but actually lines…
Baptiste Merliot
  • 841
  • 11
  • 24
7
votes
1 answer

no $ZEPPELIN_HOME/scripts/docker/spark-cluster-managers/spark_standalone file

I was looking at zeppelin documentation for starting local spark it says: cd $ZEPPELIN_HOME/scripts/docker/spark-cluster-managers/spark_standalone However at $ZEPPELIN_HOME I have no scripts folder: $ ls -l ~/dev/zeppelin-0.7.3-bin-all/ LICENSE …
Jas
  • 14,493
  • 27
  • 97
  • 148
7
votes
0 answers

Zeppelin load full project /external files

I have a regular sbt scala spark project and would like to integrate with zeppelin. As you can imagine the project not only contains a single big file but rather some nice classes to modularize functionality. I is unclear to me how this could…
Georg Heiler
  • 16,916
  • 36
  • 162
  • 292
7
votes
1 answer

Why --py-files is not supported in zeppelin?

I have a python package with many modules built into an .egg file and I want to use this inside zeppelin notebook. Acc to the zeppelin documentation, to pass this package to zeppelin spark interpreter, you can export it through --files option in…
Meethu Mathew
  • 431
  • 1
  • 6
  • 15
7
votes
2 answers

Is it possible to set global variables in a Zeppelin Notebook?

I'm trying to create a multi-paragraph dashboard using a Zeppelin notebook. I'd like people using the dashboard to only have to enter certain parameters once. E.g. if I'm making a dashboard with information about different websites, the dashboard…
Danny David Leybzon
  • 670
  • 1
  • 9
  • 21
7
votes
2 answers

Integrating Apache Zeppelin notebook into a web application

I would like to integrate Apache Zeppelin notebooks into another web application that has existing integration with Apache Spark. What is the best way of doing this? I am considering two possibilities: Render notebooks and paragraphs myself using…
avanwyk
  • 700
  • 6
  • 13