Questions tagged [apache-zeppelin]

Apache Zeppelin is a web-based notebook that enables data-driven interactive data analytics. You can make beautiful data-driven, interactive and collaborative documents with SQL, Python, Scala and more. It also supports Markdown syntax.

Apache Zeppelin home page

1460 questions
0
votes
2 answers

How to create notebook with paragraph loaded with code using zeppelin REST API

I want to create zeppelin notebook via zeppelin REST API. I don't want to create an empty notebook, instead I want to create a notebook with some scala code in it. What parameters do I need to pass to create such a notebook? Or is it possible to…
sag
  • 5,333
  • 8
  • 54
  • 91
0
votes
1 answer

Spark Scala API: Map PairDStream to another PairDStream

I'm working with Spark's Java API. Now I need to "translate" my Java Application into Scala code because I want to try out Apache Zeppelin. I have a PairDStream (1.1.1.123,(1,1,1,1)) and need to map its pairs to a PairDStream of form…
D. Müller
  • 3,336
  • 4
  • 36
  • 84
0
votes
2 answers

Apache Zeppelin + EMR(Spark) Cluster for opening the firewall

I tried to use apache zeppelin with EMR(Spark) Cluster. I get some requirements for using apache zeppelin + EMR cluster with opening firewall. In the workplace, there is the static ip which is blocked by the firewall. As you know, EMR clusters…
Kangrok Lee
  • 101
  • 13
0
votes
1 answer

Why doesn't zeppelin server on spark cluster respond?

I installed zeppelin in a local model and cluster model. They all installed and connected successfully. But the cluster model cannot process my code, despite the zeppelin examples. It started and was pending and running for a long time then resulted…
alex44jzy
  • 51
  • 1
  • 6
0
votes
2 answers

Issues Installing Zeppelin on CentOS 6 with Vagrant

We are trying to stand up a sandbox/evaluation instance of Zeppelin on a 4-node CentOS 6 cluster with Vagrant and having some issues with dependencies in the build process. Here is the high level script we’re running. (Have tried running this as…
0
votes
1 answer

Using the public IP on AWS from java getCanonicalHostName method

I'm trying to use AWS as a scalable analytics tool. I'm using apache zeppelin as an interactive shell to a Spark cluster and trying to plot using wisp. This is causing a problem as the plotting approach in wisp is to start a web app based on what I…
-1
votes
1 answer

How to pass a variable into an Pyspark sequence to generate time series?

I want to generate a time series, from 2021-12-01 to 2021-12-31, but I want to pass the values with variables into de function secuence. This is my code: spark = SparkSession.builder.appName('sparkdf').getOrCreate() TyP_dias =…
Artemination
  • 703
  • 2
  • 10
  • 30
-1
votes
1 answer

Why does StringIndexer has no outputCols?

I am using Apache Zeppelin. My anaconda version is conda 4.8.4. and my spark version is: %spark2.pyspark spark.version u'2.3.1.3.0.1.0-187' When I run my code, it throws followed error: Exception AttributeError: "'StringIndexer' object has no…
JAdel
  • 1,309
  • 1
  • 7
  • 24
-1
votes
1 answer

Problems when save the parquet file from Zeppelin executor to HDFS

We have problems when save the parquet file from Zeppelin executor to HDFS like resultIndexed .repartition(1) .write .parquet(s"hdfs://hdfs-hadoop-hdfs-nn.svc.cluster.local:9000/home/zeppelin/sensor_parquet") and got the…
Eljah
  • 4,188
  • 4
  • 41
  • 85
-1
votes
1 answer

How to convert dataframe to rdd in Zeppelin to use graphX

I want to use graphX in zeppelin with my dataframe First, my dataframe is as below. +---+-----+---+ | id| name|age| +---+-----+---+ | a| AA| 34| | b| BB| 36| | c| CC| 30| | d| DD| 29| | e| EE| 32| | f| FF| 36| | g| GG|…
-1
votes
1 answer

How to fix the error mismatched input 'partition' for window functions in spark sql?

I'd like to run a window function in Spark SQL. I have Zeppelin sitting on top of a Spark cluster with Hadoop. I'd like to add a row number to a table and group it by a combination of two IDs. This is my data. food aisle item …
Cauder
  • 2,157
  • 4
  • 30
  • 69
-1
votes
1 answer

Can I write an HTML script and pass information from the script to a cell on Qubole?

Is it possible to write an HTML script and have the user interact on the HTML script and pass the data back to the zeppelin cell and have it rerun the data passed back? Thank you! Update: Have some progress on rerunning the cell on a html…
Dillon
  • 11
  • 1
-1
votes
1 answer

Is there a way to compare all rows in one column of a dataframe against all rows in another column of another dataframe (spark)?

I have two dataframes in Spark, both with an IP column. One column has over 800000 entries while the other has 4000 entries. What I want to do is to see if the IP's in the smaller dataframe appear in the IP column of the large dataframe. At the…
-1
votes
1 answer

pyspark df.write.format("jdbc") grant statement DB2

I want add a command to add my grant statement so i can provide Select access to users to the table i have loaded using pyspark. df.write.format("jdbc") .option("url", "jdbc:db2:@//:/")…
-1
votes
1 answer

Zeppelin: Facing error after running scala query

PooledByteBufAllocator.metric()Lio/netty/buffer/PooledByteBufAllocatorMetric; Zeppelin Version: zeppelin-0.8.0
Huzefa Khan
  • 127
  • 5