Questions tagged [databricks-community-edition]
85 questions
0
votes
1 answer
Pyspark streams proccess 1 record per trigger
I am trying the Databricks Developer Foundation Capstone but I cant seem to pass the streaming Exercises.
I am supposed to read a stream of json data, transform it and append it back to a table.
I created the DataFrame like this:
df =…

LeM4
- 31
- 1
- 5
0
votes
0 answers
Cannot find '/dbfs/databricks-datasets' in my notebook
I am using the Databricks community edition and working through the ML intro tutorial.
I am able to %fs ls databricks-datasets/COVID/covid-19-data/us-states.csv, but not able to read it through pandas
>>> df =…

Ying Xiong
- 4,578
- 8
- 33
- 69
0
votes
1 answer
How to build a custom docker image which enables the connection of Apache Databricks with Denodo?
For now we are giving all the configurations manually in the Databricks to connect to denodo database.Now the issue is we need to have a docker image which can enable the connection so that a new user doesnt have to do this manually.

Laster
- 388
- 5
- 18
0
votes
1 answer
How to connect to Community Edition Databricks Cluster via Outside Public Address / Application
Can someone let me know if its possible to connect or PING a Databricks Cluster via its public ip address?
For example I have issued the command ping --all-ip-addresses and I get the ip address 10.172.226.115.
I would like to be able to PING that ip…

Carltonp
- 1,166
- 5
- 19
- 39
0
votes
1 answer
Create SQL user in Databricks
Is there a way to create a database (sql) user (analogous to an Oracle or MySql user) in Databricks? It doesn't seem possible in the Community edition.
For a long list of reasons... I have a need to create a very large number of tables and would…

John
- 3,458
- 4
- 33
- 54
0
votes
1 answer
Too many files on my Databricks Community cluster, but where?
I started playing with streaming on my Community Edition Databricks but after some minutes of producing test events I encountered some problem. I believe it's somehow connected with the fact of some temporary small files produced during streaming…

luk
- 105
- 4
-1
votes
1 answer
Data-bricks - SQL - SUM of years
I'm would like to get the SUM of each column by years. Rather then displays several individual rows for the same year.
spark.sql("""
SELECT YEAR(date) AS year,
useful, funny, cool
FROM reviews_without_text_table
ORDER by year…

Misterbenzly
- 1
- 2
-1
votes
1 answer
Course indication for community databricks
I need to learn how to use databricks, using the Python programming language. Does anyone have any introductory course tips that teach how to use the Databricks community (which is free), where I can use the Python language?
Any tips swill be…

Amanda SB
- 16
- 7
-1
votes
1 answer
Can't Access a global view table with SQL by a selected field code converted from a dropdown selection
I am using a SQL notebook in Databricks/Spark community edition
%python
education_DF = sqlContext.sql('select * from global_temp.population_Globaltmp_view where `Education` = "2YD"')
display(education_DF)
The above code cell works fine and…

Christopher Pfeifer
- 99
- 6
-1
votes
1 answer
How to run scala code inside databricks community edition notebook?
Databricks community edition notebook allows one to practice scala guide and try examples in real-time. However, the default is set to pySpark and I was wondering how I can run scala code instead?

Vadim Smolyakov
- 1,187
- 11
- 24