Questions tagged [databricks-community-edition]

85 questions
0
votes
1 answer

Pyspark streams proccess 1 record per trigger

I am trying the Databricks Developer Foundation Capstone but I cant seem to pass the streaming Exercises. I am supposed to read a stream of json data, transform it and append it back to a table. I created the DataFrame like this: df =…
0
votes
0 answers

Cannot find '/dbfs/databricks-datasets' in my notebook

I am using the Databricks community edition and working through the ML intro tutorial. I am able to %fs ls databricks-datasets/COVID/covid-19-data/us-states.csv, but not able to read it through pandas >>> df =…
Ying Xiong
  • 4,578
  • 8
  • 33
  • 69
0
votes
1 answer

How to build a custom docker image which enables the connection of Apache Databricks with Denodo?

For now we are giving all the configurations manually in the Databricks to connect to denodo database.Now the issue is we need to have a docker image which can enable the connection so that a new user doesnt have to do this manually.
0
votes
1 answer

How to connect to Community Edition Databricks Cluster via Outside Public Address / Application

Can someone let me know if its possible to connect or PING a Databricks Cluster via its public ip address? For example I have issued the command ping --all-ip-addresses and I get the ip address 10.172.226.115. I would like to be able to PING that ip…
Carltonp
  • 1,166
  • 5
  • 19
  • 39
0
votes
1 answer

Create SQL user in Databricks

Is there a way to create a database (sql) user (analogous to an Oracle or MySql user) in Databricks? It doesn't seem possible in the Community edition. For a long list of reasons... I have a need to create a very large number of tables and would…
John
  • 3,458
  • 4
  • 33
  • 54
0
votes
1 answer

Too many files on my Databricks Community cluster, but where?

I started playing with streaming on my Community Edition Databricks but after some minutes of producing test events I encountered some problem. I believe it's somehow connected with the fact of some temporary small files produced during streaming…
-1
votes
1 answer

Data-bricks - SQL - SUM of years

I'm would like to get the SUM of each column by years. Rather then displays several individual rows for the same year. spark.sql(""" SELECT YEAR(date) AS year, useful, funny, cool FROM reviews_without_text_table ORDER by year…
-1
votes
1 answer

Course indication for community databricks

I need to learn how to use databricks, using the Python programming language. Does anyone have any introductory course tips that teach how to use the Databricks community (which is free), where I can use the Python language? Any tips swill be…
-1
votes
1 answer

Can't Access a global view table with SQL by a selected field code converted from a dropdown selection

I am using a SQL notebook in Databricks/Spark community edition %python education_DF = sqlContext.sql('select * from global_temp.population_Globaltmp_view where `Education` = "2YD"') display(education_DF) The above code cell works fine and…
-1
votes
1 answer

How to run scala code inside databricks community edition notebook?

Databricks community edition notebook allows one to practice scala guide and try examples in real-time. However, the default is set to pySpark and I was wondering how I can run scala code instead?
Vadim Smolyakov
  • 1,187
  • 11
  • 24
1 2 3 4 5
6