Questions tagged [databricks-connect]

172 questions
0
votes
1 answer

Databricks (SQL Warehouse) How to read data from postgres (jdbc)

Trying to connect database sql warehouse from postgres db but the problem I am facing is that it shows me that data source is not supported. Please see screenshot below. enter image description here
0
votes
0 answers

Pyspark databricks connect library issue

I am currently coding pyspark pipelines using databricks connect library. The steps I followed are given here. This library has been installed in a virtual environment. When I try to execute this code spark.read.load(path).first() I get this…
Trijit
  • 501
  • 1
  • 4
  • 18
0
votes
1 answer

Handling spaces in the abfss using COPY INTO with Azure Databricks

I am trying to use the COPY INTO statement in Databricks to ingest CSV files from Cloud Storage. The problem is that the folder name has a space in it /AP Posted/ and when I try to refer to the path the code execution raises the below error: Error…
0
votes
0 answers

Automate - data import from databricks to local using databricks CLI

I have configured databricks-CLI and its working fine for the copy to local command. databricks fs cp dbfs:/FileStore/data/filename.csv C:\Users Can i automate this in python. Using "import databricks_cli". Please let me know how to do it?
0
votes
0 answers

Issue while integrating PyCharm with Databricks cluster

I am trying to integrate PyCharm with my team's Databricks cluster using databricks-connect. I created a venv and did a pip install databricks-connect. Then I configured it with the necessary info of databricks cluster. When I run databricks-test, I…
Niranjan
  • 13
  • 4
0
votes
1 answer

Cannot load spark-avro jars with databricksversion 10.4

Currently, I am facing an issue since the databricks-connect runtime on our cluster was updated to 10.4. Since then, I cannot load the jars for spark-avro anymore. By Running the following code from pyspark.sql import SparkSession spark =…
Lazloo Xp
  • 858
  • 1
  • 11
  • 36
0
votes
1 answer

Error logging Spark model with mlflow to databricks registry, via databricks-connect

I'm trying to log a trained spark model on mlflow using databricks-connect. I want this model to be logged in the Databricks registry. For now, my code looks like…
itscarlayall
  • 128
  • 1
  • 14
0
votes
0 answers

`NoClassDefFoundError: spray/json/JsonWriter` error when tying to fit a synapse.ml LightGBMRegressor

I am trying to run a spark LightGBMRegressor conncting to databricks with databricks-connect using pycharm. when trying to "fit" my data I get an error NoClassDefFoundError: spray/json/JsonWriter. code i am trying to run: if…
eliavs
  • 2,306
  • 4
  • 23
  • 33
0
votes
1 answer

how to pass values to a command in Linux if it is asking yes and other parameters

I am working on connecting to a data bricks workspace from databricks-connect command using bash script i have tried the following command to configure echo "y $(databricks url) $(token) $(cluster_id) $(org_id) $(port)" | databricks-connect…
0
votes
1 answer

Converting SQL Query to Databricks SQL

I have a query that I need to convert to Databricks SQL or run against a table in a Databrick environment but failing even though it works very well against tables SQL Server. The tables and query can be found here The query to convert or run in…
0
votes
0 answers

How do I get file metadata using Databricks Connect?

I am using Azure Databricks, which I have hooked up to a data lake and I want to get metadata such as modified date for the files in the lake. I am able to do this within Databricks itself using os.stat() as detailed in this answer, but I am…
Tim Hoare
  • 3
  • 1
0
votes
1 answer

Connect to Amazon Aurora using Pyspark

I am new in Pyspark. I am trying to connect to Amazon Aurora from Databricks using Pyspark. Can anyone please share a sample code to connect to Amazon Aurora from Databricks using Pysparks.
0
votes
1 answer

Databricks Connect: Unable to run scala program in databricks cluster from IntelliJ

Followed the steps mentioned in this docs. databricks-connect test command works fine. However, when I launch the test scala program from Intellij, I'm seeing following error: Exception in thread "main" java.lang.NoSuchMethodError:…
0
votes
1 answer

Databricks streaming "A file referenced in the transaction log cannot be found"

So whenever i am trying to read from a source with stream i get this error "A file referenced in the transaction log cannot be found" and it points to a file that does not exist. I have tried: Changing the checkpoint location Changing the start…
0
votes
0 answers

ERROR Shell: Failed to locate the winutils binary in the hadoop binary path

I know there are many questions like this but i tried all solutions trust me. And i keep getting the same error again and again. I am trying to access spark of remote clusters and running localy by using data-bricks connect and conda env and the IDE…
Boyuis
  • 1
  • 4