Questions tagged [databricks-connect]

172 questions
1
vote
1 answer

Creating database in Azure databricks on External Blob Storage giving error

I have mapped my blob storage to dbfs:/mnt/ under name /mnt/deltalake and blob storage container name is deltalake. Mounting to Dbfs is done using Azure KeyVault backed secret scope. When I try to create a database using CREATE DATABASE abc with…
user12823830
1
vote
2 answers

Configuring databricks-connect using python OS module

I want to configure databricks-connect configure through python OS module after installing databricks-connect through os.system("pip install databricks-connect==6.5") Once databricks-connect is successfully install we need to configure it by passing…
qaiser
  • 2,770
  • 2
  • 17
  • 29
1
vote
1 answer

How to fix spark.read.format("parquet") error

I'm running Scala code on Azure databricks well. Now I want to move this code from Azure notebook to eclipse. I install databricks connection following Microsoft document successfully. Pass databricks data connection test. I also installed SBT and…
1
vote
0 answers

Databricks-Connect 6.6 doesn't add custom modules to Spark Context

I'm experiencing a strange behaviour in Databricks-Connect 6.6 and was wondering if anybody has seen this before and knows what causes this problem. I have created a local Spark Context via Databricks-Connect and can successfully connect to my…
Jonas
  • 61
  • 3
1
vote
1 answer

Databricks-Connect also return module not found for multiple python files job

Currently I'm connecting to databricks with local VS Code via databricks-connect. But my submmission all comes with error of module not found, which means the code in other python files not found. I tried: Move code into the folder with…
1
vote
0 answers

Access Network Drive in Azure Databrick Notebook

I have a network drive that stores some files. I need to access these files via my Azure Databricks notebook. Any ideas? I tried for root, dirs, files in os.walk('\\\\\\'): for name in files: print(os.path.join(root,…
1
vote
2 answers

Equivalent of Databricks sc.broadcast() locally with Databricks connect

I am transferring some code from Databricks notebook into Jupyter notebook locally. The following code that works in the Databricks Notebook is not working locally. res = sc.broadcast(spark.read.table(my_table)) Here is my local code: from…
Liky
  • 1,105
  • 2
  • 11
  • 32
1
vote
0 answers

SparkSession Connect to Databricks Azure

I'm using maven and scala to create a spark application that needs to connect to a cluster on azure databricks. How can i point my sparksession to connect to the databricks cluster? I saw databricks-connect, but it loads some jar files using sbt. I…
1
vote
1 answer

databricks-connect, py4j.protocol.Py4JJavaError: An error occurred while calling o342.cache

Connection to databricks works fine, working with DataFrames goes smoothly (operations like join, filter, etc). The problem appears when I call cache on a dataframe. py4j.protocol.Py4JJavaError: An error occurred while calling o342.cache. :…
kabot
  • 81
  • 2
  • 6
0
votes
0 answers

Databricks ADSL Mount with Enabled from selected virtual networks and IP addresses

I am trying to access the ADSL file from databricks. I tried using SAS Token, Access key and Also with SPark.config (Service Principle) But its still not working. storageAccountName = "adlskush" storageAccountAccessKey = "AccountKye" sasToken =…
Kushal Patil
  • 205
  • 1
  • 2
  • 14
0
votes
0 answers

Configure remote tests on Databricks with the Testing Extension

I'm currently testing the Databricks VSCode extension. I find it way easier to use for developers than dbx because of its native integration. I'd like to test its integration with the "Test Lab" extension (cf image below) I managed to make my tests…
0
votes
0 answers

How to import datawarehouse to excel without using odbc?

I want to know how to import data warehouse to excel and load fast without using odbc. It's taking too long to load platform datawarehouse DATABRICKS I just tried to import the normal way via odbc, however, it's taking too long and I need to know if…
0
votes
0 answers

% pip install packages in a standalone python script

I would like to install packages as I create a sparksession in a standalone python script (.py file) that will be uploaded and run in Databricks. Something like this: from pyspark.sql import SparkSession spark =…
hhp
  • 109
  • 1
  • 7
0
votes
0 answers

Installing Azure Databricks Connect

I am trying to install Azure Databricks Connect following this article: Azure Databricks installation I am running: pip3 install --upgrade "databricks-connect==13.1.*" and getting back the following error: ERROR: Could not find a version that…
user1700890
  • 7,144
  • 18
  • 87
  • 183
0
votes
1 answer

How to declare variables in Databricks SQL editor

In Databricks Notebook (Sql) I was able to declare a variable and use it also with below syntax: set name.table=(select distinct name from t1); select * from t2 where name IN ${name.table} but the same is not working in the SQL Editor. Getting…