Questions tagged [databricks-connect]

172 questions
1
vote
0 answers

databricks-connect version 13: spark-class2.cmd not found

I install the newest version "databricks-connect==13.0.0". Now get the issue Command C:\Users\Y\AppData\Local\pypoetry\Cache\virtualenvs\X-py3.9\Lib\site-packages\pyspark\bin\spark-class2.cmd"" not found konnte nicht gefunden werden. Traceback (most…
Lazloo Xp
  • 858
  • 1
  • 11
  • 36
1
vote
1 answer

Can't import modules installed on cluster when connecting with databricks-connect

Original question I'm trying to move from databricks runtime 10.4 LTS to 11.3 LTS. I'm able to connect fine with databricks-connect, but I am not able to import the correct version of the modules that I have loaded on the cluster. I start by…
fskj
  • 874
  • 4
  • 15
1
vote
1 answer

Python in Databricks

How to even start a basic query in databricks using python? The data I need is in databricks and so far I have been using Juypterhub to pull the data and modify few things. But now I want to eliminate a step of pulling the data in Jupyterhub and…
1
vote
1 answer

Moving a Pyspark project development form Databricks UI to VSCode using databricks connect

I am inheriting a huge pyspark project and instead of using the the Databricks UI for development I would like to use vscode via databricks-connect. Because of this I am failing to determine the best practices for the following: Because the project…
1
vote
1 answer

Is there any way to create Databricks jobCluster through Databricks connect?

Is there any way to create Databricks jobCluster through Databricks connect? We are using All purposed cluster so far, to reduce Databricks cost we are planning to go ahead with jobCluster but unfortunately, I couldn't find a way to create…
Sandesh
  • 109
  • 7
1
vote
1 answer

Snowflake -Data-bricks : Creating snowflake temp table in data-bricks

We wanted to create snowflake temp table in databricks. Well temp table gets created but when we try to access it it says the table does note exist. We checked snowflake history and saw the temp table being created but other query for access is…
1
vote
1 answer

Practice Exam Databricks Certified Data Engineer Associate - Question 40

Why option B is not the answer for this question? Question 40 A data analyst has noticed that their Databricks SQL queries are running too slowly. They claim that this issue is affecting all of their sequentially run queries. They ask the data…
1
vote
1 answer

Get cluster metric (Ganglia charts) of all clusters via REST API in Databricks

The question is specific to databricks. Is there any API to get the ganglia chart showing cluster usage? Need to get all the Ganglia charts that are available in the Databricks cluster metrics section for all the clusters via REST API calls. We are…
1
vote
0 answers

Graphframes connectedComponents is not working if I run my spark jobs via databricks connect

Graphframe connectedComponents is throwing exceptions when i try to run my spark job from databricks connect. Here are the configurations i am using for spark session spark = ( SparkSession .builder .config( "spark.jars.packages", …
shahidammer
  • 1,026
  • 2
  • 10
  • 24
1
vote
1 answer

DbUtils: For Development environment

We have created secrets and scopes in datababricks and we are using dbUtils to access those secrets and scopes. dbUtils.getSecret(SCOPE,KEY) Unfortunately, this dbUtils function is not working on development environment, so I need advice on how to…
ConfusedDeveloper
  • 6,389
  • 4
  • 21
  • 36
1
vote
1 answer

Use remote driver with Databricks Connect

When connecting to Databricks cluster from local IDE, I take that only spark-related commands are executed in remote mode (on cluster). How about single-node operations such as scikit-learn, to_pandas. If these functions only use local machine, the…
1
vote
0 answers

Databricks-connect not able to load data

After a lot of struggling i finally managed to connect to databricks remotely only to find that i'm not allowed to load for some reason. I'm posting here in the hope that someone out there has the answer. What's trying to happen here is using…
1
vote
0 answers

Databricks - Unable to run databricks-connect. Get Access Denied

I am using JRE - 1.8 and databricks-connect version 9.1.15 with python version - Python 3.7.9 openjdk version "1.8.0_41" OpenJDK Runtime Environment (build 1.8.0_41-b04) Python 3.7.9 (tags/v3.7.9:13c94747c7, Aug 17 2020, 16:30:00) I see that…
akshat thakar
  • 1,445
  • 21
  • 29
1
vote
0 answers

Best Pyspark Testing : issue with databricks -connect

I'm currently using databricks and in order to test my databricks code I'm using databricks connect in VS code. While I'm using databricks connect, since yesterday suddenly it started behaving strange, while I'm submitting a code from VS Code, the…
dsk
  • 1,863
  • 2
  • 10
  • 13
1
vote
0 answers

How to resolve "java.lang.ClassNotFoundException" when running Scala Spark project using databricks-connect?

Currently I am learning how to use databricks-connect to develop Scala code using IDE locally. The set-up of the databricks-connect as described here was succuesful. This is an assumption as I can run PySpark Code against the cluster with no…
DataBach
  • 1,330
  • 2
  • 16
  • 31