Questions tagged [google-cloud-datalab]

An online tool for data analysis provided by Google as part of its Cloud Platform.

An easy-to-use interactive tool for large-scale data exploration, analysis, and visualization.

See more at https://cloud.google.com/datalab.

468 questions
3 votes, 2 answers

Difference between google.datalab.bigquery and google.cloud.bigquery

I'm using Datalab on Google Cloud Platform and was trying to create a BigQuery dataset with google.datalab.bigquery when I found I needed the .Client method, which was only in the google.cloud.bigquery library. What's the difference between the datalab…
3 votes, 1 answer

Installing GDAL on Google Cloud Datalab

I am having trouble installing GDAL on Google Cloud Datalab. When I run: !pip install gdal I get the following error Collecting gdal Using cached GDAL-2.2.4.tar.gz Complete output from command python setup.py egg_info: running egg_info creating…
3 votes, 1 answer

How to read data from Google storage cloud to Google cloud datalab

I have a few CSV files stored in Google Cloud Storage and I want to read them into Google Datalab. So far, I have no idea how to do it. I found this and followed the first answer, but it didn't work and raised File "", line…
bensw • 2,818 • 5 • 21 • 31

3 votes, 1 answer

Authentication issue on Google Cloud Datalab

I'm running Datalab locally using Docker, following the advice here (https://github.com/googledatalab/datalab/wiki/Getting-Started). I'm trying to run the following test snippet: %%bq query #SQL query to return total births by year SELECT…
goose • 2,502 • 6 • 42 • 69

3 votes, 1 answer

Can't reconnect to Datalab vm through cloud shell

I created and successfully used a datalab instance. Afterwards, I stopped the instance. Now, I'm trying to reconnect to the same datalab instance (using 'datalab connect {instance_name}') but this is resulting in the cloud shell hanging on the…
Balkan • 691 • 1 • 8 • 22

3 votes, 0 answers

Random Forest on Tensorflow at Google Cloud Datalab restarting kernel (Code not working)

I am using the data from the following Kaggle competition (https://www.kaggle.com/c/santander-product-recommendation) to train a Random Forest on TensorFlow. The code was working fine a day ago, but now whenever I run the training code for the Random…
3 votes, 0 answers

No Datalab repository created when using Datalab init script to create Dataproc cluster

I am creating a Dataproc cluster that uses Datalab. To initialize Datalab on the Dataproc cluster, I am creating the cluster from the cloud console and pointing to the datalab.sh initialization script from Google's dataproc-initialization-actions…
charlesreid1 • 4,360 • 4 • 30 • 52

3 votes, 1 answer

BigQuery in Google Cloud Datalab - Get Query Text?

We have a working prototype in Google Cloud DataLab where we have made extensive use of the %%bq 'magics' to build up a parameterized query using various UDFs. So something like: customFn1 =…
Stewart_R • 13,764 • 11 • 60 • 106

3 votes, 3 answers

Read CSV file to Datalab from Google Cloud Storage and convert to pandas dataframe

I am trying to read a CSV file saved in Google Cloud Storage into a dataframe for analysis. I have followed these steps without success: mybucket = storage.Bucket('bucket-name') data_csv = mybucket.object('data.csv') df = pd.read_csv(data_csv) This doesn't work…
irkinosor • 766 • 12 • 26

3 votes, 1 answer

how to update a datalab to a new version

I have a message that a newer version of Datalab is available: Datalab 1.2.20170818. What would be the easiest way to update the current Datalab notebook to the newer version and run the code that is available with the updated…
eilalan • 669 • 4 • 20

3 votes, 3 answers

-bash: datalab: command not found

I installed the cloud datalab component by running gcloud components install datalab However, I keep getting the following error from my terminal when attempting to create a datalab instance: -bash: datalab: command not found When I run gcloud…
Ekaba Bisong • 2,918 • 2 • 23 • 38

3 votes, 2 answers

Google Datalab Autosave Failed. Output block max?

I am having a problem with the autosave failing in a Google datalab notebook. I am using the ssh shell on port 8081 from the console. I open the notebook. Make a few changes. Click save. Works fine. I run the code, make another change. Click…
Brian F • 107 • 1 • 7

3 votes, 0 answers

Kernel automatically dies when loading large data from GCS to Cloud Datalab

I'm uploading a 2 GB file from GCS (a .vec file containing words and their vector representations). I want to use this file with gensim (I already installed the library). I'm using an n1-highmem-2 (2 vCPUs, 13 GB memory) VM. The notebook…
3 votes, 1 answer

Failed to import large data as dataframe, from Google BigQuery to Google Cloud DataLab

I tried two approaches to import a large Google BigQuery table (about 50,000,000 rows, 18 GB) into a dataframe in Google Datalab, in order to do machine learning using TensorFlow. First, I used (all required modules are imported): data =…
Zrisound • 43 • 1 • 3

3 votes, 1 answer

NameError using LinearRegression() with python API

I'm trying to run some regressions in Earth Engine using Cloud Datalab. When I replicate the code in this regression tutorial (modifying it for Python), I get this error: NameError Traceback (most recent call last) in…
Andrew • 33 • 3