Questions tagged [google-cloud-ml-engine]

Google Cloud ML Engine is a managed, scalable service for building and training machine learning models in the cloud. It is part of the Google Cloud suite of products and works alongside TensorFlow, storage, BigQuery, Dataflow and others.

185 questions
0
votes
3 answers

Google Machine Learning API Issue

I'm trying to use the Google Machine Learning API and I'm facing two problems. In the API explorer I put the correct information and I get a response error: Code 200 "error": "Missing \"instances\" field in request body: {\n \"httpBody\":…
RMH
  • 222
  • 1
  • 11
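
The error quoted in the excerpt above says the request body has no top-level "instances" field. A minimal sketch of building such a body, assuming a JSON online prediction request; the helper name `build_predict_body` and the input key `x` are hypothetical, and the real keys depend on the deployed model's input signature:

```python
import json


def build_predict_body(instances):
    """Wrap input rows in a top-level "instances" list and serialize to JSON.

    `instances` must be a list with one entry per prediction input.
    """
    if not isinstance(instances, list):
        raise TypeError("instances must be a list of inputs")
    return json.dumps({"instances": instances})


# Hypothetical inputs: the key "x" is an assumption for illustration only.
body = build_predict_body([{"x": [1.0, 2.0]}, {"x": [3.0, 4.0]}])
print(body)
```

The string in `body` is what would be sent as the raw request body; nesting it inside another field (such as the "httpBody" seen in the error text) would hide the "instances" key from the service.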
0
votes
1 answer

TensorFlow 1.0.1 SavedModelBuilder

I'm currently exploring how to deploy models on Google ML Engine. At first, I developed a model using TensorFlow 1.1.0, as it was the latest version available (at the time of asking). However, it turned out that the highest supported…
0
votes
1 answer

Internal Error encountered while invoking CloudML Predict via REST

I have my application running in GAE. This application makes a REST call to my CloudML. Here is the code for that: GoogleCredential credential = GoogleCredential.getApplicationDefault() …
sag
  • 5,333
  • 8
  • 54
  • 91
0
votes
1 answer

Job not generating /export directory

I'm following the guide to deploy a model having previously generated the job: $ gcloud ml-engine jobs submit training testX --job-dir="gs://testxxx/run1" --package-path=trainer --module-name=trainer.task --region us-central1 …
Neurus
  • 657
  • 4
  • 27
0
votes
1 answer

Permission denied error while running code on Google Cloud ML

I am getting an error while trying to read a file within the code: File isn't open for reading") PermissionDeniedError: File isn't open for reading. The file is located within the trainer folder on the Google Cloud Shell. This is what I have given to…
Appu
  • 83
  • 9
0
votes
2 answers

scipy.misc.imresize not working in GCP ml-engine

I'm trying to submit the following toy snippet as a job in GCP ml-engine: import tensorflow as tf import numpy as np import scipy.misc x = np.zeros([10, 10, 1]) y = scipy.misc.imresize(x[:, :, 0], [50, 50, 1],…
Willian Mitsuda
  • 1,249
  • 1
  • 12
  • 14
0
votes
1 answer

How to modify the TensorFlow "census" example with LABEL_COLUMN as a continuous base column?

In the TensorFlow census example, the LABEL_COLUMN (income_bracket) has predefined values [' <=50K', ' >50K']. It is a categorical base column. 1) How can I modify model.py so that LABEL_COLUMN is a continuous base column with float values? 2)…
0
votes
1 answer

Is GPU efficient on parameter server for data parallel training?

In data-parallel training, I guess a GPU instance is not necessarily efficient for parameter servers, because parameter servers only hold the values and don't run any computation such as matrix multiplication. Therefore, I think the example config…
0
votes
1 answer

Error importing dependencies on Google Cloud ML

While submitting a job on Google Cloud ML, I am getting an error where the main training Python file, i.e. task.py, is not able to import a function from a Python script in the util folder. Generally, we write: from util.xyz import abc, but this is not getting…
Appu
  • 83
  • 9
0
votes
1 answer

Google Cloud ML Engine GPUs error

I've created several jobs for training a CNN using Google Cloud ML Engine; each time, the job finished successfully but with a GPU error. The printed device placement included some GPU activity, but there was no GPU usage in job details/utilization. Here is the…
Fei
  • 23
  • 4
0
votes
1 answer

Distribute data between workers on Google Cloud ML

I have been able to run code on distributed Google Cloud ML, but when I run it the data gets replicated on each machine within the cluster, whereas I want to distribute the data across the machines. How can I distribute data on each machine within the…
Appu
  • 83
  • 9
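
One common way to avoid every machine reading the same data is to give each task a disjoint slice of the input files. A minimal sketch, assuming the cluster layout is available as JSON in the `TF_CONFIG` environment variable with `cluster` and `task` entries; the function name `shard_for_this_task` and the file names are hypothetical:

```python
import json
import os


def shard_for_this_task(files, tf_config=None):
    """Return the subset of `files` that the current task should read.

    Master and worker tasks each get a disjoint, round-robin slice.
    Any other task (for example a parameter server, or a local run
    without TF_CONFIG) gets the full list back unchanged.
    """
    if tf_config is None:
        tf_config = json.loads(os.environ.get("TF_CONFIG", "{}"))
    cluster = tf_config.get("cluster", {})
    task = tf_config.get("task", {})

    # Enumerate the tasks that read data, in a fixed order.
    readers = [("master", i) for i in range(len(cluster.get("master", [])))]
    readers += [("worker", i) for i in range(len(cluster.get("worker", [])))]

    me = (task.get("type"), task.get("index", 0))
    if me not in readers:
        return list(files)

    position = readers.index(me)
    # Sort first so every task slices the same ordering.
    return sorted(files)[position::len(readers)]


# Hypothetical cluster: one master, two workers, one parameter server.
example_config = {
    "cluster": {
        "master": ["m:2222"],
        "worker": ["w0:2222", "w1:2222"],
        "ps": ["p:2222"],
    },
    "task": {"type": "worker", "index": 0},
}
files = ["part-%d.csv" % i for i in range(6)]
my_files = shard_for_this_task(files, example_config)
print(my_files)
```

Slicing by file keeps the logic independent of the input pipeline; it only balances well when the files are of similar size and there are at least as many files as reading tasks.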
0
votes
1 answer

gcloud ML-engine long stalls when using multiprocessing or multithreading during training

I am trying to train a convolutional model for computer vision in the google ml-engine on a BASIC_GPU tiered instance but the training is stalling for up to an hour at seemingly random time intervals. As can be seen in this picture of the cost…
0
votes
3 answers

Run TensorFlow code in distributed mode on Google Cloud ML

Does anybody know what changes need to be made to the trainer in order to run a job on the distributed platform on Google Cloud ML? It would be of great help if somebody could share a few articles or docs about this.
0
votes
1 answer

Creates package but no export

My job completes with no error. The logs show "accuracy", "auc", and other statistical measures of my model. ML-engine creates a package subdirectory, and a tar under that, as expected. But, there's no export directory, checkpoint, eval, graph or…
0
votes
1 answer

--region versus --regions argument in cloudml-samples/flowers/sample.sh

In Google's sample code found at cloudml-samples/flowers/sample.sh, between lines 66 and 69, is the argument "region": # Tell CloudML about a new type of model coming. Think of a "model" here as # a namespace for deployed Tensorflow graphs. gcloud…
Chuck Finley
  • 250
  • 1
  • 10