Google Cloud ML Engine is a managed, scalable service that enables building and training of machine learning models in the cloud as part of the Google Cloud suite of products: including TensorFlow, storage, BigQuery, Dataflow and others.
Questions tagged [google-cloud-ml-engine]
185 questions
0
votes
3 answers
Google Machine Learning API Issue
I'm trying to use the Google Machine Learning API and I'm facing two problems.
In the API explorer I put the correct information and I get a response error:
Code 200
"error": "Missing \"instances\" field in request body: {\n \"httpBody\":…

RMH
- 222
- 1
- 11
0
votes
1 answer
TensorFlow 1.0.1 SavedModelBuilder
I'm currently doing exploration on deploying models on Google ML Engine. At first, I developed a model using TensorFlow 1.1.0 as it's the latest version exist (by the time this question is asked). However, it turned out that the highest supported…

sokokaleb
- 115
- 7
0
votes
1 answer
Internal Error encountered while invoking CloudML Predict via REST
I have my application running in GAE. This application makes REST call to my CloudML.
Here is the code for that
GoogleCredential credential = GoogleCredential.getApplicationDefault()
…

sag
- 5,333
- 8
- 54
- 91
0
votes
1 answer
Job not generating /export directory
I'm following the guide to deploy a model having previously generated the job:
$ gcloud ml-engine jobs submit training testX
--job-dir="gs://testxxx/run1"
--package-path=trainer
--module-name=trainer.task
--region us-central1
…

Neurus
- 657
- 4
- 27
0
votes
1 answer
Permission denied error while running a code on google cloud ML
I am getting an error while trying to read a file within the code.
File isn't open for reading") PermissionDeniedError: File isn't open for reading
file is located within the trainer folder on the google cloud shell. This is what I have given to…

Appu
- 83
- 9
0
votes
2 answers
scipy.misc.imresize not working in GCP ml-engine
I'm trying to submit the following toy snippet as a job in GCP ml-engine:
import tensorflow as tf
import numpy as np
import scipy.misc
x = np.zeros([10, 10, 1])
y = scipy.misc.imresize(x[:, :, 0], [50, 50, 1],…

Willian Mitsuda
- 1,249
- 1
- 12
- 14
0
votes
1 answer
how to modify tensorflow example "census" with LABEL_COLUMN as continuous base column?
In the census example of tensorflow, The LABEL_COLUMN(income_bracket) has predefined values [' <=50K', ' >50K']. It is a Categorical base column.
1) How to modify the model.py where LABEL_COLUMN is a Continuous base column with float values?
2)…

Madhukar Mohanraju
- 2,793
- 11
- 28
0
votes
1 answer
Is GPU efficient on parameter server for data parallel training?
On data parallel training, I guess the GPU instance is not necessarily efficient for parameter servers because parameter servers only keep the values and don't run any computation such as matrix multiplication.
Therefore, I think the example config…

Shuhei Fujiwara
- 193
- 1
- 7
0
votes
1 answer
Error in importing dependencies on google cloud ML
While submittng a job on gogle cloud ML, I am getting an error where main training python file i.e. task.py is not able to import a function from a python script in util folder.
Generally, we write :
from util.xyz import abc
this is not getting…

Appu
- 83
- 9
0
votes
1 answer
Google Cloud ML Engine GPUs error
I've created several jobs for training CNN using Google Cloud ML Engine,
each time job finished successfully with GPU error. The printed device placement included some GPU activity, but there was no GPU usage in job details/utilization.
Here is the…

Fei
- 23
- 4
0
votes
1 answer
Distribute data between workers on google cloud ML
I have been able to run code in distributed google cloud ML but when I run it that data gets replicated on each machine within the cluster but I want to distribute the data on each machine.
How can i distribute data on each machine within the…

Appu
- 83
- 9
0
votes
1 answer
gcloud ML-engine long stalls when using multiprocessing or multithreading during training
I am trying to train a convolutional model for computer vision in the google ml-engine on a BASIC_GPU tiered instance but the training is stalling for up to an hour at seemingly random time intervals. As can be seen in this picture of the cost…

tijmen Verhulsdonck
- 356
- 3
- 4
0
votes
3 answers
Run a tensorflow code in distributed mode on google cloud ML
Does anybody know what changes need to be made to trainer in order to run a job on distributed platform on google cloud ML ?
It will of great help if somebody can share few articles or docs about the same.

Appu
- 83
- 9
0
votes
1 answer
Creates package but no export
My job completes with no error. The logs show "accuracy", "auc", and other statistical measures of my model. ML-engine creates a package subdirectory, and a tar under that, as expected. But, there's no export directory, checkpoint, eval, graph or…

Eric M Sheetz
- 25
- 6
0
votes
1 answer
--region versus --regions argument in cloudml-samples/flowers/sample.sh
In Google's sample code found at cloudml-samples/flowers/sample.sh, between lines 66 and 69, is the argument "region":
# Tell CloudML about a new type of model coming. Think of a "model" here as
# a namespace for deployed Tensorflow graphs.
gcloud…

Chuck Finley
- 250
- 1
- 10