Google Cloud ML Engine is a managed, scalable service for building and training machine learning models in the cloud. It is part of the Google Cloud suite of products, which includes TensorFlow, storage, BigQuery, Dataflow and others.
Questions tagged [google-cloud-ml-engine]
185 questions
1
vote
1 answer
How to increase the count of ML training units (25 by default) on Google Cloud?
There is a limit on ML training units on Google Cloud Platform. By default, it is 25. I need to increase it to 100. As you may know, to use scale-tier PREMIUM_1 you need to have at least 75 ML units.
Concurrent number of ML training units:…

Oleg Dats
- 3,933
- 9
- 38
- 61
1
vote
1 answer
Export a model with tf Estimator and export_savedmodel function
I'm building a Deep Neural Network regressor with TensorFlow based on this tutorial. When I try to save the model with tf.estimator export_savedmodel I get the following error:
raise ValueError('Feature {} is not in features…

Marc
- 165
- 3
- 13
1
vote
2 answers
FAILED_PRECONDITION: Error: SavedModel directory gs://mybucket1/ is expected contain exactly one of [saved_model.pb, saved_model.pbtxt]
I'm trying to use Google Cloud Platform to deploy a model to support prediction.
I train the model (locally) with the following command
~/$ gcloud ml-engine local train --module-name trainer.task --package-path trainer
and everything works…
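
The error in the title says the SavedModel directory must contain exactly one of saved_model.pb or saved_model.pbtxt. As a rough illustration of that rule, here is a minimal sketch that checks a local export directory before it is copied to a bucket. The function names are hypothetical, and it only inspects the local filesystem; checking a gs:// path would need a separate tool.

```python
import os

# File names taken from the error message in the question title.
SAVED_MODEL_FILES = ("saved_model.pb", "saved_model.pbtxt")


def find_saved_model_files(export_dir):
    """Return the SavedModel graph files present directly in export_dir."""
    return [
        name
        for name in SAVED_MODEL_FILES
        if os.path.isfile(os.path.join(export_dir, name))
    ]


def is_valid_saved_model_dir(export_dir):
    """True only when exactly one of the expected files is present."""
    return len(find_saved_model_files(export_dir)) == 1
```

If the check fails on the directory you are pointing at, the exported model may be sitting in a subdirectory rather than at the path given to the deployment command.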

Marc
- 165
- 3
- 13
1
vote
1 answer
issue submitting jobs to google cloud
I would like to train the object detection model on Google Cloud, following this post.
Then I followed the instructions at https://github.com/tensorflow/models/blob/master/object_detection/g3doc/running_on_cloud.md
I tried to submit a new job. And…

sean
- 49
- 1
- 8
1
vote
1 answer
Providing decryption key with gcloud jobs submit training
I have successfully trained my first network with Google Cloud ML Engine, and now I am trying to make the setup a bit more secure by providing my own encryption key for encrypting the data. As explained in the manual I have now copied my data to…

jthread
- 13
- 2
1
vote
1 answer
ML Engine Experiment eval tf.summary.scalar not displaying in tensorboard
I am trying to output some summary scalars in an ML Engine experiment at both train and eval time. tf.summary.scalar('loss', loss) is correctly outputting the summary scalars for both training and evaluation on the same plot in TensorBoard.…

reese0106
- 2,011
- 2
- 16
- 46
1
vote
2 answers
eval_metric_ops display under family on tensorboard
With scalar summaries in tensorflow 1.3 I can now define a 'family' that will help to group the relevant metrics under the same tensorboard tab. For example if I have two summary scalars defined below:
precision = precision(labels,…

reese0106
- 2,011
- 2
- 16
- 46
1
vote
1 answer
add custom loss to eval_metric_ops
I have defined my own loss function using sequence_loss
loss = tf.contrib.legacy_seq2seq.sequence_loss(logits, labels, weights)
I am hoping to add this to eval_metric_ops so that in my ML engine package I can display the evaluation loss in…

reese0106
- 2,011
- 2
- 16
- 46
1
vote
2 answers
tensorflow rnn_decoder perform softmax on each decoder_output
I tried to write my own estimator model_fn() for a GCP ML Engine package. I decoded a sequence of outputs using embedding_rnn_decoder as shown below:
outputs, state = tf.contrib.legacy_seq2seq.embedding_rnn_decoder(
decoder_inputs =…

reese0106
- 2,011
- 2
- 16
- 46
1
vote
1 answer
TensorFlow embedding_rnn_decoder 'Tensor' object is not iterable
I am trying to construct a custom estimator for my ML Engine package and I seem to be having trouble properly constructing my decoder input sequence in the correct format. Consider the following where label1, label2 is supposed to be a sequence of…

reese0106
- 2,011
- 2
- 16
- 46
1
vote
1 answer
Google Cloud ML Engine "Skipping evaluation due to same checkpoint"
I have an ML Engine package based on the census tutorial and I am trying to perform evaluation every N steps using the --min-eval-frequency flag, but I keep getting the message in Stackdriver logs saying: "Skipping evaluation due to same…

reese0106
- 2,011
- 2
- 16
- 46
1
vote
1 answer
Google Cloud ML Engine + Tensorflow perform preprocessing/tokenization in input_fn()
I want to perform basic preprocessing and tokenization within my input function. My data is contained in CSV files in a Google Cloud Storage bucket location (gs://) that I cannot modify. Further, I want to perform any modifications on input text within my…

reese0106
- 2,011
- 2
- 16
- 46
1
vote
1 answer
How to package vocabulary file for Cloud ML Engine
I have a .txt file which contains a different label on each line. I use this file to create a label index lookup file, for example:
label_index = tf.contrib.lookup.index_table_from_file(vocabulary_file = 'labels.txt'
I am wondering how I should…
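
For readers unfamiliar with the idea of a label index built from a one-label-per-line file, here is a minimal plain-Python sketch. It is not the tf.contrib lookup table from the question and does not address packaging; the function name is hypothetical, and it assumes the line position is used as the label's index.

```python
def load_label_index(vocabulary_file):
    """Map each label to its zero-based line position in the file.

    Assumes one label per line; blank lines are skipped.
    """
    with open(vocabulary_file, encoding="utf-8") as f:
        labels = [line.strip() for line in f if line.strip()]
    return {label: index for index, label in enumerate(labels)}
```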

reese0106
- 2,011
- 2
- 16
- 46
1
vote
0 answers
Google Cloud ML Engine - Job failed due to an internal error. Can't execute a job
This is an ML job I previously trained successfully, but when I tried it today it did not work.
After that I tried removing everything in the bucket and starting over. It is still not working and gives the following error.
Internal error…

Shamane Siriwardhana
- 3,951
- 6
- 33
- 73
1
vote
3 answers
Cloud ML Engine batch predictions - How to simply match returned predictions with input data?
According to the ML Engine documentation, an instance key is required to match the returned predictions with the input data. For simplicity, I would like to use a DNNClassifier, but canned estimators apparently don't support instance…
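
To make the role of an instance key concrete, here is a minimal sketch of joining predictions back to their inputs once both carry a key. It assumes each prediction record echoes the key of the instance it came from; the field name "key" and the function name are hypothetical, and nothing here is specific to DNNClassifier or the ML Engine API.

```python
def join_predictions_by_key(instances, predictions, key_field="key"):
    """Pair each prediction with the input instance that shares its key.

    Predictions may arrive in any order; the key makes order irrelevant.
    """
    inputs_by_key = {instance[key_field]: instance for instance in instances}
    joined = []
    for prediction in predictions:
        key = prediction[key_field]
        joined.append({"input": inputs_by_key[key], "prediction": prediction})
    return joined
```

Without such a key the only way to pair outputs with inputs is by position, which is not safe if the results come back in a different order than the inputs were submitted.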

Fabrice
- 21
- 2