Questions tagged [google-cloud-ml-engine]

Google Cloud ML Engine is a managed, scalable service that enables building and training of machine learning models in the cloud as part of the Google Cloud suite of products: including TensorFlow, storage, BigQuery, Dataflow and others.

185 questions
2
votes
1 answer

Tensorflow: Resize Image Placeholder

I have a trained TF model that that operates on a serialized (TFRecord) input. The image data has variable shape and is converted to a 229x229x3 shape via tf.image.resize_images(...). I would like to use the gcloud ml-engine predict platform similar…
2
votes
1 answer

cloud ml predict FAILED_PRECONDITION and value for field [projectsId]

The predict command I have been using for quite some time began encountering an error this weekend: $ gcloud beta ml predict --model my_model --json-instances my_instance.json WARNING: The `gcloud beta ml` commands have been renamed and will soon…
Chuck Finley
  • 250
  • 1
  • 10
2
votes
1 answer

Tensorflow Serving - "No versions of servable" message for model trained with tf.contrib.learn.Experiment

I've trained a model using the Getting Started tutorial from Google Cloud ML Engine as a reference. I could manage to deploy and serve this model on Google Cloud ML without problems. Now I'm trying to serve it using Tensorflow Serving but I'm…
2
votes
1 answer

Figuring out bottleneck in Cloud ML training

I'm trying to train DCGAN on celebA. The training process seems to be going very slowly, and I want to figure out what is causing the bottleneck. The time it would take to train for one epoch seems to be very similar when I use scale tier…
2
votes
1 answer

How do I change the Signatures of my SavedModel without retraining the model?

I just finished training my model only to find out that I exported a model for serving that had problems with the signatures. How do I update them? (One common problem is setting the wrong shape for CloudML Engine).
2
votes
2 answers

Migrating from Google Cloud ML beta to Google Cloud Machine Learning Engine

Google announced beta version of a new machine learning environment today. Can someone update me on where Google CloudML stands? The docs seemed to have just changed overnight. I'm guessing that commands that were gcloud beta ml are now gcloud…
bw4sz
  • 2,237
  • 2
  • 29
  • 53
1
vote
1 answer

TensorFlow retrain label_image.py (Simple image classification with Inception)

For labelling objects, If I use TensorFlow's label_image.py like: $ python label_image.py --image=1.jpg --graph=output_inference_graph.pb/frozen_inference_graph.pb --labels=test.txt Error: KeyError: "The name 'final_result:0' refers to a Tensor…
1
vote
1 answer

My TensorFlow training is slow. How do I profile to find the hotspots?

Sometimes jobs run slowly and it would be nice to be able to profile them as they run to find hotspots. How can that be down in TensorFlow, and especially in Google Cloud Machine Learning Engine?
rhaertel80
  • 8,254
  • 1
  • 31
  • 47
1
vote
1 answer

Export a KMeans model using export_savedmodel to deploy on ml-engine

I'm doing a K-means clustering using tensorflow.contrib.learn.KMeansClustering. I can use it default model to predict local but since I want to use ml-engine online prediction, I must export it to a export_savedmodel format. I have google lot's of…
1
vote
1 answer

How to create an "exact match" eval_metric_op for TensorFlow?

I am trying the create an eval_metric_op function that will display the proportion of exact matches at a given threshold for a multi-label classification problem. The following function returns 0 (no exact match) or 1 (exact match) based on the…
reese0106
  • 2,011
  • 2
  • 16
  • 46
1
vote
2 answers

Train Tensorflow Model in Google Cloud ML Engine in Python

can i train a Tensorflow Model in Google cloud machine learning Engine in python without using the commande Line : `gcloud ml-engine jobs submit training $JOB_NAME \ --job-dir $OUTPUT_PATH \ --runtime-version 1.2 \ …
1
vote
1 answer

Google Cloud ML Engine can't locate local TFRecords

I am trying to use Google Cloud ML Engine to optimize hyperparameters for my variational autoencoder model, but the job fails because the .tfrecord files I specify for my input are not found. In my model code, I pass train.tfrecords to my input…
1
vote
2 answers

Trainer module not found in Google Cloud ML Engine

I am trying to tune my variational autoencoder's hyperparameters using Google Cloud ML Engine. I set up my package with the structure they recommend in the docs, so that I specify "trainer.task" as my main module name. Below is an image of my…
1
vote
1 answer

Error when following google object detection API tutorial

While following the tutorial in here, I've reached the stage of training the model on the cloud. Unfortunately, the command gcloud ml-engine jobs submit training `whoami`_object_detection_`date +%s` \ --job-dir=gs://${YOUR_GCS_BUCKET}/train \ …
1
vote
1 answer

"Cannot infer num from shape" when exporting tensorflow model

I'm trying to export a tensorflow model like feature_spec = { 'words': tf.FixedLenSequenceFeature([], tf.int64, allow_missing=True) } def serving_input_receiver_fn(): """Build the serving inputs.""" serialized_tf_example =…