Questions tagged [mlops]

This tag is for programming questions about MLOps, which is the application of DevOps principles in the design and deployment of Machine Learning (ML) systems.

Related tags

  • mlflow
  • kubeflow
  • feature-store
228 questions
1
vote
1 answer

Hyperparameter tuning with wandb - CommError: Sweep user not valid when trying to initialize the sweep

I'm trying to use wandb for hyperparameter tuning as described in this notebook (but using my own dataframe and a random forest regressor instead). I'm trying to initialize the sweep, but I get the error: sweep_configuration = { …
Reut
1
vote
1 answer

Python Vetiver model - use alternative prediction method

I'm trying to use Vetiver to deploy an isolation forest model (for anomaly detection) to an API endpoint. All is going well by adapting the example here. However, when deployed, the endpoint uses the model.predict() method by default (which returns…
1
vote
1 answer

How do I handle job secrets with MLRun?

I have a job that requires secrets to connect to S3 and a relational database. I can use environment variables to pass the connection information, but I am looking for a more secure way to handle this. My current code does something like: import…
1
vote
1 answer

How do I re-run specific experiments in Iguazio?

I have a job that I ran in Iguazio a few weeks ago, and I want to re-run that exact same job (same inputs, params, etc.). Is there an easy way to do that without submitting code or using schedulers?
Brennan
1
vote
2 answers

How do I get rid of the HTTP error when I try to cml send comment?

This is the .yaml I'm using to run and output a comment with cml: name: CNN on: [push] jobs: run: runs-on: ubuntu-latest steps: - uses: actions/checkout@v2 - uses: actions/setup-python@v2 - uses: iterative/setup-cml@v1 -…
1
vote
1 answer

How to see results via `mlflow ui` for experiments logged on another server?

I was running ML experiments on an SSH server; the experiments were logged via mlflow and stored in the local mlruns directory on the server. The code was just basic usage of mlflow and looks like this: import numpy as np import mlflow import matplotlib.pyplot as…
Alaya
1
vote
1 answer

Add reserved tokens to `tft.vocabulary`

I would like to append words to the vocabulary created by tft.vocabulary that are not a part of the training samples (i.e. and tokens). I see in the docs that the tft.vocabulary function can take an argument key_fn which the docs…
1
vote
1 answer

AWS MLOps - Issue with SageMaker pipeline to deploy new version of model to existing endpoint

I have a problem using a SageMaker pipeline for MLOps. I followed this example, but it seems to cover only a one-time deployment. My project requires retraining the model weekly, and an error occurs when I retrain and deploy the model again. I…
1
vote
1 answer

How to increase the resolution of the CpuUtilization metric of an ECS cluster past the 1-minute mark?

I'm trying to create a robust autoscaling process for my ECS cluster but am facing problems with the resolution of the CpuUtilization metric. I have turned on 'Detailed metrics' for 1-min resolution, but am not able to achieve good scaling results. I am…
1
vote
1 answer

How to trigger an AWS SageMaker training job from a front-end application (via an API Gateway POST call)?

I have made a training job on AWS SageMaker and it runs well: it reads from an S3 location and stores model checkpoints in S3 as intended. Now, I need to trigger this training job with specified parameters (e.g. the S3 location of the data) from a…
1
vote
1 answer

Azure ML static Endpoint with dynamic model

Is it possible to deploy an endpoint with the latest model on Azure ML? I have a CI/CD pipeline on Azure DevOps that generates a new model, evaluates it against the latest model, and registers it if it is better. But the problem is…
MADFROST
1
vote
0 answers

How to install an ODBC driver in an Azure DevOps build pipeline to run a script on Azure ML compute?

I am running a Python script to connect to my Azure SQL database in a DevOps build pipeline. I am attaching the Azure ML compute to the script while running it from DevOps using the Azure CLI. But how can I install the ODBC driver on the Azure agent? or is…
Andy
1
vote
0 answers

Logging predict request/response in TensorFlow Serving

I'm currently using TensorFlow Serving via Docker to deploy an object detection model. Is there a way to log all predict requests/responses so I can save and view them locally for debugging? The input image comes in as a binary string. Is it possible with…
1
vote
0 answers

Triton Inference Server: Explicit model control

I need a little advice on deploying Triton Inference Server with explicit model control. From the looks of it, this mode gives the user the most control over which model goes live. But the problem I’m not able to solve is how to load models in case…
1
vote
0 answers

How to send a raw input image to an MLflow served endpoint for a CNN model?

I started learning MLflow recently and I am trying out the ML lifecycle on a simple cat-dog classifier. Here is the code. I have correctly set up the model for serving using mlflow serve. However, there is one problem. I have configured the model…