Questions tagged [clearml]

48 questions
1
vote
1 answer

ModuleNotFoundError: No module named 'allegroai'

I am trying to use ClearML. The only line in my file is from allegroai import Dataset, DatasetVersion which yields ModuleNotFoundError: No module named 'allegroai' Looks like some pip package is missing, but I couldn't for the life of me find it…
Gulzar
  • 23,452
  • 27
  • 113
  • 201
1
vote
0 answers

clearml hyperparameter optimization task fails on creating python environment

Trying to use clearml HyperparametersOptimizers. Launching task, but each experiment fails while trying to create python environment. Specificaly the error I get is related to pytorch clearml_agent: Warning: could not resolve python wheel…
1
vote
2 answers

Mounting an S3 bucket in docker in a clearml agent

What is the best practice for mounting an S3 container inside a docker image that will be using as a ClearML agent? I can think of 3 solutions, but have been unable to get any to work currently: Use prefabbed configuration in ClearML, specifically…
johnml1135
  • 4,519
  • 3
  • 23
  • 18
1
vote
1 answer

ClearML Web UI custom column not persistent

I'm using the experiments page of a project in ClearML Web UI to visualize some custom metrics. Therefore I've customized my table vie (https://allegro.ai/clearml/docs/docs/webapp/webapp_exp_table.html?highlight=customize#adding-metrics) But…
1
vote
1 answer

ClearML multiple tasks in single script changes logged value names

I trained multiple models with different configuration for a custom hyperparameter search. I use pytorch_lightning and its logging (TensorboardLogger). When running my training script after Task.init() ClearML auto-creates a Task and connects the…
1
vote
1 answer

ClearML SSH port forwarding fileserver not available in WEB Ui

Trying to use clearml-server on own Ubuntu 18.04.5 with SSH Port Forwarding and not beeing able to see my debug samples. My setup: ClearML server on hostA SSH Tunnel connections to access Web App from working machine via localhost:18080 Web App:…
1
vote
1 answer

Trains: reusing previous task id

I am using reuse_last_task_id=True to overwrite an existing task (with same project and task name). But the experiment contains the torch model and therefore does not overwrite the existing task but creates a new one. How can I detach the model from…
kyc12
  • 349
  • 2
  • 15
1
vote
3 answers

pip install trains fails

upon running pip install trains in my virtual env I am getting ERROR: Command errored out with exit status 1: command: /home/epdadmin/noam/code/venv_linux/bin/python -u -c 'import sys, setuptools, tokenize; sys.argv[0] =…
Gulzar
  • 23,452
  • 27
  • 113
  • 201
1
vote
1 answer

Will Trains automagically log Tensorboard HParams?

I know that it's possible to send hyper-params as a dictionary to Trains. But can it also automagically log hyper-params that are logged using the TF2 HParams module? Edit: This is done in the HParams tutorial using hp.hparams(hparams).
Michael Litvin
  • 3,976
  • 1
  • 34
  • 40
1
vote
1 answer

Can Trains config file be specified dynamically or relative to the running script path?

Suppose I have a server where many users run different experiments, possibly with different Trains Servers. I know about the TRAINS_CONFIG_FILE environment variable, but I wonder if this can be made more flexible in one of the following…
Michael Litvin
  • 3,976
  • 1
  • 34
  • 40
1
vote
2 answers

trains with grid search

I would like to test trains usage during grid search and it not clear how to do so. from trains import Task Task.init(project_name="project name", task_name='name') creates an experiment in the demo server and logs all but you can't call init…
Sefi Erlich
  • 161
  • 10
1
vote
1 answer

How to Backup/Restore TRAINS-server when moving from AMI to local machine

I recently started using TRAINS, with the server in AWS AMI. We are currently using v0.9.0. I would like to move the TRAINS-server to run on our on-premises kubernetes cluster. However, I don't want to lose the data on the current server in AWS…
Oren
  • 31
  • 1
  • 5
1
vote
1 answer

How to manually register a sci-kit model with TRAINS python auto-magical experiment manager?

I'm working mostly with scikit-learn, as far as I understand, the TRAINS auto-magic doesn't catch scikit-learn model store/load automatically. How do I manually register the model after I have 'pickled' it. For Example: import pickle with…
Yael
  • 11
  • 1
1
vote
1 answer

Is there a way to create a graph comparing hyper-parameters vs model accuracy with TRAINS python package?

I would like to run multiple experiments, then report model accuracy per experiment. I'm training a toy MNIST example with pytorch (v1.1.0), but the goal is, once I can compare performance for the toy problem, to have it integrated with the actual…
fidlr
  • 81
  • 5
0
votes
1 answer

Replacing IPs in MongoDB

I have a nested schema for mongoDB collection. It looks smth like this: { "_id":"61d99bf5544f4822bd963bda0a9c213b", "execution": { "test_split":0, "artifacts":{ "9ed39_output": { "uri":…