Questions tagged [h2o.ai]

52 questions
1
vote
2 answers

Are H2O.ai products affected by log4shell vulnerability?

My question is if Open Source H2O-3, Open Source Sparkling Water and Driverless AI are affected by CVE-2021-44228 and CVE-2021-45046.
Michal
  • 437
  • 3
  • 8
1
vote
1 answer

What is the default target metric that H2O models use for their predict() method? Can change?

I am using a H2ORandomForestEsimator. What is the default target metric that H2O models use for their predict() method? https://docs.h2o.ai/h2o/latest-stable/h2o-py/docs/modeling.html#h2o.automl.H2OAutoML.predict Is there a way to set this? (Eg. to…
lampShadesDrifter
  • 3,925
  • 8
  • 40
  • 102
1
vote
1 answer

Can we set window size in h2o automl for prediction of time series dataset?

I am trying to do stock price prediction with time series dataset using h2o automl. If I want to predict next 3 days closing price using previous 5 days history, can I set this kind of window size in h2o automl?
1
vote
1 answer

What parameters in a EC2 virtual machine should I use to optimize H2O's XGBoost performance?

I'm trying to run H2O xgboost on r4.8x large. But it's taking too long to run (15+ hrs as opposed to 4 hours for GBM with same hyperparameter grid size). Knowing that XGBoost uses cache optimization, is there any particular instance type that works…
1
vote
2 answers

ModuleNotFoundError: No module named 'h2oaicore'

I am following the tutorial of driverless: Driverless AI Standalone Python Scoring Pipeline, you can check it in the following link: http://docs.h2o.ai/driverless-ai/latest-stable/docs/userguide/scoring-standalone-python.html#tar-method-py I am…
Eri
  • 11
  • 2
1
vote
1 answer

GLRM in H2O - Performance Metrics return NaN

I'm using a generalized low-rank estimator to infer missing values in a data set regarding sensor readings. I'm using H2O to create and train the model: glrm = H2OGeneralizedLowRankEstimator(k=10, …
1
vote
1 answer

How to access H2O Flow when using Google Colab

Does anyone know how to access H2O Flow when using Google Colab? My code is as follows: !pip install H2O import h2o h2o.init(bind_to_localhost=False, log_dir="./") from google.colab.output import…
Kohei TAMURA
  • 4,970
  • 7
  • 25
  • 49
1
vote
2 answers

Get accuracy metric from leaderboard function when using H20 AI

I am running a binary classification model using H2O autoML. I have explicitly told autoML to treat this as a classification model with the following line of code. # This line of code turns our int variable into a factor. # This is necessary to tell…
Lihka_nonem
  • 352
  • 1
  • 8
1
vote
2 answers

A question about the parallelism in h2o.grid() function

I try to use the h2o.grid() function from the h2o package to do some tuning using R, when I set the parameter parallelism larger then 1, it always shows the warning Some models were not built due to a failure, for more details run…
Kim.L
  • 121
  • 10
0
votes
1 answer

Facing issues with H2O MOJO deployment using sparkling water on Google Dataproc

I am trying to migrate a model from Hadoop to GCP. Model MOJO will not be retrained. I am running the model in Dataproc using Airflow spark submit. Source data format matches with Hadoop source and everything. While running the model, I am getting…
trougc
  • 329
  • 3
  • 14
0
votes
0 answers

Chat with spreadsheet using meta-llama/Llama-2-13b-chat-hf

I made a spreadsheet which contain around 2000 question-answer pair and use meta-llama/Llama-2-13b-chat-hf model. But when start querying through the spreadsheet using the above model it gives wrong answers most of the time & also repeat it many…
0
votes
1 answer

H20 cluster - generate the values in a tabular format for pdp plots

I am using the h20 cluster to train the model using tuned random forest and plot pdp plots using the below code tx = tuned_rf.explain(test_data, top_n_features=5, include_explanations='pdp') This is returning pdp plots in graphical format the graph…
rqqa
  • 35
  • 6
0
votes
1 answer

Installing top2vec package, particularly in H2O Notebooks, and the error

After installing python top2vec package in H2O notebooks (!pip install top2vec), I am getting the following error when importing top2vec: import top2vec ValueError: numpy.ndarray size changed, may indicate binary incompatibility. Expected 96 from C…
Sam S.
  • 627
  • 1
  • 7
  • 23
0
votes
1 answer

Why does my training confusion matrix in h2o AutoML only shows 10k total cases instead of 200k

I am currently using h2o autoML to train a model on a binary classification problem. I have a train (70% ~200k rows), valid (10% ~30k rows), test (10% ~30k rows) and blend (10% ~30k rows) datasets all coming from the time sensitive splitting of the…
Guest6117
  • 13
  • 3
0
votes
1 answer

Get Stage Results from H2O Sparkling Water model

I am looking to create a confidence interval for one of my model's outputs and I need to get the model outputs before the link function is applied. From what I've read, it seems like I am interested in getting the stage results of the model. So far…