Questions tagged [h2o.ai]
52 questions
1
vote
2 answers
Are H2O.ai products affected by the Log4Shell vulnerability?
My question is whether open source H2O-3, open source Sparkling Water, and Driverless AI are affected by CVE-2021-44228 and CVE-2021-45046.

Michal
1
vote
1 answer
What is the default target metric that H2O models use for their predict() method? Can it be changed?
I am using an H2ORandomForestEstimator. What is the default target metric that H2O models use for their predict() method?
https://docs.h2o.ai/h2o/latest-stable/h2o-py/docs/modeling.html#h2o.automl.H2OAutoML.predict
Is there a way to set this? (E.g., to…

lampShadesDrifter
1
vote
1 answer
Can we set a window size in H2O AutoML for prediction on a time series dataset?
I am trying to do stock price prediction with a time series dataset using H2O AutoML. If I want to predict the next 3 days' closing prices using the previous 5 days of history, can I set this kind of window size in H2O AutoML?

Sakshi Verma
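
The windowing described in the question above (5 days of history, 3 days ahead) can be illustrated independently of any AutoML setting by reshaping the series into a supervised table before training. This is only a minimal sketch in plain Python: the helper name `make_windows` and its parameters are hypothetical, not part of the H2O API, and how the resulting rows are fed to a model is left open.

```python
from typing import List, Sequence, Tuple


def make_windows(
    prices: Sequence[float], history: int = 5, horizon: int = 3
) -> List[Tuple[List[float], List[float]]]:
    """Slide a window over `prices` (hypothetical helper, not an H2O function).

    Each row pairs `history` consecutive values (the features) with the
    `horizon` values that immediately follow them (the targets).
    """
    if history < 1 or horizon < 1:
        raise ValueError("history and horizon must both be at least 1")

    rows = []
    # Last index at which a full history + horizon window still fits.
    last_start = len(prices) - history - horizon
    for start in range(last_start + 1):
        features = list(prices[start:start + history])
        targets = list(prices[start + history:start + history + horizon])
        rows.append((features, targets))
    return rows


# Illustrative closing prices (made-up numbers, not real market data).
closes = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0, 10.0]
windows = make_windows(closes, history=5, horizon=3)
```

Each feature list could become lag columns and each target list could become one response column per forecast day, typically with one model trained per response column. A series shorter than `history + horizon` simply yields no rows.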
1
vote
1 answer
What parameters in an EC2 virtual machine should I use to optimize H2O's XGBoost performance?
I'm trying to run H2O XGBoost on an r4.8xlarge instance, but it's taking too long to run (15+ hours, as opposed to 4 hours for GBM with the same hyperparameter grid size).
Knowing that XGBoost uses cache optimization, is there any particular instance type that works…

my_cse lab
1
vote
2 answers
ModuleNotFoundError: No module named 'h2oaicore'
I am following the Driverless AI tutorial, Driverless AI Standalone Python Scoring Pipeline, which you can find at the following link:
http://docs.h2o.ai/driverless-ai/latest-stable/docs/userguide/scoring-standalone-python.html#tar-method-py
I am…

Eri
1
vote
1 answer
GLRM in H2O - Performance Metrics return NaN
I'm using a generalized low-rank estimator to infer missing values in a data set of sensor readings. I'm using H2O to create and train the model:
glrm = H2OGeneralizedLowRankEstimator(k=10,
…

Luisa Nogueira
1
vote
1 answer
How to access H2O Flow when using Google Colab
Does anyone know how to access H2O Flow when using Google Colab?
My code is as follows:
!pip install H2O
import h2o
h2o.init(bind_to_localhost=False, log_dir="./")
from google.colab.output import…

Kohei TAMURA
1
vote
2 answers
Get accuracy metric from leaderboard function when using H2O AI
I am running a binary classification model using H2O AutoML. I have explicitly told AutoML to treat this as a classification model with the following line of code.
# This line of code turns our int variable into a factor.
# This is necessary to tell…

Lihka_nonem
1
vote
2 answers
A question about parallelism in the h2o.grid() function
I am trying to use the h2o.grid() function from the h2o package to do some tuning in R. When I set the parameter parallelism larger than 1, it always shows the warning
Some models were not built due to a failure, for more details run…

Kim.L
0
votes
1 answer
Facing issues with H2O MOJO deployment using Sparkling Water on Google Dataproc
I am trying to migrate a model from Hadoop to GCP. The model MOJO will not be retrained. I am running the model in Dataproc using Airflow spark submit.
The source data format matches the Hadoop source and everything. While running the model, I am getting…

trougc
0
votes
0 answers
Chat with spreadsheet using meta-llama/Llama-2-13b-chat-hf
I made a spreadsheet that contains around 2000 question-answer pairs and use the meta-llama/Llama-2-13b-chat-hf model. But when I start querying the spreadsheet using this model, it gives wrong answers most of the time and also repeats it many…

aiwesee
0
votes
1 answer
H2O cluster - generate the values in a tabular format for PDP plots
I am using the H2O cluster to train a model using a tuned random forest and plot PDP plots using the code below
tx = tuned_rf.explain(test_data, top_n_features=5, include_explanations='pdp')
This is returning pdp plots in graphical format the graph…

rqqa
0
votes
1 answer
Error after installing the top2vec package, particularly in H2O Notebooks
After installing the Python top2vec package in H2O Notebooks (!pip install top2vec), I get the following error when importing top2vec:
import top2vec
ValueError: numpy.ndarray size changed, may indicate binary incompatibility. Expected 96 from C…

Sam S.
0
votes
1 answer
Why does my training confusion matrix in H2O AutoML only show 10k total cases instead of 200k?
I am currently using H2O AutoML to train a model on a binary classification problem. I have train (70%, ~200k rows), valid (10%, ~30k rows), test (10%, ~30k rows) and blend (10%, ~30k rows) datasets, all coming from the time-sensitive splitting of the…

Guest6117
0
votes
1 answer
Get Stage Results from H2O Sparkling Water model
I am looking to create a confidence interval for one of my model's outputs and I need to get the model outputs before the link function is applied. From what I've read, it seems like I am interested in getting the stage results of the model.
So far…

omoshiroiii