Questions tagged [h2o]

Use this tag for questions about the H2O in-memory machine learning platform. Where relevant, add language tags like [r], [python], [scala], or [java].

Best Practices

Always post a Minimal, Complete and Verifiable Example (MCVE) and provide the H2O version number and client type (Python, R, Flow, etc).

If your question is not code related, do not post to Stack Overflow (per Stack Overflow guidelines). If your question is algorithm related, post to Cross-Validated on Stack Exchange using the "h2o" tag. All other questions can be posted to the h2ostream Google group (please do not double-post).

Resources

1875 questions
0
votes
1 answer

Is there any way to get conditional permutation importance from h2o.gbm?

How to compute conditional permutation importance from h2o.gbm? I have a data set with many highly correlated variables(>0.9). And fed this data set to h2o.gbm. As it turned out, RMSE increases (on CV) when I drop down correlated variables. Now I'm…
0
votes
1 answer

Create partial dependence plot using H2O in spark?

I am trying to create partial dependent plot using the following code rf_pdp = rf_model .partial_plot(data = htest, cols = ['var1', 'var2', 'var3'], plot=True) rf_pdp it runs without error and generate a table with mean_response, stddev_response,…
Gavin
  • 1,411
  • 5
  • 18
  • 31
0
votes
1 answer

How to make H2OGridSearch for H2OGradientBoostingEstimator repeatable (Reproducibility) in spark environment?

I am using the following code to run GBM in Sparkling Water. I have set up the seed and score_each_iteration, but every time, it still generates different results when I check the AUC even though I have set the seed and…
Gavin
  • 1,411
  • 5
  • 18
  • 31
0
votes
1 answer

H2O.GeneralizedLowRankModel objective is NA when passing loss by column

I am working with h2o glrm function. When I am trying to pass loss_by_col argument in order to specify different loss function for each column in my DataFrame (I have normal, poisson and binomial variables, so I am passing "Quadratic", "Poisson" and…
0
votes
1 answer

Predicting probabilities with H2O flow

Is it possible to predict probabilities in a binary classification task a in H2O flow? In particular I am finding difficulties in camputing the probability instead a crisp prediction because I can't see option in the UI of H2O when predicting. If…
Luca Pedretti
  • 31
  • 1
  • 2
0
votes
0 answers

GBT algorithm using H2O 3.8.2.6 in R and Rapidminer

I tried tuning parameters for GBM H2O in R using, https://github.com/h2oai/h2o-3/blob/3.10.0.7/h2o-docs/src/product/tutorials/gbm/gbmTuning.Rmd I then tried applying the tuned hyper-parameters in Rapidminer for the same data set. In R I got accuracy…
Kitooos
  • 37
  • 7
0
votes
1 answer

Is H2O DAI's MLI display menu dependent on the algorithms used in its experiments?

I see H2O DAI picks up the optimized algorithm for the dataset automatically. I hear that the contents of MLI (machine learning interpretation) from other platforms (like SAS Viya) is dependent on the algorithm it uses. For example, LOCO is not…
nasica88
  • 1,185
  • 10
  • 10
0
votes
1 answer

Can we see the list of algorithms DriverlessAI uses?

I am a newby in H2O DAI, and I think it's wonderful. I've run several experiments with small sample CSV data, and most of the time I see GLM and GBM are used. Can we see the full list of all algorithms provided with H2O DAI ? I see the algorithms…
nasica88
  • 1,185
  • 10
  • 10
0
votes
2 answers

Visualize trees in H2O XGBoost model

I was looking at this answer to visualize the gradient boosting tree model in H2O, it says the method on GBM can be applied to XGBoost as well: Finding contribution by each feature into making particular prediction by h2o ensemble model…
Zhirui Wang
  • 174
  • 1
  • 8
0
votes
2 answers

Spyder Kernel Dying

Let me first start by saying that this post is related to many on this site and that I am simply posting this for posterity and helping to add evidence as to why this particular error occurs. So I have a script that I run that pulls in data to a…
0
votes
2 answers

What are the column definitions for H2O's gains/lift table?

H2O's documentation doesn't provide clear definitions for each column in the gains/lift table output. I'm not sure how the capture rate is being calculated, and there is a score column that is not mentioned in the documentation. Here's what the…
coys
  • 13
  • 3
0
votes
0 answers

H2O server init crashing on a different environment

I have 2 python virtual environments. One is a POC env and has minimal code. Second one is with the application code base. The POC works fine. While I try to run with the application code and environment, the h2o.init() command fails. Following is…
Roopak A Nelliat
  • 2,009
  • 3
  • 19
  • 26
0
votes
1 answer

Cannot import .csv file as H2O dataframe

I am trying to import my 3000 observation & 77 features .csv file as H2O dataframe (while I am on a Spark session): (1st way) # Convert pandas dataframe to H2O dataframe import h2o h2o.init() data_train =…
Outcast
  • 4,967
  • 5
  • 44
  • 99
0
votes
0 answers

H2O GBM MaxDepth Finding for classification(categorical)

I am trying to find the best max_depth value using the following code library(h2o) h2o.init() # import the titanic dataset df <- h2o.importFile(path =…
Kitooos
  • 37
  • 7
0
votes
0 answers

H2O.ai implementation on EMR cluster

I am having trouble in deploying h2o.ai in a cluster in EMR. I am trying to installing with flatfile but it seems to be probing some issues in communicating with each other Require help in this.
1 2 3
99
100