Questions tagged [h2o]

Use this tag for questions about the H2O in-memory machine learning platform. Where relevant, add language tags like [r], [python], [scala], or [java].

Best Practices

Always post a Minimal, Complete and Verifiable Example (MCVE) and provide the H2O version number and client type (Python, R, Flow, etc).

If your question is not code related, do not post to Stack Overflow (per Stack Overflow guidelines). If your question is algorithm related, post to Cross-Validated on Stack Exchange using the "h2o" tag. All other questions can be posted to the h2ostream Google group (please do not double-post).

Resources

1875 questions
10
votes
1 answer

Start multiple h2o cluster from within R

My intention is to start two or more h2o clusters / instances (not two or more nodes!) from within R on the same computer/server to enable multiple user to connect with h2o at the same time. In addition, I want to be able to shutdown and restart…
constiii
  • 638
  • 3
  • 19
10
votes
5 answers

how to save/load a trained model in H2o?

The user tutorial says Navigate to Data > View All Choose to filter by the model key Hit Save Model Input for path: /data/h2o-training/... Hit Submit The problem is that I do not have this menu (H2o, 3.0.0.26, web interface)
Alex Lizz
  • 425
  • 1
  • 8
  • 19
9
votes
1 answer

What is the measure used for "importance" in the h2o random Forest

Here is my code: set.seed(1) #Boruta on the HouseVotes84 data from mlbench library(mlbench) #has HouseVotes84 data library(h2o) #has rf #spin up h2o myh20 <- h2o.init(nthreads = -1) #read in data, throw some away data(HouseVotes84) hvo <-…
EngrStudent
  • 1,924
  • 31
  • 46
9
votes
5 answers

Unable to convert data frame to h2o object

I am running the h2o package in Rstudio Version 0.99.447. I run version 10.9.5 OSX. I would like to set up a local cluster within R, following the steps of this tutorial:…
Boudewijn Aasman
  • 1,236
  • 1
  • 13
  • 20
8
votes
3 answers

Calculating MAPE in H2o: Error: Provided column type POSIXct is unknown

Following my answered question: R or Python - loop the test data - Prediction validation next 24 hours (96 values each day) I want to predict the next day using H2o Package. You can find detail explanation for my dataset in the same above link. The…
King Julien
  • 159
  • 17
8
votes
1 answer

How to suppress "Build Progress" bar when training an h2o model?

I'm tuning my parameters by testing many models, and I'm fairly annoyed that I can't do much about the "Build Progress" bars that are cluttering up my iPython Notebook. I've skimmed the docs looking for some sort of "verbose" setting to turn off,…
James Kelleher
  • 1,957
  • 3
  • 18
  • 34
8
votes
3 answers

R h2o load a saved model from disk in MOJO or POJO format

I'm catching up on h2o's MOJO and POJO model format. I'm able to save a model in MOJO/POJO with h2o.download_mojo(model, path = "/media/somewhere/tmp") # ok h2o.download_pojo(model, path = "/media/somewhere/tmp") # ok which writes an object with…
horaceT
  • 621
  • 13
  • 26
8
votes
3 answers

How to allow h2o to access all available memory?

I am running h2o through Rstudio Server on a linux server with 64 GB of RAM. When I initialize the cluster it says that the total cluster memory is only 9.78 GB. I have tried using the max_mem_size parameter but still only using 9.78 GB. localH2O…
mikew
  • 347
  • 6
  • 12
8
votes
1 answer

Difference between spark with h2o and sparkling water

I have a few questions or doubts on sparkling water and why is it needed. Lets assume that I have a generated h2o model with both binary and pojo. Now I want to deploy the model into production and have an option for using pojo and binary…
Lalit Agarwal
  • 2,354
  • 1
  • 14
  • 18
8
votes
2 answers

How to convert a column in H2OFrame to a python list?

I've read the PythonBooklet.pdf by H2O.ai and the python API documentation, but still can't find a clean way to do this. I know I can do either of the following: Convert H2OFrame to Spark DataFrame and do a flatMap + collect or collect + list…
BlueFeet
  • 2,407
  • 4
  • 21
  • 24
8
votes
1 answer

Transforming h2o model into non-h2o one

I know that there is possibility to export/import h2o model, that was previously trained. My question is - is there a way to transform h2o model to a non-h2o one (that just works in plain R)? I mean that I don't want to launch the h2o environment…
Andrzej Pisarek
  • 271
  • 3
  • 9
8
votes
3 answers

H2O R api: retrieving optimal model from grid search

I'm using the h2o package (v 3.6.0) in R, and I've built a grid search model. Now, I'm trying to access the model which minimizes MSE on the validation set. In python's sklearn, this is easily achievable when using RandomizedSearchCV: ## Pseudo…
TayTay
  • 6,882
  • 4
  • 44
  • 65
8
votes
1 answer

Loading data bigger than the memory size in h2o

I am experimenting with loading data bigger than the memory size in h2o. H2o blog mentions: A note on Bigger Data and GC: We do a user-mode swap-to-disk when the Java heap gets too full, i.e., you’re using more Big Data than physical DRAM. We won’t…
talegari
  • 343
  • 2
  • 9
8
votes
1 answer

Subsetting in H2O R

I have a h2o object. The standard R for subset sub1<-trans[trans$Type==1,] I tried the same in h2o. It is not working sub1<-trans[trans$Type==1,] I also tried sub1<-h2o.exec(trans[trans$Type==1,]) note* trans is a h2o data Object. Any idea to do…
chee.work.stuff
  • 326
  • 2
  • 14
7
votes
1 answer

Using Hyper-parameters from H2O to re-build XGBoost in Sklearn gives Difference Performance in Python

After using H2O Python Module AutoML, it is found that XGBoost is on the top of the Leaderboard. Then what I was trying to do is to extract the hyper-parameters from the H2O XGBoost and replicate it in the XGBoost Sklearn API. However, the…
B. Sun
  • 143
  • 3
  • 11