Questions tagged [h2o]

Use this tag for questions about the H2O in-memory machine learning platform. Where relevant, add language tags like [r], [python], [scala], or [java].

Best Practices

Always post a Minimal, Complete and Verifiable Example (MCVE) and provide the H2O version number and client type (Python, R, Flow, etc).

If your question is not code related, do not post to Stack Overflow (per Stack Overflow guidelines). If your question is algorithm related, post to Cross-Validated on Stack Exchange using the "h2o" tag. All other questions can be posted to the h2ostream Google group (please do not double-post).

Resources

1875 questions
5
votes
1 answer

Implementing custom stopping metrics to optimize during training in H2O model directly from R

I'm trying to implement the FBeta_Score() of the MLmetrics R package: FBeta_Score <- function(y_true, y_pred, positive = NULL, beta = 1) { Confusion_DF <- ConfusionDF(y_pred, y_true) if (is.null(positive) == TRUE) positive <-…
FR_
  • 147
  • 9
5
votes
2 answers

Implementing a decision tree using h2o

I am trying to train a decision tree model using h2o. I am aware that no specific library for decision trees exist in h2o. But, h2o has an implemtation of random forest H2ORandomForestEstimator . Can we implement a decision tree in h2o by tuning…
5
votes
2 answers

Python/H2o : Plot tree/Extract rules from H2ORandomForestEstimator model

Is there any "simple" way to plot trees from an H2O random forest model. I am also interestred in extracting the resulting rules ?
Ala Ham
  • 169
  • 1
  • 7
5
votes
2 answers

Merging Tree Models from two random forest models into one random forest model at H2O in R

I am relatively new to the machine learning ocean, please excuse me if some of my questions are really basic. Current situation: The overall goal was trying to improve some code for h2o package in r running on the supercomputer cluster. However,…
windsound
  • 706
  • 4
  • 9
  • 31
5
votes
2 answers

How to handle a skewed response in H2O algorithms

In my problem dataset response variable is extremely skewed to the left. I have tried to fit the model with h2o.randomForest() and h2o.gbm() as below. I can give tune min_split_improvement and min_rows to avoid overfitting in these two cases. But…
deepAgrawal
  • 673
  • 1
  • 7
  • 25
5
votes
1 answer

H2O importFile skiping rows

How can I skip rows while importing files?? Skip is not an available option on the h2o.importFile comand. I have used the library h2o and the command: h2o.importFile()
Jesus
  • 462
  • 6
  • 13
5
votes
4 answers

How to directly plot ROC of h2o model object in R

My apologies if I'm missing something obvious. I've been thoroughly enjoying working with h2o in the last few days using R interface. I would like to evaluate my model, say a random forest, by plotting an ROC. The documentation seems to suggest that…
Ophiothrix
  • 791
  • 1
  • 8
  • 11
5
votes
1 answer

H2O Python - how to get variable types, getTypes equivalent

What is the Python equivalent of getTypes in R? I'm trying to extract the variable types for each column from H2O data frame (enum, string, int etc.) Also, broadly can someone send me a link to some documentation listing all the properties and…
thasainta
  • 93
  • 1
  • 5
5
votes
3 answers

Error while using h2o.init in R

This is the error message: > h2o.init() Error in dirname(path) : path too long In addition: There were 12 warnings (use warnings() to see them) This is one of the warning messages (the others are similar): > warnings() Warning messages: …
John McClain
  • 51
  • 1
  • 2
5
votes
3 answers

I have downloaded a pojo from h2o, compiled it, but how do I use it?

I'm using the following sample code to download a pojo that I found from this post: import h2o h2o.init() iris_df = h2o.import_file("https://s3.amazonaws.com/h2o-public-test-data/smalldata/iris/iris.csv") from h2o.estimators.glm import…
Toenuff
  • 71
  • 1
  • 5
5
votes
1 answer

Is there a way to use saved model between different versions of H2O?

I have saved a trained model (deep net, but it is more general I think) in H2O. Now I want to load it by another instance of H2O and use it for scoring, but the problem is, that the version of H2O used for training (3.10.0.3) was different than the…
JaKu
  • 1,096
  • 16
  • 29
5
votes
4 answers

h2o implementation in R

I am learning h2o package now, I installed h2o package from CRAN and couln't run this code ## To import small iris data file from H\ :sub:`2`\ O's package irisPath = system.file("extdata", "iris.csv", package="h2o") iris.hex =…
varun
  • 71
  • 1
  • 4
5
votes
1 answer

Multi node cluster installation with h2o on AWS EC2

I was wondering about how to set up a h2o cluster using multiple AWS EC2 instances and R-Studio. I am not a computer scientist, so sorry for the trivial questions (!) Based on this tutorial (http://amunategui.github.io/h2o-on-aws/) I sucessfully…
constiii
  • 638
  • 3
  • 19
5
votes
3 answers

how to fit and score a machine learning models in Java/JVM based application

Could you please guide me on how to create and execute a machine learning models/statistical models (regression, Decision tree, K means clustering, Naive bayes, scorecard/linear/logistic regression etc. and GBM, GLM ) in Java/JVM based application…
Gaurav Gupta
  • 104
  • 2
  • 7
5
votes
2 answers

How do know how many deep learning epochs were done, from R?

Early stopping is turned on by default for h2o.deeplearning(). But, from R, how do I find out if it did stop early, and how many epochs it did? I've tried this: model = h2o.deeplearning(...) print(model) which tells me information on the layers,…
Darren Cook
  • 27,837
  • 13
  • 117
  • 217