Questions tagged [h2o]

Use this tag for questions about the H2O in-memory machine learning platform. Where relevant, add language tags like [r], [python], [scala], or [java].

Best Practices

Always post a Minimal, Complete and Verifiable Example (MCVE) and provide the H2O version number and client type (Python, R, Flow, etc).

If your question is not code related, do not post to Stack Overflow (per Stack Overflow guidelines). If your question is algorithm related, post to Cross-Validated on Stack Exchange using the "h2o" tag. All other questions can be posted to the h2ostream Google group (please do not double-post).

Resources

1875 questions
0
votes
1 answer

Use of h2o cluster for Neural Network Model

I'm trying to fit an ANN model to a dataset having 7 predictor variables and the response variable is a binary. I have converted all the required factor variables to numeric (If I am correct, this is a requirement) and the following error pops…
0
votes
1 answer

How to determine which label is considered the 'positive' class in H2O binary classifier?

Training a binary classifier using h2o.ai and would like to know which label is being considered to be the 'positive' class. This makes a difference since if have labels say, 'give cookie' and 'don't give cookie', and are trying to optimize to…
lampShadesDrifter
  • 3,925
  • 8
  • 40
  • 102
0
votes
1 answer

Why I can't find ddply for h2o in python?

ddply is present on H2OFrame documentation. However I can't find it. I have the version 3.22.0.1 which I downloaded at here. The in the source code of this .whl I can't find ddply. However, in the documentation page, we see a link for the source…
Eduardo Reis
  • 1,691
  • 1
  • 22
  • 45
0
votes
1 answer

Memory Utilization R h2o

I'm working on the Titanic. However I've run into a problem where the model no longer runs but instead throws back an error that seems to have something to do with memory allocation. Error: water.exceptions.H2OModelBuilderIllegalArgumentException:…
NelsonGon
  • 13,015
  • 7
  • 27
  • 57
0
votes
1 answer

Rstudio server on Azure Hdinsight R Packages not installing

I have provisioned an Azure HDInsight cluster type ML Services (R Server), operating system Linux, version ML Services 9.3 on Spark 2.2 with Java 8 HDI 3.6. I am able to login to Rstudio on the head node via SSH access and I ran the script from…
Kreitz Gigs
  • 369
  • 1
  • 9
0
votes
1 answer

h2o categorical_encoding understanding when to use and why

I'm trying to understand the pros/cons and when to use the various encoding options that are available to me in h2o with the parameter 'categorical_encoding'. It would be helpful if people could point out general rules of thumb on how to use…
runningbirds
  • 6,235
  • 13
  • 55
  • 94
0
votes
1 answer

How to filter a Group using lambda function in python H2O like pandas?

I am looking for doing something similar to filter, but with H2O data frame. I haven't find how. H2O Group By in python only has specific functions for processing groups, such as max, min, etc.
Eduardo Reis
  • 1,691
  • 1
  • 22
  • 45
0
votes
1 answer

How to create a H2O cluster on google cloud with static external IP?

Creating a new H2O-3 Cluster deployment in google cloud gives only 2 options for reserving an IP address: Ephemeral and None. Is it possible to create a h2o cluster with a static IP address. Using the "addresses" command in gcloud like: gcloud…
bespectacled
  • 2,801
  • 4
  • 25
  • 35
0
votes
1 answer

Keep Constant Columns h2o

I'm trying to implement a gradient boosting machine model using R's h2o package. However, the model keeps dropping a certain column that I know from other model build ups that this column is important. Warning message: In…
NelsonGon
  • 13,015
  • 7
  • 27
  • 57
0
votes
1 answer

h2o pojo on test data with extra columns than the model trained on and sometimes missing columns from the train dataset

I have created my model POJO, I have to keep my columns in same order with same datatype when generating predictions using Hive UDF? what is the cleanest way to ignore extra columns and add the columns which are present in train data set but not in…
Shubham
  • 1
  • 1
0
votes
2 answers

Importing a model into R, that was created in a newer version of H2O

I have H2O version 3.22.0.1 where I have created several models using Flow. I now want to import them into R (v 3.5.1) for further analysis. However, the version of the package for R is h2o_3.20.0.8 and I receive this error when running…
Robert Long
  • 5,722
  • 5
  • 29
  • 50
0
votes
1 answer

Loading Data Set using R h2o

I'm looking at Darren Cook's book and was trying to load the data set as follows. library(h2o) library(tidyverse) h2o.init() mydata<-"https:/raw.githubusercontent.com/DarrenCook/h2o//bk/data…
NelsonGon
  • 13,015
  • 7
  • 27
  • 57
0
votes
1 answer

using H2O flow XGboost model

It gives a regression prediction as continuous score with negative values, like -1.27544 < x < 6.68112. How I interpret the negatives?
Lana KH
  • 13
  • 1
  • 4
0
votes
1 answer

rsparkling as_h2o_frame does not work: java.lang.OutOfMemoryError: GC overhead limit exceeded

I first import a dataset from csv to Spark, do some transformation in Spark, and then try to convert it into H2O Frame. Here's my code: library(rsparkling) library(h2o) library(dplyr) library(sparklyr) sc <- spark_connect(master = "local") data <-…
Catiger3331
  • 611
  • 1
  • 6
  • 18
0
votes
1 answer

what is the file format when I run h2o.save_model to save model?

In H2O when I run h2o.save_model, it is easy to save model. But the file do not have extension. So what is the format of the file? I have read the http://docs.h2o.ai/h2o/latest-stable/h2o-docs/productionizing.html#about-pojo-mojo but it seem not…
Gavin
  • 1,411
  • 5
  • 18
  • 31