Questions tagged [rapidminer]

RapidMiner is an environment for machine learning, data mining, text mining, predictive analytics, and business analytics. RapidMiner is written in Java and it was open source. There is an open source version and an enterprise version with additional features and an API to write own extensions.

RapidMiner is an environment for machine learning, data mining, text mining, predictive analytics, and business analytics. RapidMiner is written in Java and, from version 6.5, provides an open source version as well as an enterprise version with additional features.

511 questions
1
vote
0 answers

Comparing RapidMiner models with x-validation

I am working in some forecasting models with RapidMiner and need some orientation to interpret the outputs and select the best among them. I am following some tutorials to check their accuracy with x-validation, and I am getting results…
1
vote
0 answers

Rapidminer studio and java 8

I have this simple code, which works in RapidMiner 5 with Java 6, and also works with RapidMiner Studio with Java 7. The problem is with Java 8, it displays an error that the requested repository doesn't exist! public void LoadDataProcess(String…
Sam Faith
  • 175
  • 5
1
vote
1 answer

Twitter data topical classification

So I have a data set which consists of tweets from various news organizations. I've loaded it into RapidMiner, tokenized it, and produced some n-grams of it. Now I want to be able to have RapidMiner automatically classify my data into various…
TomP
  • 133
  • 1
  • 2
  • 12
1
vote
0 answers

Using RapidMiner Textprocessing plugin in Java - Not able to use 'Document' object in the code

I am using RapidMiner 5. I want to make a text preprocessing module to use with a categorization system. I created a process in RapidMiner with these steps. Tokenize Transform Case Stemming Filtering stopwords Generating n-grams I want to write a…
pnv
  • 1,437
  • 3
  • 23
  • 52
1
vote
1 answer

K-medioids with Dynamic Time Warping in RapidMiner

How to perform K-medioids clustering with Dynamic Time Warping as a distance measure in RapidMiner? The idea with Dynamic Time Warping is to perform it on time series of different length. How can I do that in RapidMiner? I get this error message…
Kobe-Wan Kenobi
  • 3,694
  • 2
  • 40
  • 67
1
vote
2 answers

automatically determine number of clusters k-means

I want to build a cluster model in rapid miner that can define the number of clusters automatically and then continue to the k-means algorithm. Is there any way for determine k of clustering automatically in rapid miner?
Nervin
  • 41
  • 1
  • 6
1
vote
1 answer

Rapidminer - Out of memory when working on large datasets

In Rapidminer v.5.3013 I want to achieve the following: Read 15 million records from a database table - only one attribute but with up to 4096 characters Regex replacements on that data set Classification according to Naive Bayes Write the result…
royskatt
  • 1,190
  • 2
  • 15
  • 35
1
vote
1 answer

Add New attribute with default value in Rapidminer

I am very new to this tool "Rapid miner".What i want to know is how to add a new attribute to one data set with default value in rapid-miner. I tried with using "Generate attributes" but how to set a default value to the new attribute. Do i have to…
solidshadow
  • 63
  • 1
  • 6
1
vote
1 answer

RapidMiner: How can I efficiently generate n-grams?

For my thesis, I'm trying to perform sentiment analysis on larger (up to 2GB) files of text documents (product reviews) using RapidMiner Community Edition with the Text Mining extension. I'd like to generate bigrams for that. But for a small 70MB…
punischdude
  • 219
  • 2
  • 11
1
vote
1 answer

Association Rules in Rapid Miner

I wanted to ask this question in RapidMiner Community but after 2 days, still no answer. It might be an easy question for you. I want to find a meaningful relation between a couple of column values. the table is like this: SiteID , Number Of 2MB…
Rman Edv
  • 163
  • 1
  • 17
1
vote
1 answer

R extension on RapidMiner

I have installed rJava in R and the next step is to configure RapidMiner such that it can incorporate the R extension...The variables are set as: R_HOME=/home/.../R-2.12.2 JAVA_HOME=/usr/lib/jvm/java-7-openjdk-amd64/jre I have exported both of these…
Bokha
  • 121
  • 1
  • 6
1
vote
2 answers

Integration of Rapidminer with Java: Obtaining the output Example Set (Process Result)

I want to execute a Rapidminer process from Java to use the output ExampleSet (Process Result) for subsequent operations (with Java). I managed the process execution with the code below, but I don't have a clue how to obtain the process Result…
Gerardo A.
  • 13
  • 5
1
vote
0 answers

Naive Bayes Results not Generating in RapidMiner

I'm running a Naive Bayes process in RapidMiner on Fisher's Iris dataset. My main process is as follows: Retrieve Iris, Set Role, Validation The Validation subprocess is as follows: Training Set: Naive Bayes; Test Set: Apply Model,…
1
vote
1 answer

join all text from nodes xpath

Hello I have some html file:

text in p2

text in p4

and other are like:

text in p1

text in p3

My query is:…
Jimmysnn
  • 583
  • 4
  • 8
  • 30
1
vote
1 answer

RapidMiner FP-growth operator not returning any results

i'm running into a problem with the fp-growth operator in rapidminer. i'm processing about 20 text files that are all in all <1MB in size. i used the process documents operator and within that tokenize, filter stop words, transform cases, generate…