Questions tagged [fselector]

FSelector is an R package that finds the most informative attributes in a data set with respect to a classification target.

FSelector is an R package that provides functions for selecting attributes from a given dataset. Attribute subset selection is the process of identifying and removing as much of the irrelevant and redundant information as possible. More information on the package's CRAN page.

18 questions
5
votes
1 answer

Feature selection in document-feature matrix by using chi-squared test

I am doing texting mining using natural language processing. I used quanteda package to generate a document-feature matrix (dfm). Now I want to do feature selection using a chi-square test. I know there were already a lot of people asked this…
3
votes
0 answers

Error: protect(): protection stack overflow while feature extraction

I have a dataframe that has 4755 rows and 27199 columns. It's actually a document term matrix and I'm trying to perform feature selection using the "FSelector" package. Here is some of the code below: library(FSelector) weights <-…
anonymous
  • 405
  • 8
  • 22
2
votes
1 answer

A duplicate bin range was detected. Try increasing the bin range precision

A little help wit this error ? I get it when i am running CFS in R Runnin code: best_features<- cfs(Target~.,df) where df is the dataset and Target and best_features are self explanatory. Error: Error in .jcall("weka/filters/Filter",…
2
votes
0 answers

Chi-squared feature selection using Fselector in R

I am a beginner in R and I have a data frame that has binary values in it. In my data frame, the first 6000 columns are the attributes I am going to select features from, and the last 10 columns (again binary) are the classes I need to train my data…
Diana01
  • 183
  • 1
  • 1
  • 10
2
votes
1 answer

Invalid prediction for "rpart" object Error

I am using the exact code for best first search from page 4 of this CRAN document (https://cran.r-project.org/web/packages/FSelector/FSelector.pdf), which uses the iris dataset. It works just fine on the iris dataset, but does not work on my ow…
Ashley A Holmes
  • 69
  • 3
  • 5
  • 10
2
votes
1 answer

Use of formula in information.gain in R

In the function definition for the FSelector information.gain function, information.gain(formula, data) what exactly is the purpose of the formula? I'm trying to use the function to do feature selection for a classification task. In the few examples…
1
vote
1 answer

Install FSelector in R Notebook Databricks

I'm trying to install the package FSelector in my R notebook in databricks and it's not working. I've already tried following this How should I deal with "package 'xxx' is not available (for R version x.y.z)" warning? blogpost but nothing…
user228812
  • 17
  • 4
1
vote
0 answers

R library(FSelector) failed to run due to java error

I have been trying to get FSelector from MLR package working recently, but have been running into the same java issue on my mac: Error: package or namespace load failed for ‘FSelector’: .onLoad failed in loadNamespace() for 'rJava', details: …
lydias
  • 841
  • 1
  • 14
  • 32
1
vote
1 answer

Error after deleting NA values twice, first by using pandas library second by R

First I deleted NA values using the following Python code: import pandas as pd a = pd.read_csv("true.csv",low_memory=False) #print a b = pd.read_csv("false.csv",low_memory=False) merged = a.append(b,…
1
vote
1 answer

How to do feature selection on SparseMatrix matrix in R

I have text classification problem with over 20k features, 3m objects, and over 3k classes. Data is very sparse. I wrote the program on R. Data matrix in sparseMatrix object. How can I select features on this data? I found package FSelector, but it…
Osm
  • 81
  • 4
1
vote
1 answer

Unable to use FSelector package in R on mac OS X El Capitan

I'm using OS X El Capitan Version 10.11.3 now. When I tried to use library(FSelector) in R, errors pop out saying: JavaVM: requested Java version ((null)) not available. Using Java at "" instead. JavaVM: Failed to load JVM:…
Yan
  • 499
  • 6
  • 20
0
votes
0 answers

Error about loading the ‘FSelector’ package

what does the below error mean? Error: package or namespace load failed for ‘FSelector’: .onLoad failed in loadNamespace() for 'RWeka', details: call: new_Weka_package_manager() error: java.lang.UnsupportedClassVersionError:…
somayeh A
  • 1
  • 1
0
votes
1 answer

Install R package - FSELCETOR on SUSElinux server

I have an account to Suse linux server, without root access. I have installed an R , and now I want to use FSELECTOR package. I have started with this steps: $ whereis java java: /usr/bin/java /usr/lib64/java /etc/java /usr/share/java…
0
votes
0 answers

Unable to run information.gain

I am trying to run information.gain on 15 million data .The code I have used: library(FSelector) weights <- information.gain(is_opened~.,df) but I am getting this error. Error in .jnew("java/lang/String", k) : java.lang.OutOfMemoryError: GC…
Prajna
  • 129
  • 1
  • 8
0
votes
1 answer

How to select only the best features by setting up the threshold using FSelector information gain in R language?

I have done information gain feature selection in R by using FSelector package in R install.packages("RWekajars") install.packages("FSelector") library(FSelector) weights <- information.gain(Classname~., df) Attributes …
Lin
  • 1
  • 1
1
2