Questions tagged [weka]

Weka (Waikato Environment for Knowledge Analysis) is an open source machine learning library written in Java.

Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization. It is also well-suited for developing new machine learning schemes.

Weka is open source software issued under the GNU General Public License.

Weka's main user interface is the Explorer, but essentially the same functionality can be accessed through the component-based Knowledge Flow interface and from the command line. There is also the Experimenter, which allows the systematic comparison of the predictive performance of Weka's machine learning algorithms on a collection of datasets.

The Explorer interface features several panels providing access to the main components of the workbench:

  • The Preprocess panel has facilities for importing data from a database, a CSV file, etc., and for preprocessing this data using a so-called filtering algorithm. These filters can be used to transform the data (e.g., turning numeric attributes into discrete ones) and make it possible to delete instances and attributes according to specific criteria.
  • The Classify panel enables the user to apply classification and regression algorithms (both simply called classifiers in Weka) to the resulting dataset, to estimate the accuracy of the resulting predictive model, and to visualize erroneous predictions, ROC curves, etc., or the model itself (if the model is amenable to visualization, as a decision tree is).
  • The Associate panel provides access to association rule learners that attempt to identify all important interrelationships between attributes in the data.
  • The Cluster panel gives access to the clustering techniques in Weka, e.g., the simple k-means algorithm. There is also an implementation of the expectation maximization algorithm for learning a mixture of normal distributions.
  • The Select attributes panel provides algorithms for identifying the most predictive attributes in a dataset.
  • The Visualize panel shows a scatter plot matrix, where individual scatter plots can be selected and enlarged, and analyzed further using various selection operators.
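
As an illustration of the kind of transformation a Preprocess filter performs (turning a numeric attribute into a discrete one), here is a minimal plain-Java sketch of equal-width binning. It does not use the Weka API and is not Weka's own filter code; the class name `EqualWidthDiscretizer`, the method `discretize`, and the sample values are hypothetical and chosen only for this example.

```java
import java.util.Arrays;

// Plain-Java illustration of equal-width binning; NOT Weka's filter implementation.
final class EqualWidthDiscretizer {

    /** Maps each value to a bin index in [0, numBins - 1] using equal-width intervals. */
    static int[] discretize(double[] values, int numBins) {
        if (values.length == 0 || numBins < 1) {
            throw new IllegalArgumentException("need at least one value and one bin");
        }
        double min = Arrays.stream(values).min().getAsDouble();
        double max = Arrays.stream(values).max().getAsDouble();
        double width = (max - min) / numBins;

        int[] bins = new int[values.length];
        for (int i = 0; i < values.length; i++) {
            if (width == 0.0) {
                bins[i] = 0; // all values are identical, so everything lands in one bin
            } else {
                int index = (int) ((values[i] - min) / width);
                bins[i] = Math.min(index, numBins - 1); // the maximum value goes into the last bin
            }
        }
        return bins;
    }

    public static void main(String[] args) {
        // Hypothetical numeric attribute values.
        double[] temperatures = {64.0, 65.0, 68.0, 70.0, 72.0, 75.0, 80.0, 85.0};
        System.out.println(Arrays.toString(discretize(temperatures, 3)));
    }
}
```

Each bin index can then be treated as a nominal value. Equal-width binning is only one possible strategy; equal-frequency or supervised discretization would assign the bins differently.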

Online Resources:

Use Weka in your Java Code

Weka on Sourceforge

Weka on GitHub

3033 questions
0 votes, 0 answers

Problems when uploading a csv file to weka

My original file has the .xlsx extension, and I use Excel to save it as .csv (comma-delimited). The conversion itself works without problems, but when I try to load the file into Weka I follow the…
0 votes, 0 answers

What is the used method that calculate "F-measure" ? Micro or Macro?

In the Weka Experimenter, which method is used to calculate the "F-measure": micro or macro averaging? How can I calculate it from the precision and recall results in the destination file (CSV)?
0 votes, 0 answers

WEKA pattern investigation with Apriori , I don't get results

I want to extract knowledge from the competition data KDD Cup 1999. This dataset includes records about connections to the Internet, where each connection refers to an attack or a normal usage. The dataset contains 41 attributes and 4.898.431…
Thanos
0 votes, 0 answers

Testing output of Weka decision tree does not agree with the trained model's logic

I trained a decision tree model using the REPTree classifier in Weka. The model it produces is fairly simple: Has Pool? = FALSE | Spike Count < 4.5 | | Early/Pump Demand < 0.94 : 0 (265/3) [128/1] | | Early/Pump Demand >= 0.94 | | | …
0 votes, 0 answers

Parameters that denote the dynamic range, symmetry and slope of the sigmoid function respectively in software WEKA

I need the three parameters that denote the dynamic range, symmetry, and slope of the sigmoid function, respectively, in Weka. I've already searched books, papers, and manuals without success. I want to implement a neural network and I need…
0 votes, 0 answers

How to fix weka error java.lang.reflect.InvocationTargetException

When I try to launch Weka 3.8.6, it does not start; instead a Weka Error message pops up: java.lang.reflect.InvocationTargetException
Ismaeel
0 votes, 0 answers

Pcap4j return null packets header depending on runtime environment

I'm currently developing a contextual data generator for Weka, and I package it as a Weka plugin (using the templates here: https://github.com/bnjmn/weka/tree/master/packages/templates). Part of my plugin reads data from a pcap file, so I'm…
PierreBls
0 votes, 1 answer

How do I upload my arff file to WEKA without seeing 'premature end of line, read Token[EOL] line 22'

This error message appears when I try to load a dataset I got from Twitter using snscrape: 'premature end of line, read Token[EOL] line 22' How do I resolve it? I've tried converting the file, but it keeps showing the same error. I…
eGoodluck
0 votes, 0 answers

How to generalize the SMOTE algorithm using Weka?

I am working on an imbalanced dataset, and I use SMOTE to balance the data. I build my model using Weka. I want to make SMOTE part of my model, so that when I apply the model to another imbalanced dataset, the model can increase the number of…
Muneera
0 votes, 1 answer

NullPointerException when importing a package in Weka

I am developing a plugin for Weka in Java, and I'm using the weka/packages/templates to compile and create the package with ant. It looks like Weka uses the weka.core.packageManagement.DefaultPackage class while importing the packages from the GUI…
0 votes, 0 answers

upload data set in weka (Problem setting)

I am trying to load my dataset into Weka, but I get the error "problem setting base instances". Does anyone know how to solve this problem?
0 votes, 1 answer

Getting "Field 'dwell time' is not a java.lang.Number" error when retrieving data from Firebase Firestore

I'm working on an Android project that uses Firebase Firestore to store data. I'm trying to retrieve a document from the Firestore database and get the value of a field called "dwell time" which is a number, but I keep getting the following…
0 votes, 1 answer

Is Weka's JRIP updateable?

I have a large dataset and I am using Weka with the Python wrapper to train the RIPPER algorithm. Can I use Weka's updateable interface for incrementally training the JRIP algorithm (RIPPER)? How can I train the large dataset with WEKA's python…
Kathiravan Natarajan
0 votes, 1 answer

Missing labels in tokenized data set

I am implementing sentiment analysis, and when I tokenized my dataset.arff to use it as training data, I noticed that the "Positive" label is missing. Head of the dataset.arff file: relation SentimentAnalysis @attribute text…
0 votes, 1 answer

What is the stable version of Weka for Windows 7 32-bit?

I run a laptop with Windows 7 32-bit, Java 8 update 333, and Java(TM) 6 Update 7. I have tried several versions of Weka, but each one has a problem: version 3.9.3 and version 3.8.3: Logistic Regression classifier does not work + I face this error message…
Muneera