Questions tagged [svmlight]

SVMLight is an implementation of a Support Vector Machine - supervised learning models with associated learning algorithms that analyze data and recognize patterns, used for classification and regression analysis.

The main features of the program are the the following:

  • fast optimization algorithm
    • working set selection based on steepest feasible descent
    • "shrinking" heuristic
    • caching of kernel evaluations
    • use of folding in the linear case
  • solves classification and regression problems. For multivariate and structured outputs use SVMstruct.
  • solves ranking problems (e. g. learning retrieval functions in STRIVER search engine).
  • computes XiAlpha-estimates of the error rate, the precision, and the recall
  • efficiently computes Leave-One-Out estimates of the error rate, the precision, and the recall
  • includes algorithm for approximately training large transductive SVMs (TSVMs) (see also Spectral Graph Transducer)
  • can train SVMs with cost models and example dependent costs
  • allows restarts from specified vector of dual variables
  • handles many thousands of support vectors
  • handles several hundred-thousands of training examples
  • supports standard kernel functions and lets you define your own
  • uses sparse vector representation
98 questions
1
vote
1 answer

Values greater than 1 in svm prediction file

I am using svm light to train a model for binary classification. Using the model, I tested some examples. I was surprised to see the output of the prediction file, it contains values greater than 1 as well as less than -1. I thought the range is…
Rakib
  • 7,435
  • 7
  • 29
  • 45
1
vote
2 answers

Python sklearn.datasets.dump_svmlight_file failed to output the right index of column

I want to execute SVM light and SVM rank, so I need to process my data into the format of SVM light. But I had a big problem.... My Python codes are below: import pandas as pd import numpy as np from sklearn.datasets import…
陳冠穎
  • 313
  • 1
  • 5
  • 10
1
vote
1 answer

Using machine learning to estimate likelihood of an even occurrence given a stream of data

I have a stream of data (e.g. 3D position) generating by a system which it looks like: (pos1, time1) (Pos2, time2) (pos3, time3) ... I want to use a machine learning technique to estimate the likelihood (or detect) of a particular event from given…
user1021110
  • 213
  • 1
  • 6
  • 13
1
vote
0 answers

OpenCV+PYTHON: HOGDescriptor readALTModel(filename)

does anybody know if there is a way to use the function readALTModel(filename) in the (cpp) HOGDescriptor class with python/opencv? Thanks!
1
vote
0 answers

How to extract support vectors from SVMLight model

I used SVMLight library to classify my dataset into some classes and now want to get support vectors from generated model. How can I extract SVs of each class from model?
Mahmood Kohansal
  • 1,021
  • 2
  • 16
  • 42
1
vote
1 answer

Python Svmlight module: How to pass training data to the learn function, when using the rank configuration

I am using the Svmlight package in python to train a SVM rank model. However, I cannot figure out a way to pass the training data to the learn function. My python source code is as follows: import svmlight trainingDat = open('train.dat','r') model…
Sarin
  • 197
  • 3
  • 13
1
vote
1 answer

SVM-rank query-level features

I wonder how query-level features (such as term count in query) are useful? Because, query-level features are ignored while generating model file. Train file; 3 qid:1 1:2 2:1 3:0 4:0.2 5:0 2 qid:1 1:2 2:0 3:1 4:0.1 5:1 1 qid:1 1:2 2:1 3:0 4:0.4…
user1766006
  • 177
  • 1
  • 14
1
vote
0 answers

Modeling features of Relation Extraction in the SVMlight input format

I am currently working on a project that focuses on relation extraction from a corpus of Wikipedia text, and I plan to use an SVM to extract these relations. To model this, I plan to use Word features, POS Tag features, Entity features, Mention…
1
vote
0 answers

SVM: why only count the first two column features

I found a SVM example online. I do not understand why only count the first two columns of features. The data set is famous "spiral_Nc10_train.mat" and "spiral_Nc10_train.mat". "spiral_Nc10_train.mat" contains 1) data = 1000*3 double; 2) label =…
Angelababy
  • 247
  • 3
  • 7
  • 16
1
vote
1 answer

How to provide cost for balancing training by imbalanced train dataset as available in svmlight?

Cost in e1071's SVM doesn't seems same as svmlight's Cost. The manual of e1071 library states the following definition for its cost parameter: cost of constraints violation (default: 1)—it is the ‘C’-constant of the regular- ization term in the…
Bit Manipulator
  • 248
  • 1
  • 2
  • 17
1
vote
1 answer

hierarchical classification with SVM

I am trying to deal with a problem of classification with SVM, at the beginning I managed to solve the problem at the first level, ie classify my data into 2 classes (class1 and class2). now I want to continue the classification hierarchically ie f…
1
vote
1 answer

How to prepare input file for svmstruct

I want to use svmstruct for my Named entity Recognition task. Some of my features for each token are not in numerical format ( mostly in textual format such a n-char affixes or word shape ,...) . Since svmstruct's input format is same as svmlight…
1
vote
0 answers

SVM-light Out of memory error

I'm using SVM-light for my researches and it's doing fine (at least it's still processing) svm_learn.exe -t 3 -m 4000 learn_data model Learn_data here is about 14.000.000 lines of of data with 20.000 of features. But as for -z p flag for ranking…
aromatvanili
  • 175
  • 4
  • 12
1
vote
1 answer

SVMlight train data formatting

I am trying to classify the reuters text using svm light but my train data does not follow the format <'line> .=. <'target> <'feature>:<'value> <'feature>:<'value> ... <'feature>:<'value> # <'info> it is of the form <'line> .=.…
1
vote
1 answer

unclassified data points in libsvm?

SVM tutorials state that if a data point falls in the area surrounding the separating line (in the margin) - it isn't classified. How is this implemented in libraries like SVMlight and libsvm?
Cheshie
  • 2,777
  • 6
  • 32
  • 51