Questions tagged [scikits]

Scientific Python toolkits

Scikits is the collective name of a number of Python libraries for various scientific tasks, such as statistics, machine learning and image and audio processing. These scikits build on Numpy and Scipy.

An overview of available scikits is given at http://scikits.appspot.com/scikits

218 questions
5
votes
3 answers

SVM implmentation, scikits learn reducing runtime, fastest svm

I'm working with scikit-learn on building some predictive models with SVMs. I have a dataset with around 5000 examples and about 700 features. I'm 5 fold cross validating with a 18x17 grid search on my training set then using the optimal parameters…
tomas
  • 665
  • 1
  • 10
  • 14
5
votes
1 answer

Sparse implementations of distance computations in python / scikit-learn

I have a large (100K by 30K) and (very) sparse dataset in svmlight format which I load as follows: import numpy as np from scipy.cluster.vq import kmeans2 from scipy.spatial.distance import pdist, squareform from sklearn.datasets import…
Nicholas
  • 313
  • 2
  • 9
5
votes
1 answer

F1 smaller than both precision and recall in Scikit-learn

I am doing multi-class classification, with unbalanced categories. I noticed f1 is always smaller than the direct harmonic mean of precision and recall, and in some cases, f1 is even smaller than both precision and recall. FYI, I called…
Flake
  • 4,377
  • 6
  • 30
  • 29
5
votes
1 answer

Scikits-learn: Use custom vocabulary together with Pipeline

In my scikits-learn Pipeline, I would like to pass a custom vocabulary to CountVectorizer(): text_classifier = Pipeline([ ('count', CountVectorizer(vocabulary=myvocab)), ('tfidf', TfidfTransformer()), ('clf',…
Mathias Loesch
  • 373
  • 1
  • 5
  • 15
5
votes
2 answers

No module named 'info' on fresh Python 3 installation

I did a fresh python3 installation on OSX via homebrew: brew install python3 Then I created a virtual environment for my project and installed scipy and scikits.samplerate: virtualenv -p /usr/local/bin/python3 pythen_env pip install scipy pip…
Emiswelt
  • 3,909
  • 1
  • 38
  • 56
5
votes
4 answers

Error Installing Scikits.audiolab

I'm trying to install scikits.audiolab with pip. I read this post, however, it looks like that user's problem was that they hadn't installed numpy. I can replicate their trace by uninstalling numpy and running pip install scikits.audiolab. The…
Ivan Kelber
  • 336
  • 4
  • 13
5
votes
1 answer

How to get `skbio` PCoA (Principal Coordinate Analysis) results?

I'm looking at the attributes of skbio's PCoA method (listed below). I am new to this API and I want to be able to get the eigenvectors and the original points projected onto the new axis similar to .fit_transform in sklearn.decomposition.PCA so I…
O.rka
  • 29,847
  • 68
  • 194
  • 309
5
votes
2 answers

Neural Network composed of multiple activation functions

I am using the sknn package to build a neural network. In order to optimize the parameters of the neural net for the dataset I am using I am using an evolutionary algorithm. Since the package allows me to build a neural net where each layer has a…
benj rei
  • 329
  • 2
  • 5
  • 12
5
votes
1 answer

Choosing and implementing clustering method: DBSCAN something else?

I have a need to cluster a data set of lat,long coordinates. I am using python as my language and plan on using DBSCAN as I don't want to have to specify the # of clusters. The goal and purpose is to be able to input a large data set of lat,long…
5
votes
2 answers

How to obtain GridSearchCV partly finished results?

I have started a grid search for SVM parameters in a rather wide range. The most of the search space have been calculated and now I got one last process, which goes already for 100 hours. I'd like to see the results, that already have been…
Kamerer
  • 77
  • 1
  • 1
  • 7
5
votes
1 answer

Adding new words to text vectorizer in scikit-learn

Scikit-learn CountVectorizer for bag-of-words approach currently gives two sub-options: (a) use a custom vocabulary (b) if custom vocabulary is unavailable, then it makes a vocabulary based on all the words present in the corpus. My question: Can…
5
votes
1 answer

Scikit-learn: role of weights in Ridge Regression

I am using the library scikit-learn to perform Ridge Regression with weights on individual samples. This can be done by: esimator.fit(X, y, sample_weight=some_array). Intuitively, I expect that larger weights mean larger relevance for the…
Marco
  • 3,053
  • 5
  • 27
  • 29
5
votes
2 answers

crab: why is crab's UserBasedRecommender so slow?

the benchmark for crab is (http://www.slideshare.net/marcelcaraciolo/crab-a-python-framework-for-building-recommender-systems page-37) Benchmarks Pure Python w/ Python w/ Scipy Dataset dicts and NumpyMovieLens 100k 15.32 s 9.56 s…
user1687717
  • 3,375
  • 7
  • 26
  • 29
4
votes
2 answers

sklearn (scikit-learn) logistic regression package -- set trained coefficients for classification.

So I read the scikit-learn package webpate: http://scikit-learn.sourceforge.net/dev/modules/generated/sklearn.linear_model.LogisticRegression.html I can use logistic regression to fit the data, and after I obtain an instance of LogisticRegression,…
CuriousMind
  • 15,168
  • 20
  • 82
  • 120
4
votes
2 answers

audiolab for python 2.7

I have python 2.7 installed in my system and I need to use the audiolab module. I downloaded from the following site: http://www.lfd.uci.edu/~gohlke/pythonlibs/#scikits.audiolab When I import scikits.audiolab I get the following error: Traceback…
user887112
1 2
3
14 15