Questions tagged [machine-learning]

Implementation questions about machine learning algorithms. General questions about machine learning (concepts, theory, methodology, terminology, etc.) should be posted to their specific communities.

Machine learning revolves around developing self-learning computer algorithms that function by virtue of discovering patterns in data and making intelligent decisions based on such patterns.

Machine learning is a subfield of computer science that evolved from the study of pattern recognition and computational learning theory in artificial intelligence. Machine learning explores the construction and study of algorithms that can learn from and make predictions about data. Such algorithms operate by building a model from example inputs in order to make data-driven predictions or decisions rather than following strictly static program instructions.
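
As a minimal sketch of "building a model from example inputs", the snippet below fits a straight line to a handful of points by ordinary least squares and then predicts an unseen input. The function names `fit_line` and `predict` and the toy data are assumptions made up for illustration; they do not come from any particular library.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and co-deviation of x and y
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(model, x):
    """Apply the learned (slope, intercept) pair to a new input."""
    slope, intercept = model
    return slope * x + intercept


# Hypothetical training examples generated from y = 2x + 1
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]

model = fit_line(xs, ys)          # "learning": parameters come from the data
prediction = predict(model, 5.0)  # prediction for an input not seen in training
```

The relationship between input and output is never written into the program; it is recovered from the examples, which is the distinction from static instructions drawn above.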

NOTE: If you want to use this tag for a question not directly concerning implementation, then consider posting on Cross Validated, Data Science, or Artificial Intelligence instead; otherwise you're probably off-topic. Please choose one site only and do not cross-post to more than one - see Is cross-posting a question on multiple Stack Exchange sites permitted if the question is on-topic for each site? (tl;dr: no).

Classic Problems:

Classification (supervised learning) classification supervised-learning
Regression (supervised learning) regression
Clustering (unsupervised learning) cluster-analysis unsupervised-learning
Density estimation
Sampling
Reinforcement Learning reinforcement-learning

Relevant Algorithms:

Principal component analysis (PCA) pca
Artificial neural networks (ANN) neural-network
Support vector machines (SVM) svm support-vector-machines
K-nearest neighbor (kNN) knn nearest-neighbor
k-means k-means
Bayesian networks bayesian-networks
Gaussian mixture model (GMM) mixture-model
Decision trees decisiontrees
Genetic algorithms genetic-algorithm
Simulated annealing simulated-annealing
Hidden Markov model (HMM) hidden-markov-models
Conditional Random Field (CRF)
Gaussian Processes gaussian-process
Kalman filter kalman kalman-filter
Particle filter particle-filter
Gibbs sampling
Graphical models
Ensemble methods (bagging, boosting, ...) ensemble-learning
Deep learning deep-learning
Q-Learning q-learning

Applications:

Computer vision (e.g., object tracking, gesture recognition) computer-vision
Image recognition (e.g., face, gait, iris, handwriting) image-recognition face-recognition ocr
Speech recognition speech-recognition
Speaker recognition voice-recognition
Natural language processing (NLP) nlp
Music information retrieval (MIR)
Bioinformatics bioinformatics
Spam filtering spam-filtering
Anomaly detection anomaly-detection
Automatic vehicle driving
Recommendation system recommendation-engine
Machine translation machine-translation

Software:

LibSVM libsvm
Weka weka
Orange orange
Shogun shogun
scikit-learn scikit-learn
PyBrain pybrain
Apache Mahout mahout
RapidMiner rapidminer
KNIME knime
Waffles
Azure Machine Learning azure-machine-learning
nltk nltk
Caffe caffe
TensorFlow tensorflow
Theano theano
Keras keras
OpenNMT opennmt
XGBoost xgboost
CatBoost catboost
Stanford CoreNLP stanford-nlp

Related-tags:

Video Lectures:

Machine Learning with Python

55241 questions

134

votes

2 answers

Does it make sense to use Conda + Poetry?

Does it make sense to use Conda + Poetry for a Machine Learning project? Allow me to share my (novice) understanding and please correct or enlighten me: As far as I understand, Conda and Poetry have different purposes but are largely…

python machine-learning package conda python-poetry

asked Jan 25 '22 at 15:09

Seub

2,451
4
25
34

133

votes

6 answers

How does Apple find dates, times and addresses in emails?

In the iOS email client, when an email contains a date, time or location, the text becomes a hyperlink and it is possible to create an appointment or look at a map simply by tapping the link. It not only works for emails in English, but in other…

machine-learning nlp information-extraction named-entity-recognition

asked Feb 15 '12 at 14:12

Martin

39,309
62
192
278

133

votes

9 answers

Google Colaboratory: misleading information about its GPU (only 5% RAM available to some users)

update: this question is related to Google Colab's "Notebook settings: Hardware accelerator: GPU". This question was written before the "TPU" option was added. Reading multiple excited announcements about Google Colaboratory providing free Tesla K80…

python machine-learning gpu ram google-colaboratory

asked Feb 12 '18 at 15:44

stason

5,409
4
34
48

132

votes

3 answers

Why does one hot encoding improve machine learning performance?

I have noticed that when One Hot encoding is used on a particular data set (a matrix) and used as training data for learning algorithms, it gives significantly better results with respect to prediction accuracy, compared to using the original matrix…

machine-learning data-mining scikit-learn data-analysis

asked Jul 04 '13 at 12:04

maheshakya

2,198
7
28
43
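
For the one-hot encoding question above, here is a small sketch of what the encoding does to a categorical column. The helper `one_hot_encode` and the sample values are hypothetical. One commonly given explanation for the improvement is that plain integer codes suggest an ordering between categories that does not exist, while one-hot columns do not; whether that applies depends on the model and the data.

```python
def one_hot_encode(values):
    """Return the sorted category list and one 0/1 row per input value."""
    categories = sorted(set(values))
    index = {category: i for i, category in enumerate(categories)}
    rows = []
    for value in values:
        row = [0] * len(categories)
        row[index[value]] = 1  # exactly one column is set per row
        rows.append(row)
    return categories, rows


# Hypothetical categorical feature
colors = ["red", "green", "blue", "green"]
categories, encoded = one_hot_encode(colors)
```

Each row contains a single 1 in the column of its category, so no category is numerically "larger" than another.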

127

votes

23 answers

TensorFlow, "'module' object has no attribute 'placeholder'"

I've been trying to use tensorflow for two days now installing and reinstalling it over and over again in python2.7 and 3.4. No matter what I do, I get this error message when trying to use tensorflow.placeholder() It's very boilerplate code: tf_in…

python machine-learning tensorflow

asked May 23 '16 at 06:20

user3023715

1,539
2
11
12

125

votes

7 answers

What is the difference between a feature and a label?

I'm following a tutorial about machine learning basics and there is mentioned that something can be a feature or a label. From what I know, a feature is a property of data that is being used. I can't figure out what the label is, I know the meaning…

machine-learning

asked Nov 30 '16 at 20:54

Wojtek Wencel

2,257
6
31
65

125

votes

8 answers

Why should weights of Neural Networks be initialized to random numbers?

I am trying to build a neural network from scratch. Across all AI literature there is a consensus that weights should be initialized to random numbers in order for the network to converge faster. But why are neural networks initial weights…

machine-learning neural-network artificial-intelligence mathematical-optimization gradient-descent

asked Nov 17 '13 at 05:34

Shayan RC

3,152
5
33
40

124

votes

6 answers

machine learning libraries in C#

Are there any machine learning libraries in C#? I'm after something like WEKA. Thank you.

c# machine-learning

asked Oct 26 '09 at 10:23

Dervin Thunk

19,515
28
127
217

123

votes

6 answers

Common causes of nans during training of neural networks

I've noticed that a frequent occurrence during training is NANs being introduced. Often times it seems to be introduced by weights in inner-product/fully-connected or convolution layers blowing up. Is this occurring because the gradient computation…

machine-learning neural-network deep-learning caffe gradient-descent

asked Nov 27 '15 at 17:23

Aidan Gomez

8,167
5
28
51

122

votes

6 answers

How to load a model from an HDF5 file in Keras?

How to load a model from an HDF5 file in Keras? What I tried: model = Sequential() model.add(Dense(64, input_dim=14, init='uniform')) model.add(LeakyReLU(alpha=0.3)) model.add(BatchNormalization(epsilon=1e-06, mode=0, momentum=0.9,…

python machine-learning keras data-science

asked Jan 29 '16 at 00:03

pr338

8,730
19
52
71

122

votes

6 answers

Understanding min_df and max_df in scikit CountVectorizer

I have five text files that I input to a CountVectorizer. When specifying min_df and max_df to the CountVectorizer instance what does the min/max document frequency exactly mean? Is it the frequency of a word in its particular text file or is it the…

python machine-learning scikit-learn nlp

asked Dec 29 '14 at 23:57

moeabdol

4,779
6
44
43
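
For the CountVectorizer question above: in scikit-learn, `min_df` and `max_df` refer to document frequency, that is, the number of documents (for an integer) or the proportion of documents (for a float) in which a term appears at least once, not how often the term occurs inside one file. The pure-Python sketch below mimics that filtering rule under simplifying assumptions: whitespace tokenization, and helper names `document_frequency` and `filter_vocabulary` that are invented here and are not part of scikit-learn.

```python
def document_frequency(docs):
    """Count, for each term, the number of documents containing it."""
    df = {}
    for doc in docs:
        # set() so a term repeated inside one document counts once
        for term in set(doc.lower().split()):
            df[term] = df.get(term, 0) + 1
    return df


def filter_vocabulary(docs, min_df=1, max_df=1.0):
    """Keep terms whose document frequency lies in [min_df, max_df].

    Integers are absolute document counts; floats are proportions
    of the corpus size.
    """
    n_docs = len(docs)
    low = min_df if isinstance(min_df, int) else min_df * n_docs
    high = max_df if isinstance(max_df, int) else max_df * n_docs
    df = document_frequency(docs)
    return sorted(term for term, count in df.items() if low <= count <= high)


# Hypothetical three-document corpus
docs = ["the cat sat", "the dog sat", "the bird flew"]
df = document_frequency(docs)
vocab = filter_vocabulary(docs, min_df=2, max_df=0.9)
```

Here "the" appears in all three documents and is dropped by `max_df=0.9`, the terms that appear in only one document are dropped by `min_df=2`, and only "sat" remains.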

120

votes

8 answers

Can Keras with Tensorflow backend be forced to use CPU or GPU at will?

I have Keras installed with the Tensorflow backend and CUDA. I'd like to sometimes on demand force Keras to use CPU. Can this be done without say installing a separate CPU-only Tensorflow in a virtual environment? If so how? If the backend were…

python machine-learning tensorflow keras

asked Nov 19 '16 at 08:04

mikal94305

4,663
8
31
40

120

votes

8 answers

What is an intuitive explanation of the Expectation Maximization technique?

Expectation Maximization (EM) is a kind of probabilistic method to classify data. Please correct me if I am wrong if it is not a classifier. What is an intuitive explanation of this EM technique? What is expectation here and what is being…

machine-learning cluster-analysis data-mining mathematical-optimization expectation-maximization

asked Aug 04 '12 at 10:56

London guy

27,522
44
121
179

119

votes

3 answers

How to concatenate two layers in keras?

I have an example of a neural network with two layers. The first layer takes two arguments and has one output. The second should take one argument as result of the first layer and one additional argument. It should looks like this: x1 x2 x3 \ / …

python machine-learning keras neural-network hierarchical

asked Apr 04 '17 at 00:56

rdo

3,872
6
34
51

119

votes

20 answers

Scikit-learn: How to obtain True Positive, True Negative, False Positive and False Negative

My problem: I have a dataset which is a large JSON file. I read it and store it in the trainList variable. Next, I pre-process it - in order to be able to work with it. Once I have done that I start the classification: I use the kfold cross…

python machine-learning scikit-learn classification supervised-learning

asked Jul 09 '15 at 17:19

Euskalduna

1,517
2
13
12
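
For the true/false positive question above, the four counts can be computed directly from the true and predicted labels of a binary classifier. This is a plain-Python sketch; the function name `confusion_counts` and the label lists are made up for illustration.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, tn, fp, fn) for a binary classification task."""
    tp = tn = fp = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive and truth == positive:
            tp += 1  # predicted positive, actually positive
        elif pred == positive:
            fp += 1  # predicted positive, actually negative
        elif truth == positive:
            fn += 1  # predicted negative, actually positive
        else:
            tn += 1  # predicted negative, actually negative
    return tp, tn, fp, fn


# Hypothetical labels
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 1, 0, 1, 0]
tp, tn, fp, fn = confusion_counts(y_true, y_pred)
```

With scikit-learn installed, `sklearn.metrics.confusion_matrix(y_true, y_pred).ravel()` should yield the same four numbers for binary labels, in the order tn, fp, fn, tp.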
