Questions tagged [nmf]

Non-negative matrix factorization (NMF or NNMF), also non-negative matrix approximation is a group of algorithms in multivariate analysis and linear algebra where a matrix V is factorized into (usually) two matrices W and H, with the property that all three matrices have no negative elements.

nmf is a technique to approximate a matrix like V = WH. Here dimension of V,W,H can be respectively m*n, m*p, p*n where p << n usually. Now W can be thought as a weight matrix for hidden variables. As p can be very small this can also be viewed as a dimensionality reduction technique like pca.

nmf is widely applicable in most real world cases where V can't have negative values like image-recognition, text-classification, recommender system etc. General applications of nmf include:

Feature learning like Principal Component Analysis
Topic recovery like Probabilistic Latent Semantic Analysis
Clustering like K-means
Temporal Segmentation like Hidden Markov Model
Filtering and source separation like Independent Component Analysis

For this tag users should provide mathematical clarity as it is an advanced topic along with information about application to specific case.

Useful links:

77 questions

votes

2 answers

R NMF package: How to extract sample classifications?

In the NMF R-package one can use consensusmap() to visualise outputs. The plots show which samples belong to which clusters in the "consensus" track. I would like to extract this sample classification such that I get a data frame like this: Sample …

r cluster-analysis consensus nmf

asked Nov 20 '16 at 12:58

Esben Eickhardt

3,183
2
35
56

votes

2 answers

How can I correctly use Pipleline with MinMaxScaler + NMF to predict data?

This is a very small sklearn snipplet: logistic = linear_model.LogisticRegression() pipe = Pipeline(steps=[ ('scaler_2', MinMaxScaler()), ('pca', decomposition.NMF(6)), ('logistic', logistic), ]) from sklearn.cross_validation…

scikit-learn pipeline nmf

asked Aug 28 '16 at 15:02

Bear Huang

votes

0 answers

ERROR: long vectors (argument 1) are not supported in .C [in call to 'silhouette.default'] with R package NMF

Currently I'm running r version 3.6.0 (2019-04-26) on a debian server with 264 GB memory and Intel(R) Xeon(R) CPU. Now I'm trying to run the nmf calculating for a about (5*10^4) * 1100 matrix, it works well when I set a specific rank, such…

r matrix vector debian-based nmf

asked Mar 26 '20 at 06:52

zzbb2266

votes

1 answer

Surprise NMF throws ZeroDivisionError: float division

I'm trying to do a basic recommendation system. I use Surprise's NMF model for this. Here is my dataset just before starting to work with NMF: store_id item_id quantity 0 62693933 912003029 3.000 1 62693933 912003034 4.000 2…

python nmf

asked Feb 10 '20 at 13:49

emremrah

1,733
13
19

votes

0 answers

Scikit-learn NMF removing duplicate words

I'm using scikit-learn's nmf algorithm to extract trending words from some blogs. For example I have "game thrones"( which is good although "of" is dropped as stopword ), but I also have "game" and "thrones". I have "marcus hutchins"(good) but I…

python machine-learning scikit-learn topic-modeling nmf

asked Aug 09 '17 at 08:22

jack jack

votes

1 answer

Non negative matrix factorisation in python on individual images

I am trying to apply NMF to a particular image that is loaded in grayscale mode. I have tried several links but my image after application of NMF remains almost the same and cannot be distinguished with the grayscale image initially loaded. However,…

python numpy matplotlib scikit-learn nmf

asked Jul 09 '17 at 05:34

Nishanth Rao

votes

0 answers

How to test the trained NMF topic model on new text

I have created a NMF topic model in python the code snippet for which is as follows: def select_vectorizer(req_ngram_range=[1,2]): ngram_lengths = req_ngram_range vectorizer = TfidfVectorizer(analyzer='word', ngram_range=(ngram_lengths),…

python scikit-learn nlp topic-modeling nmf

asked May 18 '17 at 22:13

Arman

votes

1 answer

IndexError: out of bounds using NMF in sklearn

I am attempting to create topic models from a corpus of data. The code is able to properly use NMF to generate the tasked number of topics from the parsed data, however it breaks when the corpus length = 20, as seen below 20 [u'bell', u'closed',…

python scikit-learn nmf

asked Jan 13 '17 at 21:51

sudo_coffee

vote

1 answer

How to determine which document falls under a particular topic after applying topic modelling techniques like NMF, LDA, BERTopic?

Is there any way I can map generated topic from LDA, NMF and BERTopic to the list of documents and identify to which topic it belongs to? Click here to view Example

python nlp lda topic-modeling nmf

asked Aug 22 '22 at 12:07

Navya

vote

2 answers

get_coherence : C_V method gets an error but U_Mass works

I'm using the following code to check the coherence value. The problem is code below works well when I change the coherence type into "u_mass", but if I want to compute "c_v", an Index error occure. Previous text process: # Remove Stopwords, Form…

lda topic-modeling topicmodels nmf

asked May 19 '22 at 09:30

Victoria L

vote

1 answer

Unable to find dot product of two matrix (W and H from NMF ) with same inner dimensions

I am doing Non-Negative Matrix Factorization (NMF) of a matrix A in R. It has Genes on rows and Samples on the columns. For NMF, I am using the CRAN package NMF. Once the basis matrix W and coefficient matrix H are computed, I want to check whether…

r matrix-multiplication dimensionality-reduction nmf

asked May 11 '22 at 13:06

sp29

vote

1 answer

Topic Modelling - I have used NMF and LDA, what is next?

I have used NMF and LDA for topic modelling in Python, with what I would call good results with NMF, and poor results with LDA. My data is highly domain specific, with a lot of unique/specific vocabulary. I am trying to improve my NMF output by…

python nlp lda topic-modeling nmf

asked Jun 27 '21 at 07:26

Prolle

vote

1 answer

Short text in the context of topic modeling

I am working on topic modeling and I am curious what exactly would be short text under this context?For example, if there is a research paper ,would the research paper's title and abstract be considered as short text?

python-3.x nlp lda topic-modeling nmf

asked Jun 09 '20 at 10:29

Sri Test

vote

1 answer

NMF with negative values Python

I'm working with the Scikit-Learn NMF algorithm and I would like to know if there is any way to use negative values with the algorithm, I need it to work with BVH files. I'm using python 3.7.5 import numpy as np import re from sklearn.decomposition…

python python-3.x scikit-learn scikits nmf

asked Apr 29 '20 at 00:44

Mauricio Fernandez

vote

0 answers

Scikit-learn NMF return NAN values

I am working with a 6650254x5650 sparse matrix which values are in numpy.float64 format. I am using the NMF implemetnation from scikit-learn as following from sklearn.decomposition import NMF model = NMF(n_components=12, init='random',…

numpy scikit-learn sparse-matrix matrix-decomposition nmf

asked Apr 09 '20 at 10:28

Areza

5,623
7
48
79

Prev 1

3 4 5 6 Next