Questions tagged [lda]

Latent Dirichlet Allocation (LDA) is a generative model in which sets of observations are explained by unobserved groups, and those groups account for why some parts of the data are similar.

If the observations are words collected into documents, LDA posits that each document is a mixture of a small number of topics and that each word in the document is attributable to one of those topics. In other words, LDA represents documents as mixtures of topics, and each topic generates words with certain probabilities.
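The generative story described above can be sketched in a few lines of Python. This is a minimal illustration using only the standard library, not a fitting procedure. The vocabulary, the two topic-word distributions, and the `alpha` values are made-up assumptions chosen for readability, and the helper names are hypothetical.

```python
import random

def sample_dirichlet(alpha, rng):
    # Normalizing independent Gamma(alpha_i, 1) draws yields a Dirichlet(alpha) sample.
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]

def sample_categorical(probs, rng):
    # Draw one index according to the given probabilities.
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]

def generate_document(topic_word, vocab, alpha, length, rng):
    # 1. Draw the document's topic mixture theta.
    theta = sample_dirichlet(alpha, rng)
    topics, words = [], []
    for _ in range(length):
        # 2. Pick a topic for this word position from theta.
        z = sample_categorical(theta, rng)
        # 3. Pick a word from that topic's word distribution.
        w = sample_categorical(topic_word[z], rng)
        topics.append(z)
        words.append(vocab[w])
    return theta, topics, words

rng = random.Random(0)
vocab = ["gene", "dna", "cell", "ball", "team", "goal"]
# Two hypothetical topics: one about biology, one about sports.
topic_word = [
    [0.4, 0.3, 0.3, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.4, 0.3, 0.3],
]
alpha = [0.5, 0.5]  # assumed symmetric prior; small values favor few topics per document

theta, topics, words = generate_document(topic_word, vocab, alpha, 10, rng)
print("topic mixture:", theta)
print("words:", words)
```

Fitting an LDA model runs this story in reverse: given only the words, it estimates the topic mixtures and the topic-word distributions.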

Latent Dirichlet Allocation should not be confused with Linear Discriminant Analysis, a supervised learning procedure for classifying observations into a set of categories.

1175 questions
0
votes
2 answers

Can I plot the results of the LDA method in R?

I have this code: library(MASS); mydata.qda <- qda(Sp ~ ., prior = c(1,1,1)/3, data = mydata.learn). I would like to plot my results like the example at http://scikit-learn.org/0.10/auto_examples/plot_lda_vs_qda.html
sonia
0
votes
0 answers

How to analyze irregular results with discriminant analysis in R?

I have learnt to use the LDA function in R to analyze regular results like this (X1 and X2 are factors, G is the classification):

X1    X2    G
2.95  6.63  1
2.53  7.79  1
3.57  5.65  1
3.16  5.47  2
2.16  6.22  2

Now my question is how to analyze the data if G are…
noahye
0
votes
0 answers

How to import and use feature vectors in MALLET's topic modelling?

I am using MALLET's topic modelling. I have a set of keywords along with weights for a set of documents. I want to train a model on them and use it to infer new documents. Note: each keyword of the document has a weight assigned to it which is similar to…
sravan_kumar
0
votes
1 answer

How to add new documents to an existing topic model in MALLET, or batch the model for large document counts

I want to use topic modeling and found MALLET suitable for me. I successfully created my first demo using some 0.1 million documents. Now, as per my requirements, I have to deal with 10 million documents, which I am not able to process. Is…
Hardik Dobariya
0
votes
1 answer

MALLET Dirichlet parameter higher than 1

I've been using MALLET to perform my topic modeling (LDA). I tried to discover 20 topics in a dataset. The outcome is the following (the list of keywords is not important for this question): 0 0.05013 list_of_topic_keywords_0 1 0.06444…
0
votes
1 answer

Perform LDA with 3 classes in R

I have three classes with means

mu1 <- matrix(c(3, 1), nrow=2)
mu2 <- matrix(c(4, 3), nrow=2)
mu3 <- matrix(c(8, 2), nrow=2)

and covariance

cov <- matrix(c(.5, .3, .3, .5), nrow=2, ncol=2)

I would like to simulate about 100 observations from each…
user3358686
0
votes
1 answer

Topic modeling using keywords for topics

I need to do topic modeling in the following manner. For example, I need to extract 5 topics from a single document. I have keywords for the 5 topics, and I need to extract the topics related to these 5 keywords. The keywords for…
user2876812
0
votes
1 answer

Term weighting for original LDA in gensim

I am using the gensim library to apply LDA to a set of documents. Using gensim I can apply LDA to a corpus whatever the term weights are: binary, tf, tf-idf... My question is, what is the term weighting that should be used for the original LDA? If…
papafe
0
votes
0 answers

Matrix whose rows have different column names in R

I'd like to have a matrix-like data structure in R, where each row has different column names. Essentially, I'd like almost a list of dictionaries. Consider the following code: x <- c(.5, .3, .2) y <- c(.1, .6, .3) names(x) <- c("foo", "bar",…
sinwav
0
votes
1 answer

Infer LDA models

I'm new to LDA and topic modeling and I would like to understand the inference mechanism. I would like to apply LDA to activity recognition. Say that I have defined 10 topics, each composed of a probability distribution over events. For example, TOPIC_1 =…
gabboshow
0
votes
1 answer

Similarity between LDA results for two different numbers of topics?

Suppose we choose 20 topics in LDA and then choose 30 topics. My question is: will the 30-topic result overlap with those 20 topics and produce similar results?
hitesh_noty
0
votes
1 answer

Text Analysis Using LDA and tm in R

Hey guys, I have a little bit of trouble conducting LDA because, for some reason, once I get ready to conduct the analysis I get errors. I'll do my best to go through what I am doing. Unfortunately, I will not be able to provide data because the data I…
theamateurdataanalyst
0
votes
2 answers

Linear discriminant analysis variable importance

Using the R MASS package to do a linear discriminant analysis, is there a way to get a measure of variable importance?

library(MASS)
### import data and do some preprocessing
fit <- lda(cat~., data=train)

I have a data set with about 20…
Nick
0
votes
1 answer

How do I classify a signal using LDA?

I am using the LDA algorithm to classify a signal. How can I do that with multiple classes? I have read some references but don't get the result. I've looked at example code at link, but it doesn't have an explicit classifier part for me!
user3517638
0
votes
1 answer

Infer new Document using Mahout 0.9 LDA

I am able to build an LDA model using the Mahout 0.9 CVB implementation. I now need to infer a new document using this model. I was wondering if there is any existing method/implementation to do the same? Any help will be highly appreciated.…