Word2vec classification and clustering tensorflow

Question

I am trying to cluster some sentences using similarity (maybe cosine) and then maybe use a classifier to put text in predefined classes.

My idea is to use tensorflow to generate the word embedding then average them for each sentence. Next use a clustering/classification algorithm.

Does tensorflow provide ready to use word2vec generation algorithm?

Would a bag of words model generate a good output?

score 1 · Accepted Answer · answered May 10 '17 at 03:03

1

No, tensorflow does not provide a ready-to-use word2vec but it does have a tutorial on word2vec.
Yes, a bag of words can generate surprisingly good output (but not state-of-the-art), and has the benefit of being amazingly faster. I have a small amount of data (tens of thousands of sentences) and have achieved F1 scores of >0.90 for classification.

answered May 10 '17 at 03:03

Anton Codes

Well I found out about Tensorflow not having a ready to use for word2vec or glove. They provided a tutorial for implementing a word2vec model. I took a look at DL4J (JAVA) and I think I am going to stick with that for the moment. – LonsomeHell May 10 '17 at 20:41

1 Answers1