Questions tagged [glove]

GloVe is an unsupervised learning algorithm for obtaining vector representations for words (word embeddings).

See https://nlp.stanford.edu/projects/glove/ for more information.

92 questions
0
votes
2 answers

Can't map a certain word to vector

I'm having trouble implementing word-to-vector mapping with GloVe. My code seems to be working fine, but there is a weird problem: I'm getting an error when trying to map one particular word, 'the', to its vector representation. I have no idea…
beginner
  • 89
  • 1
  • 4
0
votes
1 answer

IMDB dataset preprocessing unsuitable for GloVe word embeddings?

I want to train a simple sentiment classifier on the IMDB dataset using pretrained GloVe vectors, an LSTM, and a final dense layer with sigmoid activation. The problem I have is that the obtained accuracy is relatively low: 78%. This is lower than the…
fornit
  • 79
  • 1
  • 11
0
votes
1 answer

How do you create Bag-of-Words feature vector after applying GloVe embedding?

If I have two movie reviews, "this was a really good movie" and "i did not like this movie at all", and I apply GloVe embedding to them, I will get two vectors, with multiple word vectors inside them that look like this: 1st Review: [[300…
sgmm
  • 580
  • 2
  • 7
  • 13
0
votes
1 answer

Load a part of GloVe vectors with gensim

I have a word list like ['like', 'Python'] and I want to load the pre-trained GloVe word vectors for these words, but the GloVe file is too large. Is there a fast way to do it? What I tried: I iterated through each line of the file to see if the word is…
Pone
  • 3
  • 1
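For the partial-loading question above, one common approach is to scan the text file once and keep only the lines whose first token is in the word list. The following is a minimal sketch that assumes the standard GloVe text layout (one word per line followed by space-separated floats). The helper name `load_glove_subset` and the sample values are hypothetical and not taken from the question.

```python
import io


def load_glove_subset(lines, wanted):
    """Return {word: vector} for the words in `wanted`, scanning `lines` once."""
    wanted = set(wanted)
    vectors = {}
    for line in lines:
        # Split off only the first token; parse the floats just for matches.
        word, _, rest = line.rstrip("\n").partition(" ")
        if word in wanted:
            vectors[word] = [float(x) for x in rest.split()]
            if len(vectors) == len(wanted):
                break  # every requested word was found, stop reading
    return vectors


# Tiny made-up stand-in for a GloVe text file (values are illustrative only).
sample = io.StringIO(
    "the 0.1 0.2 0.3\n"
    "like 0.4 0.5 0.6\n"
    "Python 0.7 0.8 0.9\n"
    "movie 1.0 1.1 1.2\n"
)
subset = load_glove_subset(sample, ["like", "Python"])
print(subset)
```

With a real file, the same function could be called on an open file handle, for example `open(path, encoding="utf-8")`, since a file object yields lines one at a time without loading the whole file into memory.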
0
votes
2 answers

Illegal Hardware Instruction Error when using GloVe

I am trying to train GloVe embeddings. In the GloVe implementation from stanfordnlp there are 4 scripts to run. However, running the second script, cooccur, results in an Illegal Hardware Instruction error. I don't understand how this error is…
sinaj
  • 129
  • 1
  • 1
  • 10
0
votes
2 answers

Similarity of two words in GloVe

How can I find the similarity of two words in GloVe? We have model.similarity(word1, word2) in Word2vec. Is there a similar way in GloVe? How do I save and load a trained GloVe model?
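Word similarity between embedding vectors is usually measured with cosine similarity, which can be computed directly from two GloVe vectors once they are loaded. The following is a minimal sketch using only the standard library. The `vectors` dictionary holds made-up three-dimensional values for illustration, not real GloVe embeddings.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Made-up vectors standing in for rows of a GloVe file (assumption).
vectors = {
    "king": [0.5, 0.7, 0.1],
    "queen": [0.4, 0.8, 0.2],
    "apple": [0.9, -0.3, 0.0],
}

sim_close = cosine_similarity(vectors["king"], vectors["queen"])
sim_far = cosine_similarity(vectors["king"], vectors["apple"])
print(sim_close, sim_far)
```

Values near 1 indicate vectors pointing in nearly the same direction, and values near 0 indicate unrelated directions.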
0
votes
1 answer

How to convert a Spark MLlib word2vec model to GloVe txt format?

I use Spark MLlib to train a domain-specific word2vec model, and I need to use it in GloVe format. How can I convert it to the GloVe txt format?
0
votes
1 answer

GloVe: most similar to multiple words

I am supposed to do some exercises with Python and GloVe. Most of them don't give me any problems, but now I am supposed to find the 5 most similar words to "norway - war + peace" from the "glove-wiki-gigaword-100" package. But when I run my code it just…
Littellittel
  • 13
  • 1
  • 4
0
votes
0 answers

Use GloVe vectors without Embedding layers in LSTM

I want to use GloVe vectors in language modeling. The problem is that if I use an Embedding layer in the model, I can't predict the output vector and match it to a word. What I mean is, I want to give the GloVe vector representation for my…
Sathiyakugan
  • 674
  • 7
  • 20
0
votes
0 answers

How to convert a vector back to natural language using a pre-trained GloVe model?

I am training my neural network on a large corpus of text data. First, I converted every word of the text into a vector using GloVe. Those vectors are the input to the neural network. Now, I have an output vector from the output layer of the network. How…
bot_xxx
  • 1
  • 1
0
votes
0 answers

How to convert a text file with word2vec using Python

I am a beginner in Python, natural language processing, deep learning, and neural networks. I want to run a program that converts a text file into vectors using word2vec in Python. Someone please help me. import math import nltk file =…
0
votes
1 answer

GloVe word embedding model parameters using text2vec in R, and display training output (epochs) after every n iterations

I am using the text2vec package in R to train word embeddings (GloVe model) as follows: library(text2vec) library(tm) prep_fun = tolower tok_fun = word_tokenizer tokens = docs %>% # docs: a collection of text documents prep_fun %>% tok_fun it =…
Sam S.
  • 627
  • 1
  • 7
  • 23
0
votes
0 answers

Segmentation fault (core dumped) in glove.c

I'm trying to train GloVe https://github.com/stanfordnlp/GloVe/blob/master/src/glove.c on a pretty big dataset, the newest Wikipedia dump (22G txt file). The vocabulary I'm training on has 1.7 million words. Every file (shuffle, cooccur, vocab_count) until…
aerin
  • 20,607
  • 28
  • 102
  • 140
-1
votes
1 answer

GloVe text pre-processing

I noticed that in some techniques, people convert URLs, numbers, and dates in the text to placeholders. Does the GloVe dataset have embeddings trained for these placeholders? Can I feed them directly into the dataset?
Abhishek Bhatia
  • 9,404
  • 26
  • 87
  • 142
-1
votes
1 answer

Embedding and clustering a specific text (using GloVe)

Edit 2: I thought more about my question and realized it was way too general, and that it is only a matter of something basic: creating a new array from the GloVe file (glove.6B.300d.txt) that contains ONLY the list of words that I have in my…