Questions tagged [fasttext]

fastText is a library for efficient learning of word representations and sentence classification. See https://github.com/facebookresearch/fastText for more information.

465 questions
0
votes
3 answers

Facebook fasttext bin model UnicodeDecodeError

I downloaded a pretrained word vector file (.bin) from Facebook (https://fasttext.cc/docs/en/crawl-vectors.html). However, when I tried to use this model, it raised an error. from gensim.models import FastText fasttext_model =…
zzaebok
  • 55
  • 1
  • 7
0
votes
1 answer

Not able to load input data for fasttext

My input.txt contains the following: __label__SPAM buy our products __label__HAM Please send me the last business forecast __label__SPAM buy viagra __label__HAM important meeting at 10:00AM But after running the command: ./fasttext skipgram -input…
hacker315
  • 1,996
  • 2
  • 13
  • 23
0
votes
1 answer

Desired distribution of weights in word embedding vectors

I am training my own embedding vectors as I'm focused on an academic dataset (WOS); whether the vectors are generated via word2vec or fasttext doesn't particularly matter. Say my vectors are 150 dimensions each. I'm wondering what the desired…
Evan Mata
  • 500
  • 1
  • 6
  • 19
0
votes
1 answer

Train fastText on non-English data set

I'm starting a new project in which I want to represent words as vectors. I read about the fastText library and saw that it has pre-trained models for languages other than English. The purpose is to predict closeness between different words…
U.Rush
  • 398
  • 2
  • 7
0
votes
1 answer

Fasttext ignore wrong predictions?

I have trained a fastText supervised model. I used the pyfasttext Python library to make predictions with the model. For example, I have data like this: text - label The meeting is planned - event The work should be finished - task Here, the model predicts the label…
Giri Annamalai M
  • 810
  • 9
  • 24
0
votes
1 answer

How can I get a vector from the output matrix in FastText?

In this study (https://arxiv.org/abs/1602.01137), the authors found that Word2Vec generates two kinds of embeddings (IN and OUT). You can easily get these using the syn1 attribute in gensim word2vec. But in the case of gensim fastText, the syn1…
0
votes
1 answer

Gensim fasttext wrapper returns permission error 13 while model training

I tried to reproduce this tutorial on my local machine to get used to gensim fasttext functionalities. Fasttext and gensim libraries are correctly installed. By calling the train method of gensim fasttext wrapper model_wrapper =…
Marco
  • 91
  • 8
0
votes
0 answers

Python - Google Colab Change Directory after wget

In Google Colaboratory, I downloaded the fastText module with the !wget command. !wget https://github.com/facebookresearch/fastText/archive/v0.1.0.zip !unzip v0.1.0.zip It works fine. The "fastText-0.1.0" file was extracted successfully. After that: !cd…
0
votes
0 answers

Fasttext UnicodeDecode issue

I am trying to load a fastText file to use as a word embedding for the first time. I have this: KeyedVectors.load_word2vec_format(binary_file_path, binary=True, encoding='utf-8', unicode_errors='ignore') I also tried what is described here:…
amy
  • 342
  • 1
  • 5
  • 18
0
votes
0 answers

fastText Quick Start Guide

I am reading the recent book fastText Quick Start Guide and running the code in the book. When I ran this code: $ cat data/yelp/yelp_review.csv | \ python parse_yelp_dataset.py \ > data/yelp/yelp_review.v1.csv it was OK. But when I ran…
Kyaw
  • 27
  • 7
0
votes
0 answers

How to run a fastText model in a Google extension?

I am trying to integrate a trained fastText model for text classification into a Google extension. There are several fastText JavaScript wrappers, but all of them run on Node.js, which is server-side, while a Chrome extension, to my knowledge, runs on…
0
votes
2 answers

How to make 2 versions of Fasttext Python wrapper work together?

We have used Facebook's amazing fastText library for a while. We access the trained models using the Python wrapper (https://pypi.org/project/fasttext/). It used to be a third-party library, but is now maintained by Facebook and was merged into their…
Dennis Golomazov
  • 16,269
  • 5
  • 73
  • 81
0
votes
1 answer

Prevent cmake from using certain CPU features

I am building the fastText library on one machine but running it on another machine with a different CPU. I am building it this way: mkdir build && cd build && cmake .. && make && make install When I run it on the other machine, I get "Illegal instruction"…
JustAC0der
  • 2,871
  • 3
  • 32
  • 35
0
votes
0 answers

How to convert a large binary file into a pickle dictionary in Python?

I am trying to convert a large binary file containing Arabic words with 300-dimensional vectors into a pickle dictionary. What I have written so far is: import pickle ArabicDict = {} with open('cc.ar.300.bin', encoding='utf-8') as lex: for token in lex: …
KDA
  • 1
  • 3
0
votes
0 answers

Can we use a char RNN to create embeddings for out-of-vocabulary words?

I have word embeddings for 10 million words which were trained on a huge corpus. Now I want to produce word embeddings for out-of-vocabulary words. Can I design some char RNN to use these word embeddings and generate embeddings for out of vocab…