Questions tagged [fasttext]

fastText is a library for efficient learning of word representations and sentence classification.

fastText is a library for efficient learning of word representations and sentence classification. See https://github.com/facebookresearch/fastText for more information.

465 questions
-1
votes
1 answer

Facebook fasttext bin model preprocessing

I have downloaded a .bin FastText model,and load it as follows: ft = fasttext.load_model('/content/drive/MyDrive/dataset/cc.en.300.bin') how can i make preprocessing and normalization on cc.en.300.bin model. i want to make lemmatization, removing…
-1
votes
1 answer

Google Drive pretrained model file can't be opened for loading?

I have deployed a demo of a product using Heroku and I would like to load a pretrained fasttext model via a file in my Google Drive. I successfully downloaded the library via my heroku requirements file. The file I want to use to load it…
-1
votes
2 answers

Multi-label classification with FastText

I was wondering if FastText is able to deal with multi-labelled data? Could someone share a simple example along with a confusion matrix (true vs predicted labels)? I have already taken a look at FastText documentation page. Thank you in advance
-1
votes
1 answer

Error : cc.fr.300.bin cannot be opened for loading

I am using Azure Machine Learning and Azure Databricks. In Azure Databricks I have a script.py written by %% command (%%write script.py). In this script I tried to load cc.fr.300.bin that is saved as a model in Azure Machine Learning. I did…
codelifevcd
  • 175
  • 1
  • 10
-1
votes
1 answer

Find list of Out Of Vocabulary (OOV) words from my domain spectific pdf while using FastText model

How to find list of Out Of Vocabulary (OOV) words from my domain spectific pdf while using FastText model? I need to fine tune FastText with my domain specific words.
-1
votes
1 answer

Are fasttext Wiki word vectors monolingual?

After reading your paper from Bojanowski et al. (2016), I went to consult the available pre-trained word vectors on the fasttext website. Here is my concrete doubt: Are these pre-trained word vectors…
mljistcart
  • 45
  • 6
-1
votes
1 answer

Fast text use (getting it up to compare word vectors)

I am a little ashamed that I have to ask this question because I feel like I should know this. I haven't been programming long but I am trying to apply what I learn to a project I'm working on, and that is how I got to this question. Fast Text has…
John B
  • 9
  • 1
-1
votes
1 answer

How to classify natural languages written in other forms of characters?

Background I would like to classify all the three phrases as Chinese, 'zh' using fastText. ["Ni hao!", '你好!', 'ni hao!'] However the trained model looks not applicable for the semantic classification. Is there any idea to do the same task with…
halt
  • 393
  • 5
  • 17
-1
votes
1 answer

Vectorize list of string with Word2Vec to feed to keras sequential layer

I am trying to built a custom made word embedding model with fastText, that represents my data (list of sentences) as vectors so I can "feed" it to a Keras CNN for abusive language detection. My tokenised data is stored in a list like this: data =…
Lisa
  • 9
  • 4
-1
votes
1 answer

FastText and Datasets in Azure ML with Python

I am running an experiment (a custom-made model created with Pytorch) in Azure ML and using FastText (not the gensim version), but met a problem: In the experiment, I have a (rather large) text file in a dataset and need to train FastText with it,…
-1
votes
1 answer

fastText with Python: Calculate Accuracy

I am using fastText with Python, which gives precision and recall, but not accuracy. How do I get accuracy from fastText? Or, alternatively, how do I calculate accuracy given precision and recall?
Software Dev
  • 910
  • 3
  • 10
  • 27
-1
votes
1 answer

Is there any unsupervised clustering technique which can identify numbers clusters itself?

I checked unsupervised clsutering on gensim, fasttext, sklearn but did not find any documentation where I can cluster my text data using unsupervised learn without mentioning numbers of cluster to be identified for example in sklearn KMneans…
user2129623
  • 2,167
  • 3
  • 35
  • 64
-2
votes
1 answer

Got error on installing pyfastext on Mac: command 'cc' failed with exit status 1

I just got an error on installing pyfasttext. cogent@MBP: pip install --user pyfasttext And then I got these: Collecting pyfasttext Downloading…
-2
votes
1 answer

fastText and word2vec: NaNs in accuracy computation code

I downloaded the pre-trained English Wikipedia vectors file (wiki.en.vec) from the fastText Github repository page, and I tried to compute the syntactic and semantic analogy task accuracies as described in the first of Mikolov's word2vec papers as…
Ricky
  • 103
  • 1
  • 2
  • 10
-3
votes
1 answer

Dictionary not influenced by input?

There's a get_dictionary() function in the fastrtextpackage, and I thought it would return all the words in the dictionary. However, when I set wordNgrams to 2 or 3, it returned exactly the same list of words as what I got when setting wordNgrams…
Sky
  • 11
1 2 3
30
31