Questions tagged [top2vec]

6 questions
0
votes
0 answers

Top2Vec model returning TypeError: 'numpy.float64' object cannot be interpreted as an integer

I'm trying to train a top2vec model and come up against either the issue of not having enough documents which I rectify by concatenating the dataframe with itself etc. Then upon training the model the Type Error comes up. I can't find where the…
Magnetar
  • 85
  • 8
0
votes
0 answers

Adding new documents with document IDs to a trained top2vec topic model in Python

When adding new documents to top2vec topic model, I am getting the following error: # Here is the building model: # df["text"] is text column of dataframe df from top2vec import Top2Vec model = Top2Vec(documents=df["text"]) # document_ids not…
Sam S.
  • 627
  • 1
  • 7
  • 23
0
votes
1 answer

Installing top2vec package, particularly in H2O Notebooks, and the error

After installing python top2vec package in H2O notebooks (!pip install top2vec), I am getting the following error when importing top2vec: import top2vec ValueError: numpy.ndarray size changed, may indicate binary incompatibility. Expected 96 from C…
Sam S.
  • 627
  • 1
  • 7
  • 23
0
votes
1 answer

Azure ML Studio Job Resources

I'm currently training a Top2Vec ML model on a CommonCrawl news dataset in Azure ML Studio. When running my Python code inside a ipynb Notebook in ML Studio itself (online) the CPU is being fully used (100% workload) but when executing my script as…
Luca
  • 11
  • 1
  • 4
0
votes
0 answers

Answered in comment - How do I find the topic of news article using an already trained Top2Vec model?

I'm trying to build a news recommendation system for myself using Top2Vec topic modeling. Given the amazing news datasets, it isn't too difficult to actually train the model, but I'm unsure of how to categorize a novel article. Top2Vec has the…
AjS
  • 13
  • 4
0
votes
0 answers

top2vec - explanation of get_documents_topics function behavior

Need explanation on what get_documents_topics(doc_ids, reduced=False, num_topics=1) does. Get document topics. The topic of each document will be returned. The corresponding original topics are returned unless reduced=True, in which case the…
mon
  • 18,789
  • 22
  • 112
  • 205