Questions tagged [carrot2]

Carrot2 is an open source search results clustering engine, not to be confused with the AMQP client (carrot)

Carrot2 is an open source search results clustering engine, not to be confused with the AMQP client (carrot).

It is implemented in Java and can automatically organize small collections of documents (search results but not only) into thematic categories.

107 questions
0
votes
0 answers

Can't run Carrot2 workbench 3.11.0 on Windows (8 and 10)

Java is installed (version 8). My friend asked for help in this question but i can't find any answer. If someone know how to run this program please tell me. Windows 8 - 64-bit architecture, Windows 10 - 32-bit. P.S. Sorry by my language (=
Ajmda
  • 1
0
votes
1 answer

solr hierarchical clustering

I'm trying to enable hirearchical clustering (sub clusters generation) in Apache SOLR. For this I'm using the SOLR Clustering Component, setting the "outputSubclusters" parameter to true. However, when I show the output in JSON, the object I receive…
Omar Perez
  • 23
  • 3
0
votes
1 answer

carrot2 api not support japanese language

I am trying to use carrot2 API to cluster documents in japanese language. It throws out this WARN: org.carrot2.text.linguistic.DefaultTokenizerFactory: Tokenizer for Japanese (ja) is not available. This may degrade clustering quality of Japanese…
0
votes
2 answers

Where to find a Carrot2 C# API?

I'm been trying to use Carrot2 (clustering engine) in a C# project. They state that they offer a C# API, but the download links[1] are broken (404 - not found). Their support page[2] suggests to ask questions on SO, so I thought I'll post…
Nick
  • 1,365
  • 2
  • 18
  • 37
0
votes
3 answers

no results found plugin carrot2+elasticsearch

Good afternoon! Can you help me please with some error? I had old version 1.3.4 of Elasticsearch and installed plugin Carrot2 Clustering plugin 1.7.0. I upgraded Elasticsearch to version 1.5.0 and plugin Carrot2 Clustering plugin 1.8.0. Now I'm…
mariya24
  • 1
  • 1
0
votes
1 answer

Carrot2 FoamTree API

Iam working on Carrot2 FoamTree API especially working on Large Hierarchial clustering and I need to know if there is a way to post the leaf node as a hyperlink that when clicked opens the link in a new window??
0
votes
1 answer

Carrot2 workbench not able to process large data

I wanted to cluster my data-set using carrot2 workbench. I have an input xml file with 65536 documents. I am using Lingo clustering algorithm. But when I start the process, the workbench returns the result within few seconds having all the…
vishalaksh
  • 2,054
  • 5
  • 27
  • 45
0
votes
1 answer

Force or boost words in carrot2 clustering labels

I am using Carrot2 to cluster query results from Solr. Is is possible to force (or at least boost) the occurrence of certain words in the labels, in either Lingo, STC or k-means? With Lingo, this is already possible with the option "Title word…
tkja
  • 1,950
  • 5
  • 22
  • 40
0
votes
1 answer

Tokenizing cluster labels of Carrot2 Lingo Clustering on Solr

I use the Carrot2 Lingo Clustering Algorithm to Cluster my Solr search results. Now I want to process the clustering labels further and therefore I need to tokenize the labels to get them one-by-one per label. Is there some kind of post tokenizer…
user1337
  • 460
  • 6
  • 22
0
votes
2 answers

Carrot2 3.9.4 upgrading to Lucene 4.7+

Getting an error when trying to connect to a Lucene index above 4.7: Processing error: org.apache.lucene.index.IndexFormatTooNewException: Format version is not supported (resource: MMapIndexInput(path="C:\kcp-search-data\kcp-index_2.cfs")…
0
votes
2 answers

Can't run Carrot2 Document Clustering Workbench for 64-bit Linux (version 3.9.4) on my Ubuntu 14.04

I've downloaded Carrot2 Document Clustering Workbench for 64-bit Linux version 3.9.4 and tried to run it on my Ubuntu GNU/Linux 14.04. First I've received an error: org.eclipse.swt.SWTError: No more handles [Unknown Mozilla path (MOZILLA_FIVE_HOME…
Emre Sevinç
  • 8,211
  • 14
  • 64
  • 105
0
votes
1 answer

What is the maximum score value for lingo algorithm?

I want to produce a word cloud of the cluster results, but to do that, I will need to know the maximum score value that the Lingo algorithm has. Does anyone know the answer?
0
votes
1 answer

Carrot2 documents similarity and how are the ordered documents indexes in the tf-idf matrix?

I'm trying to determine the similarity between two documents using carrot. Is it possible get this similarity directly from the framework? Additionally I've been studying the tf-idf matrix and realized that the rows correspond to the stemmed all…
msoares
  • 13
  • 4
0
votes
1 answer

Using the carrot2 workbench without specifying a query

I would like to use the workbench to do some tests but I could not understand how to run it without specifying a query. I would like to make the cluster of documents, without having to specify a query using the workbench. Is it possible? Tanks
msoares
  • 13
  • 4
0
votes
1 answer

carrot2 pubmed read timed out

I just started using carrot2 directly today (I have previous experience using it as part of @note2). I am able to search the web and Wikipedia through either http://search.carrot2.org or the workbench application. When searching the pubmed source,…
Mark Miller
  • 3,011
  • 1
  • 14
  • 34