Questions tagged [carrot2]

Carrot2 is an open source search results clustering engine, not to be confused with the AMQP client (carrot)

Carrot2 is an open source search results clustering engine, not to be confused with the AMQP client (carrot).

It is implemented in Java and can automatically organize small collections of documents (search results but not only) into thematic categories.

107 questions
0
votes
1 answer

how to specify use of STC algorithm in Solr admin console?

I have a test Solr environment using Carrot2 on Ubuntu. With the Carrot2 workbench I can alternate between the three defined algorithms (Lingo, STC, kmeans). How do I do the same thing in the Solr admin query tool? is it an argument passed with the…
0
votes
2 answers

terms relation & score from solr

I have below person and his skill data already indexed in solr Candidate, Skills ----------------- 1, Java, JSP, Servlet, Spring, Hibernate 2, Java JSP, JDBC 3, Java, JDBC, RMI 4, JDBC, SQL 5, .Net, C# from above I would like build terms…
Jigar Parekh
  • 6,163
  • 7
  • 44
  • 64
0
votes
1 answer

User defined dictionary in lingo3g for Elasticsearch wrt label/word/synonym

While browsing the lingo3g manual I came across with http://download.carrotsearch.com/lingo3g/1.9.0/manual/#chapter.lexical-resources Which states that we can customize the name of the label as per pre defined Word/Label dictionary. So I have some…
Prashant
  • 17
  • 1
  • 4
0
votes
1 answer

Sequential clustering with Carrot2?

its possible to do "sequential clustering" with carrot2. What i mean with sequential clustering is, let say I have a set of documents that im clustering, and then after a some time new documents are ariving for clustering, is it possible to cluster…
0
votes
1 answer

Cluster list of comments

I am a beginner who just has discovered the great Carrot2 framework . I try to use the Java API to cluster a list of Facebook comments (~100 comments with about 10-200 characters) with the LingoClusteringAlgorithm. Can I use the comment text as…
Sonson123
  • 10,879
  • 12
  • 54
  • 72
0
votes
1 answer

maximum limit of documents to be clustered

Is there any limit of documents while clustering through carrot2 plugin with elastic search. That is like we can cluster at max 10000 documents using carrot2 is there any limitation like this?
Prashant
  • 17
  • 1
  • 4
0
votes
1 answer

solr clustering based on solr fields including geo-spacial location fields

Trying to use carrot2 for doing to resultset clustering. I have couple of questions with respect to this. a) Can we cluster the documents in Solr/Lucene based on the specific fields in solr? like cluster them based name, person name and geo-distance…
Ganesh
  • 573
  • 2
  • 13
0
votes
2 answers

Solr/Carrot2 Integration

SOlr/Carrot2 Integration i have multiple text files for each i created XML to index document on Solr as bellow data data here Some spanish text here
GaneshP
  • 746
  • 7
  • 25
0
votes
1 answer

Identify docs in merged bubbles

I am new to carrot2 here so please bear with me. I am using carrot2workbench 32-bit in order to use Aduna cluster map. It's helpful to see the relationships among the clusters, but how do you identify the documents in an overlap (merged bubble)…
0
votes
1 answer

Number of clusters obtained using carrot2 inconsistent on the same data set

I am using carrot2 for clustering a set of 500 emails. I am using the BisectingKMeans algorithm provided by carrot2. On the same data set, when I specify k = 9, only 6 are generated and when I give it to run with 8 clusters, 7 are generated –…
afs
  • 167
  • 1
  • 9
0
votes
2 answers

Document tagging

I have very huge solr index. I want to tag all documents with terms which better represent that document like this. Does this type of clustering results is also come under document tagging? Which approach is better, Index time Document tagging or…
0
votes
1 answer

carrot2 Workbench and filter Query fq

i need to add clusteringComponent from solr to my application. For tuning the clusteringComponent i use the Carrot2 workbench, which is very useful to find out what ar the right parameters. The point is: i only like to analyse / cluster new data,…
The Bndr
  • 13,204
  • 16
  • 68
  • 107
0
votes
1 answer

Carrot2 license for commercial project without website&documentation

According to carrot2 FAQ we should acknowledge project users about using Carrot2. Can I use Carrot2 in a commercial project? Yes. The only requirement is that you properly acknowledge the use of Carrot2 (on the project's website and…
yura
  • 14,489
  • 21
  • 77
  • 126
-1
votes
1 answer

carrot2 and solr for local indexing

i have indexed a local folder with solr 6.5 with data_driven_schema_configs schema i have install carrot2 web 3.15 and carrot workbench it's ok : all work well i want to open any one file in the files list result ... when i use workbench (on…
fmarin
  • 1
-1
votes
1 answer

How to export cluster names and values from Carrot2 workbench?

I just started working with Carrot2 Workbench to perform clustering on search engine results. The visualizations are nice, but I am looking to export the list of clusters and their document counts. In addition, I'd be interested in possibly…
andrew citera
  • 385
  • 1
  • 2
  • 7