Questions tagged [carrot2]

Carrot2 is an open source search results clustering engine, not to be confused with the AMQP client (carrot)

Carrot2 is an open source search results clustering engine, not to be confused with the AMQP client (carrot).

It is implemented in Java and can automatically organize small collections of documents (search results but not only) into thematic categories.

107 questions
0
votes
1 answer

Can I use my own labels for carrot2

I am modifying the source code of carrot for a project. Acc. to the LINGO algorithm, it first generates the most probable labels and builds clusters which best suit the labels right? So, can I input my own set of labels to the carrot to see how it…
sir_osthara
  • 154
  • 2
  • 9
0
votes
1 answer

Can I cluster document from a local file?

I have already used carrot2 for my clustering project. I integrate carrot2 with my php codes so I use DCS. My question is "can I cluster documents from a local file?" since there's an option 'From XML File' in parameter 'Document Source' in the…
0
votes
1 answer

Carrot2 dcs php example class modification

I currently have solr and carrot2 configured and working on my server. I am using the dcs example class for php provided in the DCS download from project.carrot2.org . For reference the class can be found here…
James
  • 101
  • 11
0
votes
1 answer

Carrot2 dcs webapp setup

I have been struggling with setting up Carrot2 for use PHP, on a local machine. The plan is to have Carrot2 retrieve cluster from Solr populated by Nutch. Currently Solr and Nutch are correctly configured and I have been able to access the…
James
  • 101
  • 11
0
votes
1 answer

Cluster Labels Storage in Carrot2

Where the cluster labels are stored in Carrot2? After storing the labels, the FoamTree and Circle results are generated, but where are these labels stored? How can I get them using some code etc.?
0
votes
1 answer

Carrot2 visualization in .NET project

Is there any way that I can use Carrot2's visualization in my .Net project? I cannot find anything other than a reference saying that the source code is not available. But can we use it? Please help if you know the answer Thanks in advance!
Lilz
  • 4,013
  • 13
  • 61
  • 95
0
votes
1 answer

cluster remote lucene indec with dcs

I try to use a lucene index on a remote server as an input for carrot2 installed on the same server. Regarding the documentation this should be possible with carrot2-dcs (documentation chapter 3.4 Carrot2 Document Clustering Server: Various document…
0
votes
1 answer

Building carrot2 source code in Java

I'm trying to compile Carrot2 code in Eclipse but I'm getting a lot of warnings: .apt_generated/ is missing from source.. etc Do you have any idea please?
Lilz
  • 4,013
  • 13
  • 61
  • 95
0
votes
1 answer

Guide for Carrot2 in PDF

Could you please say if there is a pdf (or doc) version of the documentation? If yes, where can I find it? If such does not exist, I guess it would be great to create such. PDF allows reading the file in Mendeley Desktop program, underlining parts…
0
votes
1 answer

VectorSpaceModel Carrot2

Is it possible to get the vector space model after you have clustered your documents? I see in the documentation, it is possible to create your own Vector Space Model with: public VectorSpaceModelContext(PreprocessingContext…
WYS
  • 1,637
  • 2
  • 16
  • 37
0
votes
1 answer

Carrot2 Score, Label, Phrases and similarity?

I don't quite understand the difference between a label and a phrase in Carrot2, and they don't seem to give a clear distinction i http://doc.carrot2.org/. I tried printing them both out, but they are apparently the same(using kmeansclustering). Can…
WYS
  • 1,637
  • 2
  • 16
  • 37
0
votes
1 answer

Cannot import Carrot2 to Eclipse

I want to use Carrot2 as a library to build a new software. But when I build a new project, there is no "create project from existing source". Can I know how to use Carrot2 as a library? Thanks!
0
votes
2 answers

Carrot2+ElasticSearch Basic Flow of Information

I am using Carrot2 and ElasticSearch. I has elastic search server running with a lot of data when I installed carrot2 plugin. Wanted to get answers to a few basic questions: Will clustering work only on newly indexed documents or even old…
Pratik Poddar
  • 1,353
  • 3
  • 18
  • 36
0
votes
0 answers

ParseError when trying to send XML string to Carrot2 server

When I send this request to my Carrot2 server: http = httplib2.Http() my_url = 'http://localhost:8080/dcs/rest?dcs.c2stream=xml' xml_string = etree.tostring(xml) http.request(my_url, 'POST', body=xml_string, headers={'Content-type':…
0
votes
2 answers

Carrot2-bisectingKmeans setting the attribute of resouceLookup causes the error of no resouces named stopwords.ar

Carrot2-I tried to cluster docs through the bisectingKmeans algorithm. I set the attribute of resouceLookup and tested the language of English/Chinese. It both resulted in the error "no resouces named stopwords.ar in resouce lookup locations...".…