Questions tagged [cleartk]

From website "ClearTK provides a framework for developing statistical natural language processing (NLP) components in Java and is built on top of Apache UIMA. It is developed by the Center for Computational Language and Education Research (CLEAR) at the University of Colorado at Boulder. Please see the conceptual overview for a broad introduction to ClearTK."

From website:

Features

  • A common interface and wrappers for popular machine learning libraries such as SVMlight, LIBSVM, LIBLINEAR, OpenNLP MaxEnt, and Mallet.
  • A rich feature extraction library that can be used with any of the machine learning classifiers. Under the covers, ClearTK understands each of the native machine learning libraries and translates your features into a format appropriate to whatever model you're using.
  • Infrastructure for creating NLP components for specific tasks such as part-of-speech tagging, BIO-style chunking, named entity recognition, semantic role labeling, temporal relation tagging, etc.
  • Wrappers for common NLP tools such as the Snowball stemmer, the OpenNLP tools, the MaltParser dependency parser, and the Stanford CoreNLP tools.
  • Corpus readers for collections like the Penn Treebank, ACE 2005, CoNLL 2003, Genia, TimeBank and TempEval.
10 questions
3
votes
1 answer

cleartk dependency not found when calling StanfordCoreNLPAnnotator from UIMA RUTA

I am trying to call ClearTK's StanfordCoreNLPAnnotator from within UIMA RUTA, but cannot get it to work. I am using eclipse with a maven-enabled RUTA project in which I also have Java code for auxiliary tasks. I have imported…
3
votes
1 answer

Parse raw text with MaltParser in Java

I found that NLKT in python does it via *raw_parse* function but I need to use Java. I found cleartk has a MaltParser wrapper but there is no documentation about it. I'm looking for a function or a project that first converts raw English text to…
Halil
  • 2,076
  • 1
  • 22
  • 30
1
vote
1 answer

How to pipeline cleartk to ruta?

ErroR MesSagE I am using cleartk and Ruta for my project.And I want to connect ClearTk with Ruta using SimplePipeline.How can I pass ClearTk output values into Ruta Script? Cleartk Github link is Github Example ClearTk And i am trying this…
Mari Selvan
  • 3,598
  • 3
  • 21
  • 36
0
votes
2 answers

String IN, String OUT?

I'm new to ClearTK and UIMA. So far I couldn't find any examples on how to create a pipeline where no files are involved. I'm trying to process a small text stored in a Java String variable using cleartk and UIMA, and get an XML String back (outcome…
izilotti
  • 4,757
  • 1
  • 48
  • 55
0
votes
1 answer

Cleartk - Mallet Classifier trains on 0 instances despite data being there

I am using a Cleartk (V. 2.0) simple pipeline to develop a binary classifier for individual sentences in a CAS. However, even though training data gets generated, the classifier does not pick it up during training, see below. I am working off of…
0
votes
1 answer

Why can't my Java program find svm_learn or svm_classify?

I am using ClearTK along with SVM-Light programmatically within Java. I have downloaded the package both for ClearTK integration as well as for SVM-Light itself. Whether I run via Eclipse or the command line, I keep getting Cannot find file…
demongolem
  • 9,474
  • 36
  • 90
  • 105
0
votes
1 answer

Do we need to create process() inside a new annotator?

Im creating an annotator called "NewAnnotator" and try to make it works in a pipeline with others annotators in ClearTK like: SentenceAnnotator, PosTaggerAnnotator, etc. So I want to be able to run…
user1314404
  • 1,253
  • 3
  • 22
  • 51
0
votes
1 answer

How create similar code in a project

I have a java project that uses cleartk. Now I need to make an extra code to it and test. Under their project/src/main/java/org.cleartk.syntax.opennlp/ already have some java file. I need to duplicate one of them, rename and change the code inside,…
user1314404
  • 1,253
  • 3
  • 22
  • 51
0
votes
1 answer

Cleartk: Errors initializing [class org.cleartk.classifier.jar.DefaultSequenceDataWriterFactory] Field 'dataWriterClassName' is required

My main program looks something like: public static void main(String args[]) throws UIMAException, IOException{ //TypeSystemDescription tsd = TypeSystemDescriptionFactory.createTypeSystemDescription(Question.class); AggregateBuilder…
VJune
  • 1,195
  • 5
  • 16
  • 26
0
votes
0 answers

cleartk error in instance extraction

I am trying to run introductory cleartk example for classifying documents with a few modifications and a different dataset. I get the following error: Exception in thread "main" java.lang.NoSuchMethodError:…
VJune
  • 1,195
  • 5
  • 16
  • 26