Questions tagged [ruta]

Apache UIMA Ruta is a scripting language for natural language processing.

Apache UIMA Ruta (formerly called TextMarker) consists of a rule-based script language and an Eclipse-based workbench.

279 questions
3
votes
1 answer

How/are you supposed to use the DKPro libraries with UIMA Ruta?

I have studied the default UIMA Ruta Workbench Eclipse project enough to significantly understand its moving parts - for instance, why the input/ and output/ folders behave as they do, how to accomplish the project using the jcasgen and other Maven…
tacos_tacos_tacos
  • 10,277
  • 11
  • 73
  • 126
3
votes
1 answer

Setting feature value to the count of containing annotation in UIMA Ruta

I've got a RUTA script where all the sentences have been annotated with a Sentence annotation and various words and phrases have been annotated with their own specific annotations. That all works as expected. Each one of those annotations has a…
Nick Collier
  • 1,786
  • 9
  • 10
3
votes
2 answers

UIMA Ruta : script for the combination of chars and numbers

I've just started with Ruta and I would like to write a rule that finds any combination of chars, numbers and dot (.) . (JAVA Regex for it - ([a-z0-9.]+) ) for e.g. - abcd.03ef0.3abc 03a.bcd.03eeff903a.bc
user3778893
  • 125
  • 5
3
votes
1 answer

How should I use UIMA Ruta to match the all words between line break?

Thank for any strong hands! I have some text like the following aaaaa aaaa aaaaa aaaaaa bbbbb bbbbb bbbb bbbbbb cccccc ccccc ccccc cccccc I want to use Ruta to create annotation that matches all strings between line break. I want my annotation to…
Cheung Brian
  • 715
  • 4
  • 11
  • 29
3
votes
1 answer

UIMA RUTA - how to do find & replace using regular expression and groups

RUTA newbie here. I'm processing a document using RUTA and have a lot of normalization to do before I can start annotating. I'm trying to find the best way to do a Find and Replace of sequence of characters using regular expressions and groups on…
3
votes
1 answer

NoSuchMethodError when running UIMA Ruta script from UIMAFIT SimplePipeline

I am trying to run an existing UIMA Ruta analysis engine from a UIMAFIT simple pipeline using the following code: File specFile = new File("MyEngine.xml"); XMLInputSource in = new XMLInputSource(specFile); ResourceSpecifier specifier =…
3
votes
1 answer

cleartk dependency not found when calling StanfordCoreNLPAnnotator from UIMA RUTA

I am trying to call ClearTK's StanfordCoreNLPAnnotator from within UIMA RUTA, but cannot get it to work. I am using eclipse with a maven-enabled RUTA project in which I also have Java code for auxiliary tasks. I have imported…
2
votes
1 answer

Mark first sentence in a document

I already have marked Sentences in a document (type: Sentence). How can I mark the first one as the document title? I already tried Sentence{->MARKFIRST(DocTitle)};, but this didn't work. All the best Philipp
2
votes
1 answer

UIMA Ruta version 3.1.0

I've upgraded the Uima Ruta version to 3.1.0 in Ruta Workbench. I tried with normal Ruta project its working fine but while convert into maven project i'm getting below error. Exception in thread "main"…
prasanth
  • 261
  • 1
  • 10
2
votes
1 answer

UIMA Ruta: JCas cannot be initialized/Script cannot be found

I try to integrate a UIMA Ruta script (Main.ruta) in an uimaFIT-pipeline (inMain.java) but the script cannot be found. This is my project…
TheFrisbee
  • 77
  • 8
2
votes
1 answer

How to extract ID and Date from two substrings using Apache UIMA Ruta?

how can I extract the 2 ID and Date numbers in the following text using Ruta: ID:1341234 Date:20191021 I tried the following: RETAINTYPE(WS); "ID:" n:NUM{-> CREATE(Entity, "label" = "ID", "value"=n.ct)}; "Date:" n:NUM{-> CREATE(Entity, "label" =…
2
votes
0 answers

Why does the annotation test environment of UIMA-Ruta count some annotations as True Positive and False Positiv

I would like to evaluate my ruta script using the annotation test environment provided by the uima ruta plugin for eclipse. I've built a manually-annotated corpus. Some annotations are evaluated as true postives as well as false positives, which…
MichaDe
  • 41
  • 3
2
votes
0 answers

Issue while applying COUNT condition in UIMA RUTA

I used COUNT Condition to find the number of punctuations in an annotation.But I didn't received the expected output. DECLARE Sentence(INT pmcount); "Conflicts of interest"->Sentence; DECLARE…
2
votes
1 answer

UIMA Ruta: Check if feature is empty/undefined

Within a UIMA Ruta script, I would like to check whether a feature of an annotation has been set/defined/is not null (whatever you call it). The feature is itself not of primitive type, but an "Annotation". Is there a way to do that so that this…
Andreas
  • 23
  • 4
2
votes
1 answer

How to set the parameter dictRemoveWS to true in Apache Ruta

dictRemoveWS is related to Wordlist. I want to remove whitespace issue in my custom ruta script taking wordlist from txt file. The documentation says to use CONFIGURE method with following example: ENGINE…
NpnSaddy
  • 317
  • 3
  • 11
1
2
3
18 19