Questions tagged [ruta]

Apache UIMA Ruta is a scripting language for natural language processing.

Apache UIMA Ruta (formerly called TextMarker) consists of a rule-based script language and an Eclipse-based workbench.

279 questions
2
votes
1 answer

UIMA RUTA - How to combine annotations?

I built some custom annotations on sample phrases in order to recognize keywords. The problem that I have right now is that I can't combine these annotations to build upper-level patterns of annotations. eg. the phrases are: 1."I want to buy…
Tom W
  • 21
  • 2
2
votes
1 answer

Debugging Apache UIMA Ruta scripts

I work on a project that will use Apache UIMA Ruta to process text fragments/documents. Our project will include a lot of Ruta scripts, maybe some larger complex ones by using imports. I there a way to debug the scripts? In ideal situation we can…
Jasper Huzen
  • 1,513
  • 12
  • 26
2
votes
2 answers

Match all occurrences from the same dictionary in one line in UIMA RUTA

I have type Dog, which has features size, color and other Size wordlist: big, medium, small Other wordlist: old, fat, happy, lazy Color wordlist does not affect anything in this case, so I won't list it. And my code (just for the main file): PACKAGE…
eawer
  • 1,398
  • 3
  • 13
  • 25
2
votes
1 answer

UIMA Ruta: set feature with MARKONCE action

MARKONCE: I have a sequence of sentences ending with question sign, and I want to annotate the entire span at once. So, I use MARKONCE to accomplish this task. Now, I also need to set the feature for this newly created annotation. How to set the…
Viorel Morari
  • 537
  • 3
  • 10
2
votes
1 answer

UIMA RUTA wordlist matching issue

I am trying to match some multi-word tokens using UIMA RUTA 2.6.0. And there are some phrases that are partially equal to each other, e. g. in the same file I has following entries: "includes the", "include the", "in this", "in the". There is next…
2
votes
1 answer

UIMA Ruta run errors using DKProCore Part-of-Speech Tagger (German Novels tutorial)

When following this tutorial on UIMA Ruta for German Novels step by step, I get the following error upon running Main.ruta: *SEVERE: Exception occurred org.apache.uima.analysis_engine.AnalysisEngineProcessException (...) Caused by:…
2
votes
1 answer

Ignore Self Part of Self

I'm trying to detect person names based on dictionaries : Dictionary Start David Alberto David Alberto Dictionary End Input Start David Alberto Input End I want to anotate David Alberto as Person Name not David or Alberto. I'm using below…
2
votes
2 answers

Detect full word match only from dictionary

I'm trying to detect city names from dictionaries But if there is any word without spaces it's also get detected. For example: Dictionary Start New York .. Dictionary End Input Start New York NewYork Input End What i want to anotate New York as City…
Gaurav
  • 139
  • 1
  • 16
2
votes
1 answer

Annotate Data in between Markup

I'm trying to write a rule to detect Data in between Markup tags. Input data format is fixed for example <1> Lorem ipsum dolor sit amet, consectetuer adipiscing elit. Aenean commodo ligula eget dolor. Aenean massa. Cum sociis natoque penatibus et…
Gaurav
  • 139
  • 1
  • 16
2
votes
1 answer

UIMA Ruta: Creating new annotations by combining existing annotation's features in plain Java

I'm trying to convert the following logic into a UIMA Ruta Rule: Sentence {->NewAnnotation} IF Sentence.part1 contains Constituent.label="VB" AND Sentence.part2 contains Constituent.label="VBZ" In other words, I need to create a new annotation out…
Viorel Morari
  • 537
  • 3
  • 10
2
votes
1 answer

Memory problems, while using UIMA Ruta

The problem: I'm using UIMA Ruta (v2.3.1) in one of my projects, but now I'm facing a problem: The memory exceeds explainable sizes, but it can't be figured out, where this problem is located, except for the class…
D. Bauer
  • 21
  • 2
2
votes
1 answer

How to access Array in UIMA-RUTA

I have an Annotation class with String Array as one of the field. I want to add and remove string elements to that String Array from ruta script. I searched for FSArray but didn't got anything. Please help me with solving above problem.
2
votes
1 answer

Ruta in UIMA Environment. Working with predifined collections/sets and lexicons in plain Java

I'm a beginner with Ruta and the idea I'm trying to grasp now is how to handle, within UIMA environment(in plain Java), the class variables/collections. I've tried following the examples given in the documentation; but the Ruta rules are applied…
Viorel Morari
  • 537
  • 3
  • 10
2
votes
1 answer

UIMA Ruta Only annotate longest match

I'm using RUTA and wrote a lot of different rules for extracting the same entity. For example, I will extract the "toilet paper factory". At the moment my rules will result in toilet paper factory, paper factory and factory. But I'm only interested…
PascalTurbo
  • 2,189
  • 3
  • 24
  • 41
2
votes
1 answer

UIMA Ruta how to annotate a part of input string

I have an input string (ex. 'abcdefgh'), all i need is to make an annotation with a part of this string (ex. 'cd'), in other words i need to substring an input string and mark an annotation with it. How can i do this?
1 2
3
18 19