Questions tagged [ruta]

Apache UIMA Ruta is a scripting language for natural language processing.

Apache UIMA Ruta (formerly called TextMarker) consists of a rule-based script language and an Eclipse-based workbench.

279 questions
0
votes
1 answer

How to add DocumentBlockExtension in uima ruta

When I try to use DocumentBlock, it is reported as not defined in the script block. How do I add it as an additional engine? Can anyone explain the usage of DocumentBlock in detail?
0
votes
1 answer

UIMA RUTA annotation at the beginning of sequence

I have a sequence of annotations that are instances of the same type (e.g. a sequence of CW annotations). I need to remove the first of them (more formally: remove the annotation that has no annotations of the same type before it in the document). Less formally:…
0
votes
1 answer

WORDTABLE not working properly

I have code like: WORDTABLE presidentsOfUSA = 'presidentsOfUSA.csv'; DECLARE Annotation PresidentOfUSA(STRING party, INT yearOfInauguration); Document{->MARKTABLE(PresidentOfUSA, 1, presidentsOfUSA, "party" = 2, "yearOfInauguration" =…
Prabhav
  • 447
  • 3
  • 17
0
votes
1 answer

UIMA RUTA Combination of Annotations

I am new to UIMA RUTA and after reading the UIMA RUTA Guide, I have the following question. I want to write a set of rules that will search for two annotations (FIRST, SECOND) inside my document with specific values (FIRST: "hello" and SECOND:…
Aribas
  • 57
  • 4
0
votes
1 answer

UIMA RUTA: Italics

Does anyone know how I can search for all words in a text that are italicized? And, to extend that, search for specific words that are (or are not) italicized? For example, given "I am certain that I am not mistaken", I'd like to extract certain, or…
Suds
  • 13
  • 1
0
votes
1 answer

UIMA Ruta learning sources and related reading material

I finished reading the official UIMA Ruta documentation, which is very helpful. I was wondering whether there is more material on UIMA Ruta to continue my reading (and learning). Could you suggest any other documentation/book/paper/website that you…
nikolas
  • 164
  • 7
0
votes
1 answer

Uima Ruta Rule for Below Regex

I need to annotate the cases below. I have blocks of text and need to get the sub-blocks of text that contain banks. For example, below is a complete text block; I need to annotate Bank of America as Bank Name. hereinafter described and hereinafter referred…
Gaurav
  • 139
  • 1
  • 16
0
votes
1 answer

How to use cutter concept -UIMA RUTA

How do I use the cutter concept in RUTA? I have gone through the documentation, but it is still not clear to me. Can anyone explain it briefly?
0
votes
1 answer

Finding nearest word from a character -Uima Ruta

I need to find the nearest word to a character, and that word should start with that character. For example: Sample Text: What you need is What you get WYG. From the sample text, I need to find the nearest word that starts with W.
0
votes
1 answer

Problems with UIMA Ruta Plugin - Unhandled event loop exception

I get an Unhandled event loop exception every time after trying to run a new script in the UIMA Ruta Eclipse plugin, when I test a document that was processed before with another descriptor. The only solution that worked was to…
0
votes
1 answer

Tagging Markfast

I'm using MARKFAST as shown below. Script: WORDLIST PARTICLESLIST = 'NameswithParticles.txt'; DECLARE PARTICLES; Document{ -> MARKFAST(PARTICLES, PARTICLESLIST,true)}; NameswithParticles.txt: der Der de De d' D' De la de La de…
prasanth
  • 261
  • 1
  • 10
0
votes
1 answer

UIMA RUTA TABLES

I'm trying to rename LI and TABLE, which come from the HTML conversion, like this: Document{-> RETAINTYPE(MARKUP)}; LI{->MARK(List)}; Document{-> RETAINTYPE}; That works fine. But when I use the same script for TABLE, like DECLARE TableContent; …
prasanth
  • 261
  • 1
  • 10
0
votes
0 answers

Receiving divide by Zero error in ruta

org.apache.uima.ruta.engine.HtmlConverter mapAnnotations(454) WARNING: illegal annotation offset mapping org.apache.uima.ruta.engine.HtmlConverter mapAnnotations(454) WARNING: illegal annotation offset…
0
votes
0 answers

simple date validator in RUTA

DECLARE NUM date; NUM {REGEXP("[0-9]{4}[0-1][0-9][0-3][0-9]") -> CREATE(date)}; INT number = date; How do I write the last line of code? All documentation is…
Roma
  • 41
  • 1
  • 3