Questions tagged [gate]

GATE is a Java framework for developing natural language processing applications.

GATE (a General Architecture for Text Engineering, http://gate.ac.uk) is a family of open-source tools for developing natural language processing applications. The GATE family includes a number of inter-related components, and this tag is appropriate for questions about any of them:

  • GATE Developer, a graphical development environment for NLP applications.
  • GATE Embedded, the Java library underlying GATE Developer, allowing applications developed in Developer to be embedded into other Java applications.
  • GATE Teamware, a web-based collaborative annotation and data curation platform.
  • GATE Mímir, an indexing system supporting annotation- and ontology-based search over large text collections that have been processed by a GATE Embedded application.
  • GATECloud.net, a service for running GATE Embedded applications over large corpora using cloud computing.

GATE Developer, Embedded and Mímir are licenced under the Lesser GPL, and Teamware under the Affero GPL.

For more information about the GATE family of tools, see

293 questions
1
vote
0 answers

How to integrate inline-annotated tags into standoff annotation?

I am doing some custom annotation on the GATE platform. Let’s say I have a sentence to tag below. Raw text the McDonald's is '麦当劳' in Chinese. so "I want some McDonald's" is "我想吃麦当劳" in Chinese. Using GATE, I tagged 4 specific tokens (McDonald’s…
user2870222
  • 269
  • 1
  • 3
  • 13
1
vote
1 answer

Parsing either font style or block of paragraph in GATE

I have a word document. I need to match particular table section or heading section of it using GATE. I thought if there were any steps from where we can first check any font size or font style of the heading and then match rest of the content till…
Sumit Ramteke
  • 1,487
  • 1
  • 16
  • 39
1
vote
2 answers

How do we deal with a large GATE Document

I'm getting Error java.lang.OutOfMemoryError: GC overhead limit exceeded when I try to execute Pipeline if the GATE Document I use is slightly large. The code works fine if the GATE Document is small. My JAVA code is something like this: TestGate…
Identity1
  • 1,139
  • 16
  • 33
1
vote
0 answers

Gate : Reinitialising (ANNIE) Gazetteers at runtime

Good Afternoon I have an application that I am building that takes a users input and converts it in to a gazetteer. I'd really like to be able to reinitialise a gazetteer when I run my app over a single documentas the gazetteer may have changed. I…
1
vote
1 answer

ANNIE plugin to recover original text/position after normalization

We are using ANNIE plugin "Document Normalizer" to remove stopwords and other punctuation symbols, later we call Gazetteer to process the normilized text and in the last step, we need some plugin to recover original text/position for each…
Valijon
  • 12,667
  • 4
  • 34
  • 67
1
vote
1 answer

Cannot load Creole.xml GATE

I am trying to embed GATE to my java project. I've read the GATE guideline on how to build a GATE standalone application. However, I got many errors saying that the creole.xml of each plugins (ANNIE,Twitter, etc) can't be loaded. Here is my…
bohr
  • 631
  • 2
  • 9
  • 29
1
vote
2 answers

How to remove a Lookup from DefaultGazetteer programatically

I need to teach Gazetteer by adding/removing words. I know how to add new Lookup but when I've tried to remove it, the Lookup was not removed. gazetter.remove("string to be found"); // returns false Any help, please!
Valijon
  • 12,667
  • 4
  • 34
  • 67
1
vote
1 answer

Gate Developer and Gate Embedded difference in output

I have developed an application using Gate Developer, which apply paum algorithm and display the results in a new annotation set called "output" having an annotation called "comment". Then, I imported this application on Gate Embedded. However, the…
celineu
  • 576
  • 1
  • 5
  • 18
1
vote
1 answer

GATE - Treetagger: Token.category still in English

Due to some reasons, I needed to install Gate on another computer. So I needed to transfer the Gate work I had done until now on the other computer. Everything seem to work fine. I downloaded Treetagger for french and it runs without errors popping…
celineu
  • 576
  • 1
  • 5
  • 18
1
vote
0 answers

Nature of Javadocs, and importing them to NetBeans

I have downloaded GATE, a set of tools to help with natural language processing, with the intent of creating an named entity extractor in Java. I have successfully imported the jar files into my Netbeans project. The Javadocs are not being…
1
vote
1 answer

Compare between different NER systems in GATE

I am new to GATE. I was trying to analyse the performance of different tools on a wide range of corpus. The problem is the diff tool or corpus QA tool require the annotation sets to be identical -even case sensitive. Indeed, each system has its own…
1
vote
1 answer

How to build Jape rules in gate

I need to build a rule where Lhs check if the first character of word beggin in b then check the whole word without the first character that found in lookup
1
vote
1 answer

Stanford_CoreNLP plugin for gate

I'm searching a Stanford_CoreNLP plugin with Stanford NER(not StanfordParser or StandfordPOSTagger) for GATE (General Architecture for Text Engineering). I found some information about the plugin here. But I couldn't find it integrated with GATE…
Sampath Liyanage
  • 4,776
  • 2
  • 28
  • 40
1
vote
2 answers

Artificial Intelligence : How to route sentence to action

I'm a beginner in the field of artificial intelligence... I can use GATE or any other Natural Language Processing but I don't have an answer for this : Do you know how to evaluate how 2 sentences can be close? even with a large data set? Do you have…
merlin
  • 122
  • 2
  • 6
1
vote
1 answer

remove stopwords with groovy

I want to remove stopwords from a given text with GATE. Therefore I use a Tokenizer and a Gazetteer: The Gazetteer returns me the stopwords which I want to delete. I think there is no GATE plugin for deleting words, isn't it? So I want to do it with…
Munchkin
  • 4,528
  • 7
  • 45
  • 93