Questions tagged [gate]

GATE is a Java framework for developing natural language processing applications.

GATE (a General Architecture for Text Engineering, http://gate.ac.uk) is a family of open-source tools for developing natural language processing applications. The GATE family includes a number of inter-related components, and this tag is appropriate for questions about any of them:

  • GATE Developer, a graphical development environment for NLP applications.
  • GATE Embedded, the Java library underlying GATE Developer, allowing applications developed in Developer to be embedded into other Java applications.
  • GATE Teamware, a web-based collaborative annotation and data curation platform.
  • GATE Mímir, an indexing system supporting annotation- and ontology-based search over large text collections that have been processed by a GATE Embedded application.
  • GATECloud.net, a service for running GATE Embedded applications over large corpora using cloud computing.

GATE Developer, Embedded and Mímir are licenced under the Lesser GPL, and Teamware under the Affero GPL.

For more information about the GATE family of tools, see

293 questions
1
vote
2 answers

Gate- add annotation to entire document

I am trying to do document classification with gate. For that I need to annotate the entire document with one type of annotation. Can anyone please tell me how to do that?
Chamath Sajeewa
  • 300
  • 1
  • 14
1
vote
1 answer

Why does Gate.init() call throws java.lang.NullPointerException?

I'm new to GATE, ANNIE & Info Extraction(IE). I'm trying to use ANNIE to do some IE. I installed GATE on Ubuntu 14.*, GATE Developer 8 worked perfectly. I'm trying to use GATE Embedded from my Java (JDK 1.8) project in Eclipse (Luna). I created a…
1
vote
0 answers

Developing GATE plugin in Java: org/apache/log4j/Logger error

I'm a Java beginner and have a question similar to the ones here: Java, How to add library files in netbeans? and Caused By: java.lang.NoClassDefFoundError: org/apache/log4j/Logger I followed the answers given there, which were very good, but I…
user3729787
  • 121
  • 2
  • 11
1
vote
3 answers

extract annotated text from text file by using java code

I have annotated text file on the following format: Buffer Overflow in client/mysql.cc in Oracle MySQL and…
1
vote
1 answer

GATE API and JAPE code, return empty result

I used GATE API with java code and tried to run one of the known JAPE rules on text of document but unfortunatly I could not get the appropriate results. My code as following: public void initAnnie() throws GateException, IOException { …
Tech
  • 77
  • 11
1
vote
1 answer

Annotating a document with JAPE

I have been searching for a solution to this for weeks, I have some documents(about 95) that I am trying to classify using GATE. I have put them in one corpus I called training_corpus, however, after ANNIE has annotated the corpus, I have to go back…
tigg
  • 107
  • 8
1
vote
1 answer

Apache Tika in Gate Embedded

So i need to load a pdf document for my gate embedded app. I try to parse the pdf to string with apache tika but the ANNIE tool of gate can't make find annotations in the string. I've heard about tikaformat, but can not find any examples where…
Respino
  • 11
  • 2
1
vote
1 answer

GATE Tagging annotations for Numbers,Currency, Money

Please help. I have some text " 99285/25 EMERGENCY SE 850.0 650.00 04/05/12 INTERPRET 813.44 $36.00 04/05/12 CARLOS " I need to tag 650.00 also as Money, Gate automatically picks up $36.00 as Money tag. How do I specify it? Imports: { import…
Sanjeev
  • 49
  • 9
1
vote
1 answer

Gate - extracting individual annotated text from annotation set

I'm trying to extract the individual text values of an annotation set which are generated by the default ANNIE processing resources. When i iterate through the annotation set each entry only gives the start and end position which the annotation…
cdugga
  • 3,849
  • 17
  • 81
  • 127
1
vote
2 answers

GATE annotation counting

I am trying to figure out a way of counting some annotations in GATE e.g. if I have some annotations occurring multiple times in a text document and if I want to count it, is there some sort of plugin that can help me? Thanks
user2910192
  • 41
  • 1
  • 3
1
vote
1 answer

Stanford Tagger with GATE

I am using GATE for Arabic under Linux and when using Stanford Tagger there aren't any POS tags. All parameters were not changed (as default) what can I do to perform that task? Does this problem depend on the parameters ? What installation…
1
vote
2 answers

How to custom tag word(s) in GATE JAPE grammar?

I have a set of documents and each document has different heading. Example if document heading says "Psychological Evaluation" I want to tag the document as "Medicalrule". I loaded the document and loaded ANNIE with defaults. In Processing…
Sanjeev
  • 49
  • 9
1
vote
0 answers

Read and Convert Data in XML file (to annotated with Gate Embedded) in another format

I call my Application from GUI Developer Gate in my programme Java,and i to annotated over my document into format XML.But now i could not extract my data in XML file .Can you help me please ? Corpus corpus = Factory.newCorpus("Processing CV"); …
Vampir
  • 31
  • 5
1
vote
1 answer

GATE Embedded sample Example NoClassFound Error

I am new to GATE Embedded , i tried simple example and getting NoClassDefFoundError. First i will explain what i tried Donwload and Extract Gate 7.0 in D:\project\gate-7.0 Netbean create new project --> added library gate.jar and lib folder Tried…
GaneshP
  • 746
  • 7
  • 25
1
vote
2 answers

GATE write annotation ID as a feature

I was wondering is someone can help me out here. I think this could be of use for anyone trying to conduct machine learning on GATE (General Architecture for Text Engineering). So basically to conduct machine learning I first need to add some code…