Questions tagged [tess4j]

Tess4J is a Java JNA wrapper for Tesseract OCR API.

Description

A Java JNA wrapper for Tesseract OCR API.
Tess4J is released and distributed under the Apache License, v2.0.

Releases Versions.

  • Version 1.3 (released : May 31, 2014)
  • Version 2.0 Beta (released : June 1, 2014)
  • Version 3.4.3 (released: 14 January 2018)

Features:

The library provides optical character recognition (OCR) support for:

  • TIFF, JPEG, GIF, PNG, and BMP image formats
  • Multi-page TIFF images
  • PDF document format

Related tags

Links

Tess4J homepage
Tess4J Github

222 questions
1
vote
1 answer

tesseract is giving highly inconsistent result

I want get result of match which is in format of image . Below is the code I'm using to read text from image .I have used python code it also gives same result. How can i improve the output or is there any other better way for my problem . …
gaurav singh
  • 1,376
  • 1
  • 15
  • 25
1
vote
2 answers

Tesseract / Tess4j memory leak

We are trying to use Tesseract with Tess4j for OCR text extraction. On continuous use of tesseract over a period, we notice the RAM used by the application getting increased gradually, During this time, The heap memory is still free. We monitored…
aravinth
  • 416
  • 1
  • 5
  • 20
1
vote
0 answers

Java OCR Not reading clearly obvious numbers

picture of processed image + original I'm working on a project where I'm taking Smash Bros. game output, taking a screenshot, processing it in order to detect the percents which the characters are sitting at. The program I wrote is detecting the 57…
Frolic
  • 11
  • 2
1
vote
2 answers

Exception in thread "main" java.lang.NoClassDefFoundError: com/sun/jna/Pointer

I am trying to use tess4j into my java program. Here is my code: `import java.io.File; import net.sourceforge.tess4j.Tesseract; import net.sourceforge.tess4j.TesseractException; class Test { public static void main(String[] args) { …
Shobhit Tewari
  • 535
  • 5
  • 20
1
vote
1 answer

Why is tessdata throwing Exceptions?

All is happening on MAC in SpringBoot WAR file deployed in Tomcat: I have tesseract 4.1.0 installed (via brew on Mac). For Tessdata - OCR on Mac net.sourceforge.tess4j
Morkus
  • 517
  • 7
  • 21
1
vote
1 answer

Tess4J, not finding tessdata

So I am working on this project and I need to use tess4J inorder to read some text on some images. Unfortunatly i can not seem to resolve the following error: Error opening data file C:\Users\****\eclipse workspace\****\tessdata\eng.traineddata…
1
vote
0 answers

OCR not recognising text against a black background in Tess4j

I am not able to extract text against a black background as shown in the attached image. Everything else is getting recognised. Can somebody please help? public static void main(String args[]) { scrShotDirPath = new…
Prakash P
  • 419
  • 1
  • 8
  • 18
1
vote
1 answer

How to whitelist characters in tess4j version 4.1.*

The objective is to read number specific data (1,2,...,9,0) from an image. For this, I'm using Tess4j version 4.1.1.
pritamprasad
  • 159
  • 1
  • 13
1
vote
1 answer

How can I get simple Text from image?

I have a image captcha I already use Tess4j, but it cannot return perfect result. I want to get text from simple image captcha same a picture. Thanks for your help.
kien vu
  • 75
  • 2
  • 9
1
vote
3 answers

How to extract text from PDF image

I wanted to extract data from PDF which has image, and the image is form where letter will be inside small boxes for example, name : t e s t, here each and every word will be inside square box. I have tried tesseract OCR could not get the desired…
1
vote
1 answer

How to set language in Tess4J using language from mobile device?

I am using a web service to read image files and return me some text on it using Tesseract. As we know Tesseract/Tess4j supports languages input in the ISO 639-3 format (ie: eng, spa, deu, ara, etc...) but the language I get from the mobile device…
Francisco Souza
  • 806
  • 15
  • 38
1
vote
0 answers

Android application and java application produces different results for same image

I am developing OCR application for two different platforms. Android Windows I am using android studio for android application development using Tess-two library version 5.4.1. Following is my code that I am using for text recognition from…
1
vote
0 answers

How do I make Tess4J accurately read TIFF image after threshold has been applied?

I am very new to Tess4J so my knowledge is somewhat limited. I am currently using Tess4j to do OCR on an image that has already had a threshold applied to it using OpenCV. I'm using OTSU to threshold the image, and cropping out most noise, but…
1
vote
0 answers

Tess4j, Java : Tesseract causing JVM crash

I am working on integrating OCR functionality in our application. Currently whenever I run doOCr method, JVM crashes and entire application is stopped. I checked out and found this, but the answer is very unclear as it just tells to create bean, but…
We are Borg
  • 5,117
  • 17
  • 102
  • 225
1
vote
0 answers

net.sourceforge.tess4j is throwing wrong results when reading data from image

I am trying to work with OCR (Optical Character Reorganization). I have a sample image and i want to read data out of it. Below is my sample image file. I have used tess4j API to read the text from image. Please find the below piece of code. public…
Manindar
  • 999
  • 2
  • 14
  • 30