Questions tagged [tess4j]

Tess4J is a Java JNA wrapper for Tesseract OCR API.

Description

A Java JNA wrapper for Tesseract OCR API.
Tess4J is released and distributed under the Apache License, v2.0.

Releases Versions.

  • Version 1.3 (released : May 31, 2014)
  • Version 2.0 Beta (released : June 1, 2014)
  • Version 3.4.3 (released: 14 January 2018)

Features:

The library provides optical character recognition (OCR) support for:

  • TIFF, JPEG, GIF, PNG, and BMP image formats
  • Multi-page TIFF images
  • PDF document format

Related tags

Links

Tess4J homepage
Tess4J Github

222 questions
0
votes
0 answers

Tess4J 4.0.0 java.lang.UnsatisfiedLinkError: The specified module could not be found

Downloading and using tessData-best for tesseract-ocr github repository improved my accuracy for english on Tess4j 3.8.4 significantly. But to use that trainedData for other languages i had to upgrade to Tess4J 4.0.0. But When I did that its giving…
0
votes
0 answers

Couldn't Seek Exception and Unable to load libtesseract3051.dll in Tess4j

I need to do Character Recognition from Binary images.In order to perform that I'm using Tess4j library.While running the program I'm getting "Couldn't Seek" Exception and if I modify my code,it shows Unable to load libtesseract3051.dll.Can anyone…
Jeyanth
  • 531
  • 2
  • 6
  • 19
0
votes
1 answer

How to ignore special characters in Tesseract OCR using java

I have extracted text from image through Tesseract OCR using java. But the output is consisting of some special characters because image contains some symbols. I want to ignore all the special characters and display just text. Is there any way that…
jasmine shini
  • 11
  • 1
  • 2
0
votes
0 answers

Tessaract(tess4j) - Performing OCR over byte array

I am working on a application that reads attachments from the e-mail(PDF's in general) and performs OCR operation on the PDF files. The problem is that I need to save the files to the HDD first and I think that this is not necessary. Is it possible…
Rodrigo
  • 37
  • 5
0
votes
0 answers

tess4j version 4.0.0-SNAPSHOT. Invalid memory access, when I set oem to 2 or 3.

The TESSDATA_PREFIX is set to the parent folder of the tessdata folder of the commandline tesseract 4.0.0 (C:\Program Files (x86)\Tesseract-OCR). The commandline tesseract produces reasonable output in all four OCR Engine Modes. Here is my…
Grada Gukovic
  • 1,228
  • 7
  • 13
0
votes
1 answer

How to handle exception Unsupported image format from tess4j?

I'm trying to use tess4j to get a text from image, it's error: "java.lang.RuntimeException: Unsupported image format. May need to install JAI Image I/O package." Although, I added JAI-imageio-core to dependencies. File imageFile = new…
0
votes
0 answers

Jna loading issue

Hi iam trying to Tess4j in one of my projects. Every time i run,iam getting a error message Exception in thread "main" java.lang.UnsatisfiedLinkError: The specified module could not be found. at com.sun.jna.Native.open(Native Method) at…
Binu
  • 49
  • 5
0
votes
1 answer

HTTP Status 500 - Handler processing failed; nested exception is java.lang.UnsatisfiedLinkError: The specified module could not be found

I am trying to implement OCR using tess4j.When I am running the application in eclipse its working fine.But when I deployed the WAR file to tomcat or Jboss I am getting below error HTTP Status 500 - Handler processing failed; nested exception is …
Shailesh
  • 38
  • 7
0
votes
0 answers

Plate recognition using Java and tesseract, tess4j

I have a problem during compilation of the program. The code itself has no errors. Program code: package plate; import java.io.File; import net.sourceforge.tess4j.Tesseract; public class OcrReader { public static void main(String[] args)…
Celtas
  • 1
0
votes
1 answer

Tesseract invalid memory access when i am using tess4j1.3.0

When using OCR tess4j using rest controller getting exception "java.lang.Error: Invalid memory access". Have also set the data path but as soon as i run the program it throws the above mentioned exception. And i have added the -Djava.jna.path with…
0
votes
1 answer

Could not make tesseract works on Centos7

I'm working on an app using Java Sprint with Maven in Eclipse. I make tesseract works on my dev machine in Windows (yeah, I know...) but when I tried to deploy the app in the server which is a Centos 7, I get some errors and the app stop…
Faabass
  • 1,394
  • 8
  • 29
  • 58
0
votes
1 answer

Tesseract not using path variable

Why does my Tesseract instance require me to explicitly set my datapath, but doesn't want to read the environment variable? Let me clarify: running the code ITesseract tesseract = new Tesseract(); String result = tesseract.doOCR(myImage); Throws an…
Ognjen Mišić
  • 1,219
  • 17
  • 37
0
votes
1 answer

java.lang.UnsatisfiedLinkError: Le module spécifié est introuvable.on Tess4J

I'm developing a web app using Spring MVC with Maven and with some functionality using tess4j for the OCR. My dev environment: Eclipse Neon.3 os: win server 2008R2 jvm 64 bit apache tomecat 9 I downloaded the latest version of Tess4J from…
kajede
  • 1
  • 1
0
votes
1 answer

Tess4j issue in Centos

I am using tess4j in AWS AMI with Centos. When creating the api using TessBaseAPICreate() i also got the error as below java.lang.NoSuchMethodError:…
Shawn Chen
  • 73
  • 8
0
votes
1 answer

Tesseract is giving junk data as an output for Japaneses language

I'm trying to build a sample application in java for Japaneses language that will read an image file and just output the text extracted from the image. I found one sample application on net which is running perfect for English Language but not for…
Aditya
  • 11
  • 3