Questions tagged [tess4j]

Tess4J is a Java JNA wrapper for Tesseract OCR API.

Description

A Java JNA wrapper for Tesseract OCR API.
Tess4J is released and distributed under the Apache License, v2.0.

Releases Versions.

  • Version 1.3 (released : May 31, 2014)
  • Version 2.0 Beta (released : June 1, 2014)
  • Version 3.4.3 (released: 14 January 2018)

Features:

The library provides optical character recognition (OCR) support for:

  • TIFF, JPEG, GIF, PNG, and BMP image formats
  • Multi-page TIFF images
  • PDF document format

Related tags

Links

Tess4J homepage
Tess4J Github

222 questions
2
votes
0 answers

Tesseract, Tess4J - improve OCR output on low DPI images

I use Tesseract and JNA wrapper Tess4J for my Java application. I try to OCR jpeg images with 120 DPI. The output text is a pretty low quality and a lot of important words are not recognized properly. I think the main issue is in my input images…
alexanoid
  • 24,051
  • 54
  • 210
  • 410
2
votes
1 answer

Suppress Warning on Console when using Tess4j for OCRing

Help in Suppress Warning- "Warning. Invalid resolution 1 dpi. Using 70 instead." when using Tess4j for OCRing Hi All, I would like to suppress the warning thrown out in Console when using Tess4j for OCRing. Please help. Tesseract uses Leptonica…
user578219
  • 597
  • 2
  • 9
  • 32
2
votes
0 answers

How to deal the multipage pdf file with tess4j

I am using tess4j to recognize the image file. Pix pix = Leptonica1.pixRead(image.getPath()); TessAPI1.TessBaseAPIInit3(tessBaseAPI, tessDataPath, "eng"); TessAPI1.TessBaseAPISetImage2(tessBaseAPI, pix); // …
Shawn Chen
  • 73
  • 8
2
votes
1 answer

Issue met when using Tess4j

I am trying to create the TessBaseAPI using TessBaseAPICreate() and met the error as below: java.lang.UnsatisfiedLinkError: Error looking up function 'TessPDFRendererCreateTextonly': dlsym(0x7fc93f349590, TessPDFRendererCreateTextonly): symbol…
Shawn Chen
  • 73
  • 8
2
votes
1 answer

Java8, Tess4j : Optimize image for OCR with tesseract

I am working on Tesseract and I have OCR functionality working already. I wanted to optimize the image so that OCR results will be better. Currently I am only making the image monochrome and scaling it to double its size. Even after that I am having…
We are Borg
  • 5,117
  • 17
  • 102
  • 225
2
votes
0 answers

tess4j The specified module could not be found

please need your help.. I testing Tess4j but i have a problem Development environment : eclipse, jdk 1.7 Code System.setProperty("jna.library.path", "32".equals(System.getProperty("sun.arch.data.model")) ? "/lib/win32-x86" : "/lib/win32-x86-64"); …
ICARUS
  • 21
  • 3
2
votes
1 answer

Why is DPI relevant for images taken by a camera for OCR

I am currently working on a project that involves using the Tess4j Tesseract OCR engine. While working on this project I come along a lot of websites that state that Tesseract works best on images of at least 300 DPI (Dots per Inch). My question is…
danielrieske
  • 127
  • 1
  • 11
2
votes
1 answer

Tesseract in Apache server : The process cannot access the file because it is being used by another process

I'm using Tesseract in my JSF web application. when I was getting familiar with the Tesseract API as a simple java project , It worked without problems, then once I integrated it in my web project which uses apache tomcat 7 server, I get the…
Sam
  • 605
  • 9
  • 19
2
votes
2 answers

Missing slf4j in tess4j

I want to test tess4j to do OCR. I used the following tutorial for a first example: http://tess4j.sourceforge.net/codesample.html I included the required libraries, but I am getting the following error: Exception in thread "main"…
pexmar
  • 331
  • 2
  • 12
2
votes
0 answers

Surprisingly large difference between tess4j and tess-two

tess-two seems to work pretty well when I know EXACTLY the location on screen where the text I want to OCR is. Now I'm trying to scan for text against a busy background, and it's not working quite as well. annotated android screenshot I built a…
steve
  • 21
  • 1
2
votes
2 answers

Tess4j Issue in Windows: java.lang.UnsatisfiedLinkError: The specified module could not be found in instance.doOCR(imageFile)

After 2 days of googling I am still unable to find the solution of my issue with Tess4j version 3.0: java.lang.UnsatisfiedLinkError: The specified module could not be found. I write server side Spring boot app on my Windows 10 x64. I used this…
Yuliia Ashomok
  • 8,336
  • 2
  • 60
  • 69
2
votes
2 answers

How do I use Tess4j in my programs?

I am undertaking a project using OCR in Java and have come across Tess4j. I am using netbeans on a mac and there doesn't seem to be any beginner information available. Do I actually need to have Tesseract installed to use Tess4j? If so, when I…
volican
  • 137
  • 1
  • 14
2
votes
1 answer

Tess4J InvocationTargetException

I have a problem running the below java code outside of the eclipse compiler. Inside the compiler it works fine, but after exporting as a runnable jar I get an InvocationTargetException. I believe it is relevant to the dll files but after reading…
BlueCow
  • 21
  • 4
2
votes
0 answers

Issue reading Bold fonts with Tesseract API / Tess4j

I am having problem in reading BOLD font characters from image with Tesseract API http://tess4j.sourceforge.net/. Is Tesseract having issues with BOLD fonts. How to read BOLD fonts with Tesseract / tess4j? Do I need to add some libs with Tesseract…
Wisher Well
  • 133
  • 1
  • 1
  • 8
2
votes
2 answers

Tess4J on Ubuntu Linux, UnsatisfiedLinkError

I am getting a getting an UnsatisfiedLinkError using Tess4j in Tomcat+Ubuntu(Linux) java.lang.UnsatisfiedLinkError: Error looking up function 'TessBaseAPICreate': /usr/lib/libtesseract.so.3.0.2: undefined symbol: TessBaseAPICreate at…
1 2
3
14 15