Questions tagged [tess4j]

Tess4J is a Java JNA wrapper for Tesseract OCR API.

Description

A Java JNA wrapper for Tesseract OCR API.
Tess4J is released and distributed under the Apache License, v2.0.

Releases Versions.

  • Version 1.3 (released : May 31, 2014)
  • Version 2.0 Beta (released : June 1, 2014)
  • Version 3.4.3 (released: 14 January 2018)

Features:

The library provides optical character recognition (OCR) support for:

  • TIFF, JPEG, GIF, PNG, and BMP image formats
  • Multi-page TIFF images
  • PDF document format

Related tags

Links

Tess4J homepage
Tess4J Github

222 questions
0
votes
1 answer

Tess4J doOCR() for *First Page* of pdf / tif

Is there a way to tell Tess4J to only OCR a certain amount of pages / characters? I will potentially be working with 200+ page PDF's, but I really only want to OCR the first page, if that! As far as I understand, the common sample package…
Don Cheadle
  • 5,224
  • 5
  • 39
  • 54
0
votes
1 answer

How to turn off the dictionary in tess4j?

The accuracy of character recognition in my tess4j OCR application is very low. I have heard that turning off the dictionary in tess4j will increase the accuracy by letting individual characters to be recognized. But i don't know how to do it.…
Chalaka Ellawala
  • 405
  • 3
  • 5
  • 14
0
votes
1 answer

Need a method to specify 'Tess Data' folder during run time

I am using 'tess4j' api to get the text from a .jpg image. The 'TessData' folder is packaged in the jar files. I want to include custom fonts and create my own 'TessData' folder, is there a way we can give 'TessData' location during run time. I used…
Suvidh
  • 103
  • 1
  • 2
  • 10
0
votes
1 answer

Running tess4j (tessreact java wrapper) under Linux

On linux server tesseract 3.0.2 libraries were correctly installed: [root@XXX as]# rpm -qa | grep ".*tess.*" tesseract-3.02-alt1.r723 When trying to run tess4j wrapper, got this exception: Exception in thread "main" java.lang.UnsatisfiedLinkError:…
Andremoniy
  • 34,031
  • 20
  • 135
  • 241
0
votes
1 answer

java.lang.NoClassDefFoundError occures using tess4j without Eclipse

I use tess4j library for my java app. I followed instructions and copied liblept168.dll and libtesseract302.dll into my project root folder. When I run my app with Eclipse, it works normally. But when I export it as runnable jar file, it gives me…
-1
votes
3 answers

Trying to get Tess4J to work

I have been trying, unsuccessfull so far, to get Tess4J to work on NetBeans. I am following the tutorial here:- http://tess4j.sourceforge.net/tutorial/ I have followed it word for word, but get this error message saying:- "Error opening data file…
Jeremy Watts
  • 147
  • 1
  • 2
  • 8
-1
votes
4 answers

How to remove hyphens and white space in between of the string?

Using Tess4J jars I got a set of string containing white spaces and hyphens. V O D) >3 IIIIIII v .. 18:05Desks ¢ 3HOME FIND DESK FIND COLLEAGUE BOOKINGS0, SelfiMGiKIOSK708O4il27197097102, LOGSM’KIOSK’O8O47127197107682Monday, May 29 2017…
Manish
  • 17
  • 1
  • 9
-1
votes
1 answer

Java OCR for images with complicated backgrounds

I'm trying to get some text from images which look like this: This example would actually be the best case scenario as most of them would have a colored and more complex background instead. I don't need it to be 100% accurate since I know the…
Vlad Vidac
  • 978
  • 1
  • 10
  • 26
-1
votes
1 answer

How to add DLLs while building a JAR - Eclipse

I have made an OCR Application that would convert image files to Doc Files, using Tesseract as its OCR Engine. I used the Tess4j JNA Wrappers for this. While making the application i put the dll files and the language data(tessdata) in the bin…
Shaurya Chaudhuri
  • 3,772
  • 6
  • 28
  • 58
-1
votes
1 answer

How to use tesseract OCR in android and in eclipse and in java

I am making an app that wants to use tesseract OCR library. But tesseract is written in C++. So I wnt for tess4J. I read tutorials how to use it in eclipse and in android project. I followed this question (Including Tess4J to a Java project as…
Utkarsh Srivastav
  • 3,105
  • 7
  • 34
  • 53
-2
votes
1 answer

How to recognize specific regions from image using Tess4j java

I am working on Tess4j for reading passport. Is there any function to recognise only part of passport image such as Name, Surname etc. Does Tesseract supports zonal OCR? How can I achieve this task using Tesseract? Give me some suggestions on it.
Shilpa
  • 61
  • 1
  • 8
-2
votes
1 answer

Java code working fine when created as an independent static main class, but not when called from servlet

I am using Tess4j API for performing OCR and have created a dynamic web project in eclipse. If I create a new java class directly under the Java resources folder, the code is working fine. public static void main(String[] args){ File…
Saurabh Gour
  • 643
  • 2
  • 8
  • 24
1 2 3
14
15