Questions tagged [tess4j]

Tess4J is a Java JNA wrapper for Tesseract OCR API.

Description

A Java JNA wrapper for Tesseract OCR API.
Tess4J is released and distributed under the Apache License, v2.0.

Releases Versions.

  • Version 1.3 (released : May 31, 2014)
  • Version 2.0 Beta (released : June 1, 2014)
  • Version 3.4.3 (released: 14 January 2018)

Features:

The library provides optical character recognition (OCR) support for:

  • TIFF, JPEG, GIF, PNG, and BMP image formats
  • Multi-page TIFF images
  • PDF document format

Related tags

Links

Tess4J homepage
Tess4J Github

222 questions
0
votes
2 answers

OCR : Not getting desired result

I have this image . I am trying to OCR the letters in this image. I am not getting desired result for letters '9' and 'R'. First I cropped these letters, & and executing following command. tesseract 9.png stdout -psm 8 . It is just returning…
Bhushan
  • 1,489
  • 3
  • 27
  • 45
0
votes
1 answer

Can we use Regular Expression to pass as variables in Tesseract?

Can we improve Tesseract character recognition accuracy by regular expression. For example we tell to Tesseract that the text can have this kind of structure. 4characters2Digits[4Digits]3char4Digits2char // Our string in the image is…
Bahramdun Adil
  • 5,907
  • 7
  • 35
  • 68
0
votes
1 answer

How to get tess4j to compile in Netbeans

So am trying to get the simple ocr test example from the following website http://tess4j.sourceforge.net/tutorial/ to correctly compile and read from the .tif image. But for whatever reason I get a bunch of errors when running. I have the folder…
0
votes
1 answer

NoSuchFieldError: RESOURCE_PREFIX when using Tess4J with Eclipse

I've tried using Tess4J in Eclipse but it results with the error in the title. I imported these jar files: tess4j, jai_imageio, and jna. The test code I have written is as per the example given on their examples page. File DMTfile =…
ALegendsTale
  • 41
  • 3
  • 8
0
votes
1 answer

Tess4j Java Tesseract Recognize Specific Font?

Currently, I am using Tess4j to recognize the text in my image. It accurately reads 95% of the characters I throw at it, but I would like to get 100%. I know the font that the image is using and I was wondering if there was a way I can get Tess4j to…
Alvarez
  • 1
  • 1
0
votes
1 answer

Unexpected result of Tesseract while using 'digits' flag

My image is and when I do tesseract myimage.png result digits I am getting result as 80 1 3047490 though I am expecting to get only digit, that is 4749 in my image. What I am doing wrong ? My Tesseract version is 3.03. PS: I also tried with no…
Bhushan
  • 1,489
  • 3
  • 27
  • 45
0
votes
0 answers

java.lang.UnsatisfiedLinkError: Unable to load library 'libtesseract302': The specified module could not be found

This is kind of duplicate question, But none of the answers helped to resolve this. I did this, Pulled code from here Added -Djna.library.path=${workspace_loc:/ocr-tess4j-example}/dlls/x64 in VM arguments, This folder has three dll files…
Selvakumar Ponnusamy
  • 5,363
  • 7
  • 40
  • 78
0
votes
1 answer

Build jar with tess4j

I create project in Intellij Idea, add tess4j 2.0 from maven, write test application. When I start debug all works fine. When I click "build artifacts" and launch jar file I haven't any result, no errors, nothing. public class MainApp { static…
0
votes
0 answers

Tess4j (Tesseract V3.03) Invalid memory access

When invoking tesseract doOcr on a multi page tiff documents throws invalid memory access error. Datapath is set to the folder where tessdata is present and dlls are present in classes folder Exception in thread "main" java.lang.Error: Invalid…
0
votes
1 answer

Tesseract: An alternative to building a source in linux remote host?

I have a Java Web project using Tess4j. On my local Mac OS it works perfect because probably I already have the Tesseract and Leptonica libraries built from source by ./config, make, make install Now I want to deploy the project to a remote Linux…
Spring
  • 11,333
  • 29
  • 116
  • 185
0
votes
2 answers

Tess4J NoClassDefFoundError

I'm trying to use Tess4J with the following: public static String parseImageCharacters(BufferedImage image) throws Exception { Tesseract instance = Tesseract.getInstance(); return instance.doOCR(image); } but I'm getting the…
Eduardo
  • 6,900
  • 17
  • 77
  • 121
0
votes
1 answer

NPE during concurrent thread access of a single tess4j instance

I am working with Tesseract 3.0.2 and using 1.4.1 tess4j..this is not working in a thread-safe manner, I get a NPE. I am using Grizzly/Jesery/Spring. @Service("textExtractorService") public class TextExtractorServiceImpl implements…
Jeryl Cook
  • 989
  • 17
  • 40
0
votes
0 answers

Tess4J not running in 32-bit JVM

I am having an issue getting tess4j to work in a 32-bit jvm environment. Oddly, it works in 64bit, but not 32bit (opposite of the issue others seem to face). Exception in thread "main" java.lang.UnsatisfiedLinkError: The specified module could not…
Poity
  • 21
  • 2
0
votes
1 answer

Intercepting console output which originated from Tess4J

I am trying to intercept the red Empty page!! message that gets printed to my screen when using Tess4J. I wrote a short interceptor class that overrides print and println and replaced stdout and stderr to check for this string: private static class…
dotwin
  • 1,302
  • 2
  • 11
  • 31
0
votes
1 answer

Feed tesseract (Tess4J) from ImageMagick (JMagick)

I'm trying to create a Java program that will OCR many formats of images. Images cannot be read directly from file, because their bytes are to be send through network. I'm currently able to read raw bytes of image pixels using ImageIO. However I…
nuoritoveri
  • 2,494
  • 1
  • 24
  • 28
1 2 3
14
15