Questions tagged [text-recognition]

314 questions
0
votes
1 answer

How to extract text from these colored images?

I want to extract the text on Labels from the images. The images are coloured and are in a real-life environment. PFA images. Sample Image I have tried multiple solutions: I'm able to read text from flat images using Tesseract but it's not working…
0
votes
2 answers

MLKit Text Recognition in portrait

I have been following this link regarding Firebase MLKit text recognition (OCR) for iOS and it seems to be working fine with the exception of when the photo selected (via either camera or library) was taken in portrait. When processing a photo in…
teh_raab
  • 384
  • 1
  • 3
  • 21
0
votes
1 answer

Swift, Firebase - Use CMSampleBufferRef with live feed of camera

I'm currently trying to implement the MLKit from Firebase to use text recognition. So far, I've got the code for the camera, which shows its live feed inside of an UIView. My intention is now to recognize text in this live feed, which I reckon is…
Tom
  • 3,672
  • 6
  • 20
  • 52
0
votes
1 answer

OCR code written without custom loss function

I am working on OCR model. my final goal is to convert OCR code into coreML and deploy it into ios. I have looked and run a couple of the github source codes namely: here here as you have a look on them they all implemented loss as a custom layer…
sariii
  • 2,020
  • 6
  • 29
  • 57
0
votes
1 answer

How to detect text/logo-details from an image of any consumer product?

I am trying to detect name of any consumer product from an image of its packaging.For eg- Maggie (I want to detect- Maggie happiness is homemade) Kellogg's I have tried applying image prepossessing(e.g- erosion, open, close etc.) and then supplying…
0
votes
2 answers

Text-recognition + OMR

I am working on a medium scale app with a number of functions. I did most of the parts already. I just left the part of Text Recognition + OMR for attendance system. Can anyone give me some direction (documentation or tutorial i can follow) on…
user8554744
0
votes
1 answer

Different results when running both iOS and Android versions of MLKit Text Recognition On-Device API with the same image as input

this is more a question for the Firebase/MLKit team. When using the same image as input for the iOS and Android versions of MLKit Text Recognition On-Device API, I do get different results on the bounding boxes information (x,y,width,height)…
Joao
  • 1
0
votes
1 answer

RNCamera text recognition fails on Portrait mode

I'm developing an app using react-native-camera, RNCamera, on ejected mode. BTW, the text recognition feature is supposed to work only for Android. On Portrait mode, it only detects Single or double characters on each detect event, like 'O',…
diogenesgg
  • 2,601
  • 2
  • 20
  • 29
0
votes
1 answer

why text recognizer is not operational?

In line 3 there is if that check if text recognizer is operational or not and when I run program on my asus me371 mg tablet it will never go in else. It means text recognizer is never operational, what should I do? TextRecognizer textRecognizer=new…
ebrahim
  • 1
  • 1
0
votes
2 answers

How to modify the connectionist Temporal Classification (CTC) layer of the network to also give us a confidence score?

I am trying to recognize words from cropped images of words itself by training a CRNN(CNN+LSTM+CTC) model. I am confused how to add confidence score along with recognized words. I am uisng tensorflow and following the implementation of…
vinayak A
  • 77
  • 2
  • 9
0
votes
1 answer

Extract Name from ID Card with OCR Mobile Vision

I'm able to get all text from ID card with Mobile Vision API Text Recognition but I want to extract specific information such as Name, Surname from the text. A block of output is like this: Is there any way extract Name from the String? or, Is…
Melih
  • 23
  • 10
0
votes
1 answer

How to improve Google Vision results while detecting a text on an image if we know the language of

How to modify the following Python code to return results in German? Is it possible? Thank you. def detect_text_uri(uri): """Detects text in the file located in Google Cloud Storage or on the Web. """ client =…
0
votes
1 answer

How to detect if a text is Person, Organization, or other entity using Spacy?

I have a csv records of sales, each record has column customer name. This column is a combination of persons name and organization name. How can I use spacy to detect if a this column is a person or organization?
Arelancelot
  • 491
  • 3
  • 14
0
votes
1 answer

OpenCV hmm text detection algorithm is not working on simple image

I'm trying to write simple program to recognize some simple patterns. It works fine with tesseract, but for some reason it doesn't work with Hmm. Here is sample code: std::string image_path = "assets/ubuntu.png"; std::string filename =…
Alex Zhukovskiy
  • 9,565
  • 11
  • 75
  • 151
0
votes
0 answers

image detect text orientation (0, 90, 180, 270) degrees

I'm searching for a plain java solution to detect the text orientation (0, 90, 180, 270) degrees on an image. I already tried it with the Hough-Transform algorithm, but without success. Is there any other algorithm I can use? Best
sbo
  • 951
  • 2
  • 12
  • 25