Questions tagged [text-segmentation]

Text segmentation is the process of dividing written text into meaningful units, such as words, sentences, or topics.

Text segmentation is the process of dividing written text into meaningful units, such as words, sentences, or topics.

References:

Related Tags:

197 questions
1
vote
1 answer

Backtracking, Text Segmentation

I could not figure out this brainstorming algorithm question! Could some one help? --------Description-------- Suppose you are a scientist who are searching for certain kinds of genes. You know that a gene you are looking for should be formed of the…
User82949B
  • 187
  • 1
  • 10
1
vote
0 answers

Detecting vertical text present in a text image

Given an image consisting of text in both horizontal and vertical orientation, I want to detect in which bounding box vertical text is present and what is the orientation of the corresponding box's text itself (top to bottom or horizontal stacked…
1
vote
1 answer

How do I segment the connected characters in this case?

It seems that I need some advice on segmenting connected characters (see the image below). As you can see, C and U, as well as 4,9 and 9 are connected and therefore when I try to draw contours they are joined into one block. Unfortunately, there…
Don Draper
  • 463
  • 7
  • 21
1
vote
0 answers

Dynamic kernel for image

On text extraction, how to set kernel size dynamically for morphologyEx operation in cv2?Basically, I want to extract the word from image depending on various types of fonts, size. My code work for particular images only.how to find out what the…
devesh
  • 618
  • 6
  • 26
1
vote
3 answers

Python: How to get possible combinations of keys in dict

Given a dict of vocabulary: {'A': 3, 'B': 4, 'C': 5, 'AB':6} and a sentence, which should be segmented: ABCAB. I need to create all possible combinations of this sentence such as [['A', 'B', 'C', 'A', 'B'], ['A', 'B', 'C', 'AB'], ['AB', 'C',…
muc777
  • 37
  • 7
1
vote
0 answers

How to correct page segmentation by Tesseract OCR using image processing?

For eg, the content beneath Article Info and Abstract are being considered as one single paragraph by Tesseract OCR. This is interfering with the selection of text later on. After I fiddled around with some image processing steps (Gaussian Blur,…
1
vote
1 answer

How to merge two PunktSentenceTokenizer pickle files?

I have trained the PunktSentenceTokenizer in NLTK and obtained a pickle file "learnt.pickle": …
ashesh
  • 13
  • 4
1
vote
1 answer

How to set SRX rule to break both before and after a character

I am updating an SRX rules file using the SRX specification www.ttt.org/oscarstandards/srx/srx10.html There is no explicit mention of how to make a break both before and after a certain text. In a document the bullet character \u2022 appears and in…
ib11
  • 2,530
  • 3
  • 22
  • 55
1
vote
1 answer

for loop causing bad text segmentation in matlab

the input images are a.jpg and b.jpg these two image stored in for example comp folder.and want to write the segmented image in segment folder.but I think for looping problem segmentation repeated for so many times for each image.And I could't solve…
1
vote
1 answer

what's mean about "compact unwind info" in linker synthesized

When I analyze the link map file which was generated by xcode, in the linker syntheized section, there is a data named "compact unwind info". compact unwind info 858.57KB 858572 Unchecked it take about 858kb space size. I want to know what the…
boo
  • 493
  • 6
  • 17
1
vote
1 answer

Text Segmentation on Texts Occluded by Objects

I'm working on a project to extract characters of a license plate. I have developed a method by MSER to segment characters for license plates with complex background and it works well. The problem is, in some cases like seen below, the frame(license…
Horizon1710
  • 792
  • 10
  • 28
1
vote
1 answer

How to save exact image of segmentation result (segmentation using vertical projection method)?

I've already modified a code for segmentation process. The code is as follow: % Preprocessing + Segmentation (VP with secondary element) % // Original Code of Vertical Projection for Segmentation by Soumyadeep Sinha // % // Modified by Ana Ainul S.…
1
vote
1 answer

Saving Segmentation Result Automatically - Matlab Arabic OCR

Complete Segmentation code: % Preprocessing + Segmentation % // Original Code of Segmentation by Soumyadeep Sinha with several modification by Ana// % Saving each single segmented character as one file function [s] = seg (a) myFolder =…
Ana Ain
  • 173
  • 1
  • 3
  • 14
1
vote
3 answers

search for a sentence in a paragraph

How can I search a paragraph for one or more sentences using PHP?
john
  • 535
  • 7
  • 23
1
vote
1 answer

Difference between classification and segmentation in data mining tasks?

Can someone say what is difference between classification and segmentation in data mining tasks? Can we say that segmentation is classification task when objects are costumers and dividing criterias are relevant to Marketing? So segmentation is just…
Anja
  • 11
  • 1
  • 2