refining captcha with a little noise

Question

I'm trying to crack a particular web CAPTCHA. I'm planning to do it by segmenting the characters and passing them to an ANN (for features I will mostly be using the method of moments, as it seems difficult to remove the noise completely).

The captcha is very noisy, and unfortunately there is no color difference between the noise and the actual text, so separation based on color will not work. After quite some thought, I managed to implement a flood-fill style algorithm on the pixels of the captcha to separate out small disconnected components, and after this I ended up with something like this:

[image: CAPTCHA after considerable noise removal]
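A flood-fill cleanup of the kind described above might look roughly like this. It is only a minimal sketch, not the code actually used for the image: it assumes the captcha has already been binarised into a list of rows with 1 for dark pixels and 0 for background, it uses 4-connectivity, and the name `remove_small_components` and the `min_size` threshold are hypothetical.

```python
from collections import deque


def remove_small_components(img, min_size):
    """Erase 4-connected dark components with fewer than min_size pixels.

    img is a list of rows; 1 means dark pixel, 0 means background
    (an assumed representation). A new image is returned.
    """
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    out = [row[:] for row in img]
    for y in range(h):
        for x in range(w):
            if img[y][x] != 1 or seen[y][x]:
                continue
            # Flood fill from (y, x) to collect one connected component.
            component = []
            queue = deque([(y, x)])
            seen[y][x] = True
            while queue:
                cy, cx = queue.popleft()
                component.append((cy, cx))
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if 0 <= ny < h and 0 <= nx < w and img[ny][nx] == 1 and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            # Components below the size threshold are treated as noise.
            if len(component) < min_size:
                for cy, cx in component:
                    out[cy][cx] = 0
    return out
```

Noise that touches a letter belongs to the same component as the letter, which is why this step alone cannot remove it.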

Most of the noise is gone, but some of it is left around the letters themselves (since it is touching the text). I'm not an expert on image filters, and I'm finding it very difficult to find the right filter to reduce the remaining noise and enhance the characters. Any ideas on what filter(s) I could use for this purpose?

(Note: I'm not using any image manipulation tool/library for this. I'm writing raw pixel manipulation code, but I can implement most filters given their convolution kernel)

The problem is that due to this noise, it is becoming difficult to segment the characters. Clearly trying to find vertical lines with no dark pixels is not going to work, since there is noise and some of the letters are touching. Any ideas on how I could segment these efficiently?
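To make the problem concrete, the column-based split mentioned above can be sketched as follows. This is an illustration under assumptions, not a solution: the image is taken to be a list of rows with 1 for dark pixels, and `max_dark` is a hypothetical tolerance that lets a column count as a gap even if it contains a few noise pixels. It still cannot separate letters that touch.

```python
def column_counts(img):
    """Number of dark pixels (value 1) in each column."""
    return [sum(row[x] for row in img) for x in range(len(img[0]))]


def split_columns(img, max_dark=0):
    """Return (start, end) column ranges, end exclusive, in which every
    column has more than max_dark dark pixels."""
    segments = []
    start = None
    for x, count in enumerate(column_counts(img)):
        if count > max_dark:
            if start is None:
                start = x  # a new segment begins here
        elif start is not None:
            segments.append((start, x))  # gap column closes the segment
            start = None
    if start is not None:
        segments.append((start, len(img[0])))
    return segments
```

With `max_dark=0` this is exactly the "vertical lines with no dark pixels" test; raising it trades noise tolerance against the risk of cutting through thin strokes.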

EDIT: Original image:

[image: original image of captcha]

I also tried a median filter with a radius of 1, but it did not seem to help much – Quark Dec 15 '14 at 04:55
Post the original image; maybe we will have a different idea. If you use some morphological operators like erode and dilate, you may be able to wipe out some of the noise. – David Clifte Dec 15 '14 at 11:41
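For reference, the radius-1 median filter mentioned in the comments corresponds to a 3x3 window. A minimal raw-pixel sketch, assuming a grayscale image stored as a list of rows of integers and a border policy (only neighbours inside the image are used) that is my own choice:

```python
def median_filter(img, radius=1):
    """Replace each pixel by the median of its (2*radius+1)^2 neighbourhood.

    Near the border only the neighbours inside the image are used; for an
    even number of samples the upper median is taken.
    """
    h, w = len(img), len(img[0])
    out = [row[:] for row in img]
    for y in range(h):
        for x in range(w):
            window = sorted(
                img[ny][nx]
                for ny in range(max(0, y - radius), min(h, y + radius + 1))
                for nx in range(max(0, x - radius), min(w, x + radius + 1))
            )
            out[y][x] = window[len(window) // 2]
    return out
```

A median filter of this size removes isolated specks, but noise clusters larger than about half the window survive it, which is consistent with the limited effect reported above.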

McMa · Answer 1 · 2014-12-15T16:04:13.267

What about trying morphological operators like closing and opening? They are very easy to implement and are a simple but effective tool.

After one closing with a 3x3 cross structuring element (kernel) and binarising the image, the noise is almost gone:

[image: captcha after closing and binarisation]

I am sure a bit more experimentation will give great results.

Edit: to clear things up a little, a closing is a dilation followed by an erosion (the other way around for an opening). A dilation assigns every pixel in your image the maximum value of all pixels in the kernel (structuring element) around it; conversely, an erosion assigns every pixel the minimum value of all pixels in the kernel around it.
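The definitions above translate almost directly into raw pixel code. The following is a minimal sketch, assuming a grayscale image stored as a list of rows of integers (0 = black, 255 = white); the helper names, the border handling (neighbours outside the image are ignored) and the threshold of 128 are assumptions for illustration, not the exact code used for the result shown.

```python
# Offsets (dy, dx) of a 3x3 cross structuring element, centre included.
CROSS = ((0, 0), (-1, 0), (1, 0), (0, -1), (0, 1))


def _morph(img, pick, kernel):
    """Apply pick (max or min) over the kernel neighbourhood of each pixel.
    Neighbours that fall outside the image are skipped."""
    h, w = len(img), len(img[0])
    return [
        [
            pick(
                img[y + dy][x + dx]
                for dy, dx in kernel
                if 0 <= y + dy < h and 0 <= x + dx < w
            )
            for x in range(w)
        ]
        for y in range(h)
    ]


def dilate(img, kernel=CROSS):
    return _morph(img, max, kernel)  # maximum over the neighbourhood


def erode(img, kernel=CROSS):
    return _morph(img, min, kernel)  # minimum over the neighbourhood


def closing(img, kernel=CROSS):
    return erode(dilate(img, kernel), kernel)  # dilation, then erosion


def opening(img, kernel=CROSS):
    return dilate(erode(img, kernel), kernel)  # erosion, then dilation


def binarise(img, threshold=128):
    """Map every pixel to 0 or 255; the threshold value is an assumption."""
    return [[255 if v >= threshold else 0 for v in row] for row in img]
```

Which of the two operations removes the specks depends on polarity: with these max/min definitions applied to dark text on a light background, closing removes small dark specks, while on an inverted image (bright text on a dark background) opening does the same job.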

Also take a look at the Wikipedia link and the external links in there.

edited Dec 15 '14 at 16:04

answered Dec 15 '14 at 14:08

Thanks, could you please give me a little more detail on how I could implement the closing operation? I don't really know much about morphology – Quark Dec 15 '14 at 14:28
Thanks a lot! Though I guess it's the reverse: I was able to achieve a similar effect to yours by opening (erosion then dilation). Do you also have any ideas for edge enhancement/completion? – Quark Dec 15 '14 at 18:25
Yes, of course, I always confuse them... As for the edge definition, you can keep on using mixtures of the morphological operators: different kernels will produce different results. If you want to take a look at other, more complex techniques, I think Hough lines and Hough circles might do a good job – McMa Dec 15 '14 at 22:06
