I am trying to read a newspaper using OCR using tessaract. Before passing the image to tessaract, I am using Kraken to segment the actual lines and draw a line across the sentences for proper detection by tessaract.
When passing the image through kraken.pageseg.segment , I am getting an empty list and this output Too many connected components for a page image : 5903
, instead it should have returned a list containg the coordinates of the bounding box around the sentences.
I looked up the source code of kraken and found this perticular error message, but I am unable to understand it. [Source code for error][1]