I'm trying to recognize random chars in android with tess-two API .
I have a printed paper sheet with the string: "5XqaLB"
when i show to the camera parts of the string to recognize it, i get th following examples:
original -> result
"5XqaLB" -> "5anLB"
"XqaLB" -> "anLB"
"qaLB" -> "qaLB"
"5Xq" -> "5Xq"
I suppose this happens because tesseract tries to guess an word with the recognized chars. I searched a lot but can't find a solution. Anyone has ideas to avoid this tesseract replacements?
Already tried whitelist, blacklist, and confs like:
baseApi.setVariable("load_system_dawg", "0");
baseApi.setVariable("load_freq_dawg", "0");
baseApi.setVariable("load_punc_dawg", "0");
baseApi.setVariable("load_number_dawg", "0");
baseApi.setVariable("load_unambig_dawg", "0");
baseApi.setVariable("load_bigram_dawg", "0");
baseApi.setVariable("load_fixed_length_dawgs", "0");
baseApi.setVariable("segment_penalty_garbage", "0");
baseApi.setVariable("segment_penalty_dict_nonword", "0");
baseApi.setVariable("segment_penalty_dict_frequent_word", "0");
baseApi.setVariable("segment_penalty_dict_case_ok", "0");
baseApi.setVariable("segment_penalty_dict_case_bad", "0");
can anyone give any guess how to have tesseract recognize only plain chars?