I have an use case where i need to parse an image or PDF to read a survey form with all the values like Name, Age, Address etc as a key value pair and load the data to the table for each of the column.
Right now we are using AWS Textract and we are getting all the info as expected but the key, value pair for the checkboxes and radio buttons are not getting properly.
For example, the question is Are you from India? and two radio buttons yes, no.
The expected output if yes is checked is { 'Are you from India?': 'Yes' }. but the output i am getting is {'YES': 'SELECTED', 'NO': 'NOT_SELECTED'}.
Is there any way to train Textract to achieve this information or any other approach i can try to achieve.
Can someone please help me with this. I am new to this domain so i am completely blind on where to look for this.
Thanks In Advance.