All,
We are using openai RAG pattern to retrieve sources from a document and sending them to openai to get answers. We are using azure cognitive search as the vector db.
we are generating a Json file from the original pdf. Our client also provided a list of expected questions and answer.
We indexed the pdf json along with the questions and answers.
In our testing we are getting the expected results along with content from both the pdf json and questions and answers.
Questions
what are the implications of this approach?
are we introducing any bias?
is there a better way to do this ?
Thanks -Nen