As we are getting in to turbulent times of AI. I am as well spilling mine drop in to ocean. As I am pythonian, all attempts are done in python/anaconda.
Does anybody have already some experience in "data formats" passable to GPT family of AIs?
In documentation is recommended use of OpenAI tool for control. Followed by documentation recommending format ("Prompt:", "Completion:") With strings marked as:
["str" = in quotes,"/" = separator ,"@>" = unique symbol,
" " = comp. starts with empty space]
'Prompt': 'Hello AI..!!/@>'
'Completion': ' How are you today?/@>'
"Completion" should have empty space at start of every sting. So far I was able to find just simple examples as:
Col1 Col2
'Prompt': 'Completion':
'Text/@>' ' Text/@>'
Is there any way it will understand more complex dataset? Is effective to have more dim. DataFrame? Example:
Col1 Col2 Col3 Col4
'Prompt_a': 'Completion_a': 'Prompt_b': 'Completion_b':
'Text/@>' ' Text/@>' 'Text/@>' ' Text/@>
Is longer context text passed just as 'str/@>', or is some partition needed?
' text text text /@>'
Many thanks for all answers and efforts in advance.
Already checked: https://help.openai.com/en/articles/6811186-how-do-i-format-my-fine-tuning-data
END"}. Plus I was able to find multiple other possible formats. Many Many thanks for testing I will try to run something by myself (just starting so expecting some Errors :)) )