Your file is NOT csv so you may have to write own function to read it and split to columns
I used io
only to simulate file in memory - so everyone can copy and test it - but you should use open()
text = '''1 persona: i am adorkable.
2 persona: i am book dumb.
3 persona: i am token evil teammate.
4 persona: i am never my fault.
5 persona: i am honor before reason.
6 persona: i am jerk with a heart of gold.
7 persona: i am no social skills.
8 persona: i am bad liar'''
import io
#f = open('./Data/fold2_l25431/test.txt')
f = io.StringIO(text)
rows = []
for line in f:
line = line.strip() # remove '\n'
first, rest = line.split(' ', 1) # split only on first space
second, third = rest.split(': ') # split on ": "
rows.append( [first, second, third] )
print(rows)
Result:
[
['1', 'persona', 'i am adorkable.'],
['2', 'persona', 'i am book dumb.'],
['3', 'persona', 'i am token evil teammate.'],
['4', 'persona', 'i am never my fault.'],
['5', 'persona', 'i am honor before reason.'],
['6', 'persona', 'i am jerk with a heart of gold.'],
['7', 'persona', 'i am no social skills.'],
['8', 'persona', 'i am bad liar']
]
And later you can convert this list to DataFrame
import pandas as pd
df = pd.DataFrame(rows, columns=['1', '2', '3'])
print(df)
Result:
1 2 3
0 1 persona i am adorkable.
1 2 persona i am book dumb.
2 3 persona i am token evil teammate.
3 4 persona i am never my fault.
4 5 persona i am honor before reason.
5 6 persona i am jerk with a heart of gold.
6 7 persona i am no social skills.
7 8 persona i am bad liar