I have a massive 8GB CSV file that contains information about companies created in France. I managed to read the file in Python using:
import pandas as pd
df = pd.read_csv('File', sep=";", encoding="latin", iterator=True, chunksize=1000)
I know that this code worked, because when I run the line below, I get a dataframe-like output with the next 3 rows of the file:
df.get_chunk(3)
The problem is that I now want to manipulate the data and extract rows that match given criteria, as I would with a regular dataframe, like:
df[(df.Country == "France")]
For example, in a dataframe, the code above would only keep the rows for France. But when I tried:
df.read(df[(df.Country == "France")])
I got: 'TextFileReader' object has no attribute 'Country'
How can I filter rows using criteria like "==", ">" or "<" and store the result in a dataframe?
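To make the goal concrete, here is a minimal sketch of the kind of chunk-by-chunk filtering I have in mind. It relies on the fact that, with chunksize set, read_csv returns a TextFileReader that yields ordinary dataframes when iterated. The in-memory CSV and the Name and Employees columns are made up for illustration and stand in for the real file; only the Country column comes from my data:

```python
import io

import pandas as pd

# Hypothetical stand-in for the real 8GB file: a tiny semicolon-separated CSV.
# The "Name" and "Employees" columns are invented for this example.
sample_csv = io.StringIO(
    "Name;Country;Employees\n"
    "Alpha;France;10\n"
    "Beta;Germany;250\n"
    "Gamma;France;42\n"
    "Delta;Spain;7\n"
    "Epsilon;France;3\n"
)

# With chunksize set, read_csv returns a TextFileReader, not a DataFrame.
reader = pd.read_csv(sample_csv, sep=";", chunksize=2)

# Each chunk yielded by the reader is a regular DataFrame, so the usual
# boolean indexing works on it. Only the matching rows are kept.
filtered_chunks = [chunk[chunk["Country"] == "France"] for chunk in reader]

# Combine the filtered pieces into a single DataFrame.
france = pd.concat(filtered_chunks, ignore_index=True)
print(france)
```

The same pattern should carry over to other criteria such as chunk["Employees"] > 10, as long as the filtered rows fit in memory once concatenated.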
Thank you,