I have an Ubuntu laptop with 8 GB of RAM and a 2 GB CSV file, but when I use pandas' read_csv to load the data, the RAM fills up completely even though about 7 GB were free. How does a 2 GB file fill 7 GB of RAM?
- Can you paste code to accompany your question? – Nov 09 '16 at 20:16
- These SO threads may be helpful: http://stackoverflow.com/questions/19590966/memory-error-with-large-data-sets-for-pandas-concat-and-numpy-append and http://stackoverflow.com/questions/17557074/memory-error-when-using-pandas-read-csv – Bharath Nov 09 '16 at 20:28
2 Answers
The reason for the high memory usage (and for the low_memory warning you may see) might be that guessing dtypes for each column is very memory-demanding: pandas analyses the data in every column to decide which dtype to set.
If you are on a 32-bit system: memory errors happen a lot with Python when using the 32-bit build on Windows, because a 32-bit process only gets 2 GB of address space to play with by default.
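If you are not sure which build you are running, one quick way to check (a small sketch, works on CPython) is:

import struct

# Prints 32 on a 32-bit interpreter, 64 on a 64-bit one.
print(struct.calcsize('P') * 8)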
Try this:

import pandas as pd

# Read the file in 1,000-row chunks, then stitch the pieces back together once.
tp = pd.read_csv('file_name.csv', header=None, chunksize=1000)
df = pd.concat(tp, ignore_index=True)
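Specifying dtypes up front also helps, since pandas then doesn't have to guess (this is what the question's author ended up doing, per the comment below). A minimal sketch, with hypothetical column names and types:

import pandas as pd

# Hypothetical column names and dtypes -- adjust to match your CSV
# (assumes the file has a header row with these names).
dtypes = {'user_id': 'int32', 'price': 'float32', 'category': 'category'}
df = pd.read_csv('file_name.csv', dtype=dtypes, usecols=list(dtypes))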

harshil9968
- Yes, it was because of dtypes; I converted some columns' dtypes as I was loading. Thanks. – Shoobi Nov 11 '16 at 05:20
- I have tried to upvote, but it is not displayed publicly because I have less than 15 reputation ;) – Shoobi Nov 16 '16 at 07:14
Try making use of the chunksize parameter:

import pandas as pd

# Read the file in 10,000-row chunks and concatenate them into a single DataFrame.
df = pd.concat((chunk for chunk in pd.read_csv('/path/to/file.csv', chunksize=10**4)),
               ignore_index=True)
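If you never actually need the whole table in memory at once, you can also process each chunk as it is read and skip the final concat; a rough sketch, where the column name and the aggregation are only illustrative:

import pandas as pd

total = 0
for chunk in pd.read_csv('/path/to/file.csv', chunksize=10**4):
    # Replace this with whatever per-chunk work you actually need.
    total += chunk['some_numeric_column'].sum()

print(total)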

MaxU - stand with Ukraine
- Your first version is horribly inefficient; add a note: http://pandas.pydata.org/pandas-docs/stable/merging.html – Jeff Nov 09 '16 at 20:23
- Every loop iteration you were making a copy of a bigger and bigger frame; instead, append to a list and call concat once (as the current example does). – Jeff Nov 09 '16 at 20:29
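A sketch of the pattern Jeff describes, with the file path and chunk size only illustrative:

import pandas as pd

# Anti-pattern: growing a DataFrame inside the loop copies it on every iteration.
# df = pd.DataFrame()
# for chunk in pd.read_csv('file.csv', chunksize=10**4):
#     df = pd.concat([df, chunk])

# Better: collect the chunks in a list and concatenate once at the end.
chunks = []
for chunk in pd.read_csv('file.csv', chunksize=10**4):
    chunks.append(chunk)
df = pd.concat(chunks, ignore_index=True)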