I have a CSV file with about 2 million lines and about 150 columns of data. Total file size is about 1.3 GB. That's about 300 million array elements.
I started with a 3.5 million line file and, through trial and error, learned that FORTRAN would not even compile unless the array was defined at 3.9 million lines or fewer. At 4 million, no go: bus error/core dumps.
So anyway, I thought my 2 million line file would work. I read a few posts about a 2 GB limit. However, when I print the line number while reading the data in, I only get to about 250,000 before it just ends. Strangely enough, an almost identical file (produced with the split command) only gets to 85,000 before conking out. I'm not sure why the two differ so much, since both have the same number of characters per line.
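For a rough sense of the memory involved, here is a back-of-the-envelope estimate of what the 2 million line array would need. The element sizes (4-byte and 8-byte reals) are assumptions, since the array's type isn't stated above, and the helper name array_bytes is just for illustration.

```python
# Rough memory estimate for holding the whole file in one 2-D array.
# ASSUMPTION: element sizes of 4 and 8 bytes; the actual array type is not stated.
ROWS = 2_000_000   # lines in the file
COLS = 150         # columns per line


def array_bytes(rows, cols, bytes_per_element):
    """Return the size in bytes of a rows x cols array."""
    return rows * cols * bytes_per_element


elements = ROWS * COLS
for label, size in (("4-byte real", 4), ("8-byte real", 8)):
    gib = array_bytes(ROWS, COLS, size) / 2**30
    print(f"{label}: {elements:,} elements, {gib:.2f} GiB")
```

Under those assumptions, the 4-byte case comes to roughly 1.1 GiB and the 8-byte case to roughly 2.2 GiB, so whether the array stays under the 2 GB figure would depend on the element type.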
Is there anything I can do to get this data read in? It would be a major pain to compile all the data hundreds of times.