Reading line by line with python's Pandas after skipping first 48 rows

Question

The title is fairly explanatory.

I have a long CSV file that I would like to read line by line with the following code:

lines = []
for line in pd.read_csv(file, chunksize = 1, header = None):
    lines.append(line.iloc[0 0])
print(lines)

I'd like to skip the first 48 rows. At first it seemed simple enough and I thought all I needed to do was change my read function to:

pd.read_csv(file,chunksize = 1, header = None, skiprows = 48):

Sadly, this seems to produce the effect of skipping 48 rows every single loops. Not a great outcome.

How can I read line by line which is effectively reading this file while simultaneously skipping the first 48 rows of this long, irregular file?

score 2 · Accepted Answer · answered May 13 '20 at 01:03

2

You could set skiprows to a variable that gets reset after its first execution.

lines = []
row_skip = 48
for line in pd.read_csv(file, chunksize = 1, header = None,skiprows=row_skip):
    lines.append(line.iloc[0,0])
    if row_skip:
        row_skip = None
print(lines)

answered May 13 '20 at 01:03

Umar.H

22,559
7
39
74

1

Excellent! That helped me so much with the issue I brought to hand. I now have some further problems ahead of me, but thank you so much for the assistance! – Taylor May 13 '20 at 13:30

Reading line by line with python's Pandas after skipping first 48 rows

1 Answers1