Python: Specify end-of-line format for reading files

Question

I'm writing a Python script which processes a text file. I expect to process files generated from different people, working under different operating systems. Is there a nice way to figure out which OS created the text file, and specify the end-of-line convention to make parsing line-by-line trivial?

score 3 · Accepted Answer · answered Dec 04 '10 at 02:55

3

Use universal newline mode when opening the file.

with open('input.txt', 'rU') as fp:
  for line in fp:
    print line

answered Dec 04 '10 at 02:55

Ignacio Vazquez-Abrams

776,304
153
1,341
1,358

mechanical_meat · Answer 2 · 2010-12-04T02:38:26.873

2

splitlines() handles various line terminators:

>>> 'foo\nbar'.splitlines()
['foo', 'bar']
>>> 'foo\rbar'.splitlines()
['foo', 'bar']
>>> 'foo\r\nbar'.splitlines()
['foo', 'bar']

edited Dec 04 '10 at 02:38

answered Dec 04 '10 at 01:52

mechanical_meat

163,903
24
228
223

nate c · Answer 3 · 2010-12-04T02:12:10.477

1

If you do not care about ending white space then:

for line in [l.rstrip() for l in open('test.py').read().split('\n')]:
    print line

'\n' will take care of Linux / Mac and rstrip will eat up any '\r' from Windows.

edited Dec 04 '10 at 02:12

answered Dec 04 '10 at 02:03

nate c

8,802
2
27
28

This gives me an error: AttributeError: 'list' object has no attribute 'rstrip' – ajwood Dec 04 '10 at 02:09
Mauahah got it! I had to change split('\n') to split('\r') though. Thanks very much! – ajwood Dec 04 '10 at 02:17
Oh darn, there's one more thing... I've got to do something special with the 1st line of the file, then process the rest... – ajwood Dec 04 '10 at 02:19

score 1 · Answer 4 · answered Dec 04 '10 at 02:15

1

You want to use file.readlines(), which returns a list containing the lines in the file.

lines = open('info.txt').readlines()
for line in lines:
    print line

See the documentation on Python file objects.

answered Dec 04 '10 at 02:15

Velociraptors

2,012
16
22

Python: Specify end-of-line format for reading files

4 Answers4