How to make grep ignore the first line and process the other lines

Question

I need to remove the lines beginning with '#' from a text file, but keep the first line because it is the header. How can I make grep ignore the first line and remove every later line that begins with #?

cat sample.txt
#"EVENT",VERSION, NAME
1,2,xyz
1,2,abc
1,2,asd
1,2,ert
#"EVENT",VERSION, NAME
1,2,xyz
1,2,abc
1,2,xyz

cat sample.txt | grep -v "^\s*[#\;]\|^\s*$" > "out.txt"

But this removes the header too!

Possible duplicate of [Omitting the first line from any Linux command output](https://stackoverflow.com/q/7318497/608639), [Print a file skipping first X lines in Bash](https://stackoverflow.com/q/604864/608639), etc. – jww, Apr 21 '19 at 05:30
I don't think it's the same. I need to write the header to the output file too. – Aprilian8, Apr 21 '19 at 05:36

Cyrus · Accepted Answer · 2019-04-21T06:29:22.427

6

With sed:

sed '2,${/^#/d}' sample.txt

For every row from the second (2) to the last ($): search (/.../) for rows beginning (^) with # and delete (d) them. All other rows, including row 1, are printed, because the default action of sed is to print the current row.

Output:

#"EVENT",VERSION, NAME
1,2,xyz
1,2,abc
1,2,asd
1,2,ert
1,2,xyz
1,2,abc
1,2,xyz
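
The grep pattern in the question also dropped indented comments, lines starting with ; and blank lines. Here is a minimal sketch of the same range approach covering those cases, assuming a sed that accepts -E (GNU and BSD sed both do). The sample data and the temporary file are made up for illustration. The ; before the closing } is there because some non-GNU seds require it.

```shell
# Build a small sample file (hypothetical data modelled on the question).
sample=$(mktemp)
printf '%s\n' \
  '#"EVENT",VERSION, NAME' \
  '1,2,xyz' \
  '  # indented comment' \
  '' \
  '; semicolon comment' \
  '#"EVENT",VERSION, NAME' \
  '1,2,abc' > "$sample"

# From line 2 to the last line, delete lines that are blank or that start
# (after optional whitespace) with '#' or ';'. Line 1 is never tested.
filtered=$(sed -E '2,${/^[[:space:]]*([#;]|$)/d;}' "$sample")
printf '%s\n' "$filtered"

rm -f "$sample"
```

With this input, only the header and the two data rows remain.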

edited Apr 21 '19 at 06:29

answered Apr 21 '19 at 05:57

Cyrus


score 2 · Answer 2 · answered Apr 21 '19 at 21:33


This might work for you (GNU sed):

sed '1b;/^#/d' file

On the first line, b (with no label) branches to the end of the script, so that line is printed unchanged. Any other line that starts with # is deleted.
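
The same idea extends to a header that spans several lines by giving b a line range. This is a sketch with made-up input, assuming a two-line header. The script is split across two -e options so it does not rely on GNU sed accepting ; directly after b.

```shell
# Hypothetical input: a two-line header, both lines starting with '#'.
input=$(printf '%s\n' \
  '# generated by an export tool' \
  '#"EVENT",VERSION, NAME' \
  '1,2,xyz' \
  '#"EVENT",VERSION, NAME' \
  '1,2,abc')

# Lines 1-2 branch to the end of the script and are printed untouched;
# every later line that starts with '#' is deleted.
result=$(printf '%s\n' "$input" | sed -e '1,2b' -e '/^#/d')
printf '%s\n' "$result"
```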

answered Apr 21 '19 at 21:33

potong


raven-rock · Answer 3 · 2022-07-04T00:17:57.727

2

Applying an arbitrary command to all but the first line - a "header" - of a file or stream of tabular data is such a common task for me that I define a helper utility called body for it:

As a shell function (put this in your ~/.bashrc or equivalent):

body() {
  IFS= read -r header
  printf '%s\n' "$header"
  "$@"
}

Now:

$ cat sample.txt | body grep -v '^#'
#"EVENT",VERSION, NAME
1,2,xyz
1,2,abc
1,2,asd
1,2,ert
1,2,xyz
1,2,abc
1,2,xyz

Credit: adapted from "Command line tools for doing data science", where it is one of many handy data tools you can put on your shell's PATH. I wish many of these could be canonicalized as standard UNIX tools.
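
Since body passes the rest of the stream to whatever command follows it, it is not tied to grep. Here is a small sketch that uses sort instead. The function is repeated so the snippet runs on its own, and the input rows are made up for illustration.

```shell
# Same helper as above: print the first line, then hand the remaining
# input to the command given as arguments.
body() {
  IFS= read -r header
  printf '%s\n' "$header"
  "$@"
}

# Sort the data rows by the third comma-separated field; the header stays on top.
sorted=$(printf '%s\n' 'EVENT,VERSION,NAME' '1,2,xyz' '1,2,abc' '1,2,ert' |
  body sort -t, -k3,3)
printf '%s\n' "$sorted"
```

The header comes out first, followed by the abc, ert and xyz rows in that order.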

edited Jul 04 '22 at 00:17

answered Feb 04 '21 at 20:29

raven-rock


Perfect for grepping `lsof` and `ps` results and keeping the header! – John Feb 04 '22 at 07:01
Thank you for sharing this! I completely agree that this should be in the canon. – C. Murtaugh Feb 21 '23 at 18:28

score 1 · Answer 4 · answered Apr 21 '19 at 05:39


Try a combination of head and grep like so:

head -1 sample.txt > out.txt && grep -v "^#" sample.txt >> out.txt

Result

#"EVENT",VERSION, NAME
1,2,xyz
1,2,abc
1,2,asd
1,2,ert
1,2,xyz
1,2,abc
1,2,xyz

Alternate method

grep "^#" sample.txt | head -1 > out.txt && grep -v "^#" sample.txt >> out.txt

That is: grep the lines beginning with #, keep only the first one, and write it to a file. Then grep all lines not starting with # and append those lines to the same output file.
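
Both commands above rely on the header starting with #. If it does not, the first command writes the header twice and the alternate method picks up the first later line that starts with # instead. Here is a sketch of a variant that avoids this by filtering only from line 2 onward with tail -n +2. The temporary directory and the sample data (with a header that has no leading #) are assumptions for the demonstration.

```shell
workdir=$(mktemp -d)
printf '%s\n' \
  'EVENT,VERSION,NAME' \
  '1,2,xyz' \
  '#"EVENT",VERSION, NAME' \
  '1,2,abc' > "$workdir/sample.txt"

# Write the header unconditionally, then filter only lines 2 onward,
# so the header is never matched (or duplicated) by grep.
head -n 1 "$workdir/sample.txt" > "$workdir/out.txt" &&
  tail -n +2 "$workdir/sample.txt" | grep -v '^#' >> "$workdir/out.txt"

result=$(cat "$workdir/out.txt")
printf '%s\n' "$result"

rm -rf "$workdir"
```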

answered Apr 21 '19 at 05:39

zedfoxus


When the header line doesn't start with `#`, it goes wrong. Also wrong is `head -1 sample.txt out.txt && grep -v "^#" sample.txt >> out.txt`. – Walter A Sep 03 '22 at 13:28

score 1 · Answer 5 · answered Apr 21 '19 at 21:13


This works in any awk. It prints a line if its line number is 1 or if the line doesn't start with #:

$ awk 'NR==1 || !/^#/' file
#"EVENT",VERSION, NAME
1,2,xyz
1,2,abc
1,2,asd
1,2,ert
1,2,xyz
1,2,abc
1,2,xyz
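
The repeated header in sample.txt looks like what you get when several files are concatenated, though that is a guess. If so, the same awk program can read the parts directly: NR counts lines across all input files, so only the very first header is kept. This is a sketch with hypothetical part files.

```shell
workdir=$(mktemp -d)
printf '%s\n' '#"EVENT",VERSION, NAME' '1,2,xyz' '1,2,abc' > "$workdir/part1.txt"
printf '%s\n' '#"EVENT",VERSION, NAME' '1,2,asd' > "$workdir/part2.txt"

# NR is the line number across all files, so NR==1 is true only for the
# first line of the first file; later '#' lines are dropped.
merged=$(awk 'NR==1 || !/^#/' "$workdir/part1.txt" "$workdir/part2.txt")
printf '%s\n' "$merged"

rm -rf "$workdir"
```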

answered Apr 21 '19 at 21:13

Ed Morton


score 0 · Answer 6 · 2019-04-23T18:27:42.020


Tested with GNU sed (the 0,/regex/ address form is a GNU extension):

sed '0,/^#/n;/^#/d' sample.txt

edited Apr 23 '19 at 18:27

answered Apr 21 '19 at 10:55
