Fetch the rows (and some rows before and after) that contain a specific text or meet a condition

Question

I am analyzing log data from a process. The columns are id, date, time, log code, and log text. The id is unique per product. Date and time record when the log entry was captured. Log code is the code specific to the log text, and log text is a 256-character text that describes the process.

e.g.

ID  Date      time     log id  log text
A   01/10/18  9:00:00  bbb     process begin
A   01/10/18  9:00:00  yyy     dimensions not specified
A   01/10/18  9:00:30  fff     failure
A   01/10/18  9:00:30  ddd     dispatched
A   01/10/18  9:00:30  sss     process success
B   01/10/18  9:01:01  bbb     process begin
B   01/10/18  9:01:50  mmm     moved to stage2
B   01/10/18  9:02:50  aaa     space not allocated
B   01/10/18  9:02:50  fff     failure

I want to grep (or rather, create a subset of) the above dataset into a CSV or XLS output that meets the conditions below (these may change), for example:

each line where log text = failure, together with the 2 rows above it
all rows where log id is sss

So my expected output is:

ID  Date      time     log id  log text
A   01/10/18  9:00:00  bbb     process begin
A   01/10/18  9:00:00  yyy     dimensions not specified
A   01/10/18  9:00:30  fff     failure
B   01/10/18  9:01:50  mmm     moved to stage2
B   01/10/18  9:02:50  aaa     space not allocated
B   01/10/18  9:02:50  fff     failure
A   01/10/18  9:00:30  sss     process success

Using the discussion in the thread "Grep for a word, and if found print 10 lines before and 10 lines after the pattern match", I tried the following code:

import subprocess

filename = "filename.csv"
string_to_search = "failure"
extract = (subprocess.getstatusoutput("grep -C 2 '%s' %s"%(string_to_search, filename)))[1]
print(extract)

Ali Hallaji · Answer 1 · 2019-03-28T15:36:21.350

2

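You can use this code:

# open both files in one with-statement so the output file is always closed
with open("text.txt", "r") as f, open("output.txt", "w") as output:
    lines = f.readlines()
    # enumerate gives the real position; lines.index(line) would return the
    # first matching line, which is wrong when identical lines repeat
    for i, line in enumerate(lines):
        if "sss" in line:
            output.write(line)
        elif "failure" in line:
            # write up to two preceding rows, then the failure row itself
            output.writelines(lines[max(i - 2, 0):i + 1])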
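
The expected output in the question lists the failure blocks first and the sss rows afterwards, whereas a single pass over the file writes rows in file order. A minimal sketch of one way to get the question's ordering is below. The function name `select_rows` and its parameters are hypothetical, and it assumes plain substring matching on each line is good enough (a log text that happens to contain "sss" or "failure" would also match).

```python
def select_rows(lines, before=2, failure_text="failure", keep_code="sss"):
    """Return failure rows with `before` rows of context, then keep_code rows."""
    context = []  # indices of failure rows and the rows just before them
    for i, line in enumerate(lines):
        if failure_text in line:
            # clamp at 0 so a failure near the top does not wrap around
            for j in range(max(i - before, 0), i + 1):
                if j not in context:  # avoid duplicates when contexts overlap
                    context.append(j)
    # rows carrying the wanted log id that were not already emitted as context
    kept = [i for i, line in enumerate(lines)
            if keep_code in line and i not in context]
    return [lines[i] for i in context + kept]


# Sample rows from the question (header omitted)
rows = [
    "A   01/10/18    9:00:00 bbb process begin\n",
    "A   01/10/18    9:00:00 yyy dimensions not specified\n",
    "A   01/10/18    9:00:30 fff failure\n",
    "A   01/10/18    9:00:30 ddd dispatched\n",
    "A   01/10/18    9:00:30 sss process success\n",
    "B   01/10/18    9:01:01 bbb process begin\n",
    "B   01/10/18    9:01:50 mmm moved to stage2\n",
    "B   01/10/18    9:02:50 aaa space not allocated\n",
    "B   01/10/18    9:02:50 fff failure\n",
]
subset = select_rows(rows)
```

The returned list can be written out with `output.writelines(subset)`. Changing `before`, `failure_text`, or `keep_code` covers the "conditions can be changed" part of the question.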

edited Mar 28 '19 at 15:36

answered Mar 28 '19 at 13:45

Ali Hallaji

It is giving the same output file as the input. Where is it writing the new file, or is it updating the old file? – hitesh Mar 28 '19 at 15:33
Now this writes it to a file. – Ali Hallaji Mar 28 '19 at 15:42
Thanks. I think it wasn't working because of some formatting issues. – hitesh Mar 29 '19 at 09:12
