I'm trying to filter the following log file:
+---------+---------+---------+---------+---------+---------+---------+----
.Logon hostname/username,
*** Logon successfully completed.
*** Teradata Database Release is 14.00.06.05
*** Teradata Database Version is 14.00.06.05
*** Transaction Semantics are BTET.
*** Session Character Set Name is 'ASCII'.
*** Total elapsed time was 1 second.
+---------+---------+---------+---------+---------+---------+---------+----
select current_timestamp as started_test;
*** Query completed. One row found. One column returned.
*** Total elapsed time was 1 second.
started_test
--------------------------------
2014-10-06 17:44:39.220000+00:00
+---------+---------+---------+---------+---------+---------+---------+----
select * from database.view sample 2;
*** Query completed. 2 rows found. 41 columns returned.
*** Total elapsed time was 2 seconds.
select current_timestamp as finished_test;
*** Query completed. One row found. One column returned.
*** Total elapsed time was 1 second.
finished_test
--------------------------------
2014-10-06 17:44:41.330000+00:00
with this logstash filter
input{
file {
path => "/home/iv41/perfmon.log"
}
stdin {}
}
filter {
grok{
match => ["message", "%{/\s+started_test/:start_time} START id: (?<task_id>.*)"]
add_tag => ["testStarted"]
}
grok{
match => ["message", "%{/\s+finished_test/:end_time} END id: (?<task_id>.*)"]
add_tag => ["testEnded"]
}
if [start_time] != "/\s+started_test/"{
if [end_time] != "/\s+finished_test/"{
drop {}
}
}
elapsed {
start_tag => "testStarted"
end_tag => "testEnded"
unique_id_field => "task_id"
}
}
output{
stdout {}
}
I think there may be issues with my regex's and task ids.
Essentially, I'm trying to pull out the time it takes between "started_test" and "finished_test". Does anyone know a better way of doing it? or know where my code is out?