I have multiline XML files (~800 lines each) in my S3 bucket and I want to index them in Elasticsearch, but I can't parse them in Logstash. Some fields are sometimes empty, so parsing the files manually is not practical.
My XML looks like this:
<ServiceSalesClosed>
<ErrorLevel>0</ErrorLevel>
<ErrorMessage/>
<LaborSaleCustomerPay>50.00</LaborSaleCustomerPay>
...
In my input I have this config:
codec => multiline {
  pattern => "<ServiceSalesClosed.*"
  what => "next"
}
In my filter I have the following config:
multiline {
  pattern => ["\t\t"]
  what => "next"
}
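For reference, this is roughly the pipeline I think I need. It is an untested sketch, not a working config. It assumes every document starts with a line containing the opening <ServiceSalesClosed tag, that each file holds whole documents, and that the multiline codec's default line limit (500, as far as I know) has to be raised for ~800-line files. The bucket name, region, `max_lines` value, and the `doc` target field are placeholders I made up.

```
input {
  s3 {
    bucket => "my-bucket"        # placeholder bucket name
    region => "us-east-1"        # placeholder region
    codec => multiline {
      # Any line that does NOT contain the opening tag is appended to the
      # previous line, so one whole document becomes one event.
      # "</ServiceSalesClosed>" does not match because of the slash.
      pattern => "<ServiceSalesClosed"
      negate => true
      what => "previous"
      max_lines => 2000          # assumed value; files are ~800 lines
    }
  }
}

filter {
  xml {
    source => "message"          # the joined multiline event
    target => "doc"              # placeholder field for the parsed XML
    force_array => false         # single elements become plain values, not arrays
  }
}
```

The idea is to let the xml filter handle the structure instead of matching individual lines, so that empty elements such as <ErrorMessage/> would not need special handling.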