I am using EmEditor and I see there is a "find and extract to new document" function that supports Regex statements. I am trying to extract some specific text from a Thunderbird mailbox text file. In the mailbox there are copies of customer service chats. Unfortunately, because we use a free version of this chat program it does not allow to export the data. In the body of the email is a lot of text including the chats and decoded attachments. But on the bottom of each chat is the name, email, company name, etc.
It looks like this:
Name: Tan
Email: someone@domcin.com
Operator: OperatorName
Start Time: 07/01/2014 14:43:47
End Time: 07/01/2014 15:35:22
Product/Service: Delivery
Phone: 123 1234567
Company: MyCompany Inc.
I try to extract the name, email, operator, product, phone and company. To make matters worse, not all have company since there are private people too. Also the telephone sometimes has a +60 or (60) or spaces, since the chat user could enter what we wants. I can do this manually but its 6k entries.
The question is if there would be a regex statement to find them. I could then use EmEditor to find this block and place the result in a new document and with a bit of tweaking I should be able to make a excel file to import into a CRM.
If this does not work with regex than does anyone know of a smart way to do this so I do not have to copy and paste all this?