I'm searching on how to use wildcard characters as part of the removal criteria for a section of a corpus. I was unable to find anything on SO or google related to this issue.
Purpose: Analyzing large dataset of standardized notes where employee input is broken into sections of the text.
Example data:
***Date; Area: asdfwerqw Detail: xxxxx Requested Action: xxxxxx Assigned to: John Doe
Portion to extract for analysis:
Detail:xxxxx Requested Action:xxxxxx
Number of items before Detail may be more. Also, Assigned to: may not appear.