I am trying to exclude delimiters that occur within text qualifiers, and I am trying to use Regex for this. However, I am new to Regex and have not been able to fully accomplish what I need. I would be very grateful if someone could help me out.
In Alteryx, I load a pipe-delimited flat text file as 'non-delimited' and specify that it has no text qualifiers, so each record arrives as a single string with its double-quote qualifiers intact. The input looks something like this:
- "aabb"|ccdd|eeff|gghh
- "aa|bb"|ccdd|eeff|gghh
- "aa|bb"|ccdd|"ee|ff"|gghh
- "aa|bb"|"cc|dd"|"ee|ff"|"gg|hh"
- "aabb"|"ccdd"|"eeff"|"gghh"
- "aabb"|"ccdd"|"eeff"|"gg|hh"
- aabb|ccdd|eeff|gghh
- "aa|bb"|ccdd|eeff|"gg|hh"
- aabb|cc|dd|eeff|gghh
- aabb|"cc||dd"|eeff|gghh
- aabb|"c|c|dd"|eeff|gghh
- "aa||bb"|ccdd|eeff|gghh
- "a|a|b|b"|ccdd|eeff|gghh
- "aabb"|ccdd|eeff|"g|g|hh"
- "aabb"|ccdd|eeff|"gg||hh"
I want to exclude all delimiters that are in between text qualifiers.
I have tried to use Regex to replace the delimiters within text qualifiers with nothing.
So far, I have tried the following Regex code for my target:
(")(.*?[^"])\|+(.*?)(")
And I have used the following for my replace: $1$2$3$4
However, this does not fix lines 11, 13, 14 and 15.
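To illustrate what goes wrong, here is a minimal sketch in Python. It assumes that Python's `re` module treats this pattern the same way as the Alteryx Regex engine, which I have not verified, and `single_pass` is just a name I made up for the illustration. The pattern appears to remove only one run of delimiters per quoted field in a single pass, so a field with several separate delimiters keeps the rest:

```python
import re

# The target and replace expressions from above, with $1..$4 written as \1..\4,
# which is how Python's re module spells backreferences in a replacement.
PATTERN = re.compile(r'(")(.*?[^"])\|+(.*?)(")')
REPLACEMENT = r'\1\2\3\4'


def single_pass(line: str) -> str:
    """Apply the replacement once over the whole line (hypothetical helper)."""
    return PATTERN.sub(REPLACEMENT, line)


# Only the first delimiter inside the quoted field is removed.
print(single_pass('aabb|"c|c|dd"|eeff|gghh'))  # prints: aabb|"cc|dd"|eeff|gghh
```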
I wish to obtain the following results:
- "aabb"|ccdd|eeff|gghh
- "aabb"|ccdd|eeff|gghh
- "aabb"|ccdd|"eeff"|gghh
- "aabb"|"ccdd"|"eeff"|"gghh"
- "aabb"|"ccdd"|"eeff"|"gghh"
- "aabb"|"ccdd"|"eeff"|"gghh"
- aabb|ccdd|eeff|gghh
- "aabb"|ccdd|eeff|"gghh"
- aabb|cc|dd|eeff|gghh
- aabb|"ccdd"|eeff|gghh
- aabb|"ccdd"|eeff|gghh
- "aabb"|ccdd|eeff|gghh
- "aabb"|ccdd|eeff|gghh
- "aabb"|ccdd|eeff|"gghh"
- "aabb"|ccdd|eeff|"gghh"
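The behaviour I am after can be sketched in Python as follows. This is only an illustration of the intended result, not an Alteryx solution: it assumes that qualifiers are always balanced and that a quoted field never contains an escaped quote, and `strip_inner_delimiters` is a hypothetical name. The idea is to match each quoted field as a whole and remove every delimiter inside it, instead of trying to capture the delimiter itself:

```python
import re

# A quoted field: an opening quote, any run of non-quote characters, a closing quote.
# Assumption: no escaped or doubled quotes occur inside a field.
QUOTED_FIELD = re.compile(r'"[^"]*"')


def strip_inner_delimiters(line: str, delimiter: str = "|") -> str:
    """Remove every delimiter found inside a quoted field; leave the rest untouched."""
    return QUOTED_FIELD.sub(
        lambda match: match.group(0).replace(delimiter, ""),
        line,
    )


samples = [
    '"aa|bb"|ccdd|"ee|ff"|gghh',
    'aabb|"c|c|dd"|eeff|gghh',
    'aabb|cc|dd|eeff|gghh',
]
for sample in samples:
    print(strip_inner_delimiters(sample))
```

Because each quoted field is handled as a single match, it does not matter how many delimiters it contains or whether they are adjacent, and lines without qualifiers pass through unchanged.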
Thank you in advance for helping me out!
With kind regards, Robin