example file
aaa [bbb bb] ccc "ddd dd" eee
bbb [ccc cc] ddd "eee ee" fff
expected:
line1
s1="aaa", s2="bbb bb", s3="ccc", s4="ddd dd", s5="eee"
line2
s1="bbb", s2="ccc cc", s3="ddd", s4="eee ee", s5="fff"
Thanks in advance!
Using GNU awk, you may use this:
awk -v OFS=", " -v FPAT='\\[[^]]*\\]|"[^"]*"|[^[:space:]]+' '{
   for (i=1; i<=NF; i++) {
      gsub(/^[["]|[]"]$/, "", $i)
      $i = "s" i "=\"" $i "\""
   }
   $0 = "line" NR ORS $0
} 1' file
Output:
line1
s1="aaa", s2="bbb bb", s3="ccc", s4="ddd dd", s5="eee"
line2
s1="bbb", s2="ccc cc", s3="ddd", s4="eee ee", s5="fff"
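To see what the FPAT pattern does on its own, here is a small sketch that prints each field gawk finds before any brackets or quotes are stripped. It assumes gawk 4.0 or later (FPAT is a gawk extension), and show_fields is a hypothetical helper name used only for this illustration.

```shell
#!/usr/bin/env bash
# show_fields: hypothetical helper, not part of the answer above.
# Reads lines on stdin and prints every field matched by FPAT, numbered.
# The three alternatives match: [bracketed text], "quoted text", or a bare word.
show_fields() {
  gawk -v FPAT='\\[[^]]*\\]|"[^"]*"|[^[:space:]]+' '{
    for (i = 1; i <= NF; i++) printf "%d: <%s>\n", i, $i
  }'
}
```

Piping the first sample line through it, for example with printf '%s\n' 'aaa [bbb bb] ccc "ddd dd" eee' | show_fields, should list five fields, with the second and fourth still wrapped in their brackets and quotes. The gsub call in the answer then removes those delimiters.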
bash-only -
$: IFS=']"[' read -a line < infile # read the "groups"
$: line=( "${line[@]% }" ) # strip trailing spaces
$: line=( "${line[@]# }" ) # strip leading spaces
The line array now has your scrubbed data.
Shown in steps -
$: IFS=']"[' read -a line < infile
$: printf "[%s]\n" "${line[@]}"
[aaa ]
[bbb bb]
[ ccc ]
[ddd dd]
[ eee]
$: line=( "${line[@]% }" )
$: printf "[%s]\n" "${line[@]}"
[aaa]
[bbb bb]
[ ccc]
[ddd dd]
[ eee]
$: line=( "${line[@]# }" )
$: printf "[%s]\n" "${line[@]}"
[aaa]
[bbb bb]
[ccc]
[ddd dd]
[eee]
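The steps above read only the first line of infile, because read consumes one line per call. A minimal sketch of the same idea wrapped in a loop, so that every line is processed and printed in the format the question asks for, might look like this. format_fields is a hypothetical helper name, and the sketch assumes the input looks like the sample: bare words separated from bracketed or quoted groups by single spaces.

```shell
#!/usr/bin/env bash
# format_fields: hypothetical helper, not part of the answer above.
# Reads lines on stdin, splits each on ], " and [, trims one space from
# each end of every piece, and prints lineN followed by s1="...", s2="...", ...
format_fields() {
  local n=0 i out
  local -a line
  # The || clause also handles a final line that lacks a trailing newline.
  while IFS=']"[' read -r -a line || (( ${#line[@]} )); do
    n=$((n + 1))
    line=( "${line[@]% }" )   # strip one trailing space per element
    line=( "${line[@]# }" )   # strip one leading space per element
    out=""
    for i in "${!line[@]}"; do
      # Prefix ", " only when out already holds an earlier field.
      out+="${out:+, }s$((i + 1))=\"${line[i]}\""
    done
    printf 'line%d\n%s\n' "$n" "$out"
  done
}

format_fields <<'EOF'
aaa [bbb bb] ccc "ddd dd" eee
bbb [ccc cc] ddd "eee ee" fff
EOF
```

Because the split happens only on the bracket and quote characters, two bare words in a row (with no group between them) would stay together in one element, and a line that begins with a bracket or quote would yield an empty first element. For input like that, the gawk FPAT approach is likely the safer choice.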