uniq
(GNU coreutils 8.5) does not seem to distinguish between em- and en-dashes:
$ echo -e "a–b\na—b" | uniq -c
2 a–b
Is there any way to force this distinction? I've tried various settings for LC_COLLATE with no luck.
uniq
(GNU coreutils 8.5) does not seem to distinguish between em- and en-dashes:
$ echo -e "a–b\na—b" | uniq -c
2 a–b
Is there any way to force this distinction? I've tried various settings for LC_COLLATE with no luck.
Worked for me
echo -e "a–b\na—b" | LC_COLLATE=C uniq -c
1 a–b
1 a—b