I'm using the awesome regex module, trying its \X
grapheme support.
First, I try with the plain old .
>>> print regex.match('.', 'Ä').group(0)
>>> print regex.match('..', 'Ä').group(0)
Ä
It went as expected. Move on to \X
>>> print regex.match('\X', 'Ä').group(0)
>>> print regex.match('\X\X', 'Ä').group(0)
Ä
Why is it the same as .
? Shouldn't a single \X
be enough to capture the A-umlaut? Is it:
- My understanding of grapheme or the meaning of
\X
is wrong? - Some flag/switch I need to turn on first? (I've searched the documentation, couldn't find)
- Something with my environment? (Python 2.7.3, pip reports regex==2014.12.24)
- Bug in the library?
- Something else?