1

I was just reading a page of the C# language specification, https://msdn.microsoft.com/en-us/library/aa664665(v=vs.71).aspx, and there is some text that says "Any character with Unicode class Zs", in the context of defining white space.

Please, what is meant by "Any character with Unicode class Zs"?

It might be linked to regular expressions, so I am adding the regex tag.

S Meaden

2 Answers

1

Every Unicode code point is assigned a General Category. Code points in the Zs category are the "Separator, Space" characters, such as U+0020 SPACE and U+00A0 NO-BREAK SPACE.
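As a rough illustration of that classification, the sketch below asks Java for the category of a few code points. `Character.getType` and `Character.SPACE_SEPARATOR` are standard library members; the class name `ZsCheck` and the helper `isSpaceSeparator` are hypothetical names chosen for this example only.

```java
// Minimal sketch: test whether a code point has Unicode General Category Zs.
// ZsCheck and isSpaceSeparator are hypothetical names used for illustration.
class ZsCheck {

    // Character.getType returns the general category of the code point;
    // Character.SPACE_SEPARATOR is the constant that corresponds to Zs.
    static boolean isSpaceSeparator(int codePoint) {
        return Character.getType(codePoint) == Character.SPACE_SEPARATOR;
    }

    public static void main(String[] args) {
        // SPACE, NO-BREAK SPACE, IDEOGRAPHIC SPACE, TAB, LINE FEED, letter A
        int[] samples = {0x0020, 0x00A0, 0x3000, 0x0009, 0x000A, 'A'};
        for (int cp : samples) {
            System.out.printf("U+%04X Zs=%b%n", cp, isSpaceSeparator(cp));
        }
    }
}
```

Tab and line feed are control characters (category Cc), so they are white space in the C# grammar through separate rules rather than through the Zs rule.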

Remy Lebeau
  • That's really good, I am looking at http://stackoverflow.com/questions/4731055/whitespace-matching-regex-java. Do all these characters match together with regular expression [\s\u0085\p{Z}] as per an answer in that post? – S Meaden May 30 '16 at 22:15
  • Ok, will accept this now, will go on to test some regular expressions in Java to see if [\s\u0085\p{Z}] works. Thanks. – S Meaden May 30 '16 at 22:20
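
The character class mentioned in the comments can be tried out with a small Java sketch like the one below. It assumes default `Pattern` flags (no `UNICODE_CHARACTER_CLASS`); the class name `ZsRegexDemo` and the helper `matches` are hypothetical. Note that `\p{Z}` covers all three separator categories (Zs, Zl, Zp), while `\p{Zs}` covers only the space separators.

```java
import java.util.regex.Pattern;

// Minimal sketch comparing three patterns on single code points.
// ZsRegexDemo and matches are hypothetical names used for illustration.
class ZsRegexDemo {

    // Only the Zs category (space separators).
    static final Pattern ZS = Pattern.compile("\\p{Zs}");

    // The class from the comments: ASCII white space, U+0085 (NEL),
    // and every separator category (Zs, Zl, Zp).
    static final Pattern BROAD = Pattern.compile("[\\s\\u0085\\p{Z}]");

    // Plain ASCII white space, as matched with default flags.
    static final Pattern PLAIN = Pattern.compile("\\s");

    // True if the pattern matches the one-code-point string exactly.
    static boolean matches(Pattern p, int codePoint) {
        String s = new String(Character.toChars(codePoint));
        return p.matcher(s).matches();
    }

    public static void main(String[] args) {
        // SPACE, TAB, NEL, NO-BREAK SPACE, LINE SEPARATOR, IDEOGRAPHIC SPACE
        int[] samples = {0x0020, 0x0009, 0x0085, 0x00A0, 0x2028, 0x3000};
        for (int cp : samples) {
            System.out.printf("U+%04X Zs=%b broad=%b plain=%b%n",
                    cp, matches(ZS, cp), matches(BROAD, cp), matches(PLAIN, cp));
        }
    }
}
```

With default flags, plain `\s` does not match U+00A0 or U+0085, which is why the broader class adds them explicitly.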
0

It means the space separator characters, which are a subset of whitespace (tab and newline, for example, are not in Zs). Check out this link for the list: http://www.fileformat.info/info/unicode/category/Zs/list.htm
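
A list like the one at that link can also be generated locally. The following is a sketch only: `ZsList` and `zsCodePoints` are hypothetical names, and the exact result depends on the Unicode version bundled with the Java runtime in use, so no count is given here.

```java
import java.util.ArrayList;
import java.util.List;

// Minimal sketch: collect every code point the runtime classifies as Zs.
// ZsList and zsCodePoints are hypothetical names used for illustration.
class ZsList {

    static List<Integer> zsCodePoints() {
        List<Integer> result = new ArrayList<>();
        // Scan the whole Unicode range and keep the space separators.
        for (int cp = Character.MIN_CODE_POINT; cp <= Character.MAX_CODE_POINT; cp++) {
            if (Character.getType(cp) == Character.SPACE_SEPARATOR) {
                result.add(cp);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Print each code point with the name the runtime knows for it.
        for (int cp : zsCodePoints()) {
            System.out.printf("U+%04X %s%n", cp, Character.getName(cp));
        }
    }
}
```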

Omar El Halabi