I've been using ScalaCheck for automatic unit testing. Its default String generator (i.e., its default Arbitrary[String]
instance) is a little too powerful, generally producing an unreadable jumble made up mainly of characters I'm not trying to support and my system can't even render.
I've set out to create some more Arbitrary[String]
instances, and am trying to find out what's out there.
Here are some examples of String classes that would be helpful for testing:
- basic multilingual plane strings
- astral strings
- latinate strings (including extensions a/b)
- French words
- left-to-right language strings
- right-to-left language strings
- Chinese sentences
- "web strings" (strings drawn from a character set that constitutes 99.9999% of web content)
- use your imagination ...
Are there libraries out there that can make these, or similar strings, at random?