I'm writing some Ruby code that does some text analysis on domain names. In looking at the list of valid TLDs, I see some that use right-to-left languages such as:
- تونس.
- سوريا.
- السعودية.
Just looking at those TLDs alone shows that the dot (.) appears to the right instead of the left. If I came across a domain like this in the wild, how would the URL be structured? Specifically, a left-to-right URL is structured as:
<protocol>://[<user>:<pass>@]<host>:<port>/<path>[?<query>]
Additionally, the <host>
portion above could be broken out to look like:
[<subdomain>.]<domain>.<tld>
(e.g. "foo.example.com")
What is the structure of a right-to-left language URL?