chrix Posted March 4, 2010 at 02:27 PM Report Posted March 4, 2010 at 02:27 PM bump. What's the state of affairs in 2010? Quote
renzhe Posted March 4, 2010 at 07:56 PM Report Posted March 4, 2010 at 07:56 PM Well, the URLs with Unicode characters are rather commonplace, see any Chinese wikipedia site for an example. As for the actual domain names, I'd say this is still very rare. I did notice one domain: http://好動網.tw/ but it's the exception rather than a rule. Unicode domains are mostly used for phishing, unfortunately. Quote
chrix Posted March 4, 2010 at 10:13 PM Report Posted March 4, 2010 at 10:13 PM Yeah, I've noticed the Wikipedia stuff, too. But with many browsers you don't get the Unicode characters in the URL, you often get the % Hex code thing. And I tried the URL, that's quite cool, but my browser (Chrome) doesn't display it, it's all numbers, in this case http://xn--hhrt8ms97a.tw/ (I'm not sure if this is an alternate URL as it doesn't look like Hex code). I was just checking if I hadn't missed anything big. The way I see it they made a big hype about it 6,7 years ago, and then it kinda fizzled... Quote
renzhe Posted March 4, 2010 at 11:09 PM Report Posted March 4, 2010 at 11:09 PM It will depend on the browser. I get full Unicode throughout. Quote
Hofmann Posted March 5, 2010 at 02:10 AM Report Posted March 5, 2010 at 02:10 AM Those who use Chinese characters might buy many domain names, e.g. 誰鬼網站.com and 谁鬼网站.com. Quote
imron Posted March 5, 2010 at 06:37 AM Report Posted March 5, 2010 at 06:37 AM it's all numbers, in this case http://xn--hhrt8ms97a.tw/ That's called punycode. Basically, it's unicode encoded into 7bit ascii so that it works within the existing framework for domain names. Many modern browsers will always display the punycode representation of an international address in the address bar, because in Unicode there are many characters that look similar to standard characters, making it easy for malicious sites to deceive users. Quote
roddy Posted July 7, 2010 at 01:45 AM Report Posted July 7, 2010 at 01:45 AM More news here. Five new top-level domains, 中国 and 台湾 in both simplified and traditional versions (will those be sold separately?) and 香港。 Quote
Matty Posted September 10, 2010 at 02:27 AM Report Posted September 10, 2010 at 02:27 AM The problem currently with IDNs (International Domain Names) is the domain name system is not properly designed for UTF-8 encoding. For example if you were to type the web address: http://www.汉语水平考试.com it would be translated to: http://www.xn--gwt091a1by53ikqn9d.com This is not very appealing for most people and one of the main reasons I'm concluding that they haven't been widely adopted. I was however considering doing a paper on this for my PHD, however that's another story. The point is the browser doesn't keep the International characters at the top and they're not stored in international characters at the end destination. Quote
Recommended Posts
Join the conversation
You can post now and select your username and password later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.