Displaying 1 result from an estimated 1 matches for "tr27".
Did you mean:
r27
2002 Jan 10
3
UTF8_LANG: a much better idea
I've found a much better solution; it's standard (in Unicode itself),
simple and more flexible: Unicode language tagging. It was made for just
this purpose, in fact.
A technical description is at http://www.unicode.org/unicode/reports/tr27/#tag
which, like all specs, makes it sound a bit more complicated than it
really is.
It comes down to this: mark the language of text with U+E0001 LANGUAGE
TAG, followed by the RFC 3066 language ID (ie. "ja") encoded in
lowercase ASCII plus 0xE0000.
There's really nothing needed in...