-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Hi, I've a problem with FreeBSD, UTF-8 and Sorting. e.g. there is a file with the following content Meier M?ller ?ger Ofner Schmidt I have set my Terminal to ISO-8859-1 Encoding and call sort on this file I get the following output. Meier M?ller Ofner ?ger Schmidt Which is correctly sorted. When i change my Terminal to UTF-8 encoding and convert the file to UTF-8 and call sort again I get the following output. Meier M?ller Ofner Schmidt ?ger which is wrong. The problem seams to be that the LC_COLLATE file in the de_DE.UTF-8 locale is linked to ../la_LN.US-ASCII/LC_COLLATE (as are all LC_COLLATE Files in any UTF-8 locale). After some Research i found a Mail from Kuba Lida in December 2008 (yeah that's 5 Years ago) stating the same Problem and got no response. Why isn't there a UTF-8 LC_COLLATE file for any language. Kuba Lida believed there was a Problem with multibyte collate files in FreeBSD. Is this true and are there plans to fix this problem. The same test under Linux works without problem. Regards Estartu - -- - --------------------------------------------------------------------------- Gerhard Schmidt | http://www.augusta.de/~estartu | Fischbachweg 3 | | PGP Public Key 86856 Hiltenfingen | JabberID: estartu at augusta.de | on request Germany | | -----BEGIN PGP SIGNATURE----- Version: GnuPG v1 Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/ iQCVAwUBUyApfwzx22nOTJQRAQJIbgP+MMSPepEsyG8Kx+QRDGJlfyQKK+r98/e+ ZiNPRMNjBpT7qrElJLvYfAuix3pOyqL3mq1DQJvZmqQxfoxEdy6GUf42i1Yk5gEX T05YtaeVRoXK/TetFt0UEcC3bXuXheu63aBpO4FU2v8CPTAyBwU6DUvV/v3AzXr6 j+mwws5n7so=J2tH -----END PGP SIGNATURE-----
Hi, * Gerhard Schmidt <estartu at ze.tum.de> [140312 10:31]:> -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1> I've a problem with FreeBSD, UTF-8 and Sorting. > [..] > The problem seams to be that the LC_COLLATE file in the de_DE.UTF-8 > locale is linked to ../la_LN.US-ASCII/LC_COLLATE (as are all > LC_COLLATE Files in any UTF-8 locale).> After some Research i found a Mail from Kuba Lida in December 2008 > (yeah that's 5 Years ago) stating the same Problem and got no response.> Why isn't there a UTF-8 LC_COLLATE file for any language. Kuba Lida > believed there was a Problem with multibyte collate files in FreeBSD. > Is this true and are there plans to fix this problem.AFAIR this is not just a question of changing the collate files. Have a look at https://wiki.freebsd.org/KonradJankowski/Collation for a project to fix that problem (stalled, unfortunately). Wolfgang