bugzilla-daemon@dp3.samba.org
2005-Dec-04 22:58 UTC
[Bug 3299] New: rsync: now replaces non-ASCII character with numerical values
https://bugzilla.samba.org/show_bug.cgi?id=3299

           Summary: rsync: now replaces non-ASCII character with numerical values
           Product: rsync
           Version: 2.6.6
          Platform: Other
               URL: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=307242
        OS/Version: Linux
            Status: NEW
          Severity: major
          Priority: P3
         Component: core
        AssignedTo: wayned@samba.org
        ReportedBy: debian@fnen.eu.org
         QAContact: rsync-qa@samba.org

Package: rsync
Version: 2.6.6-1
Followup-For: Bug #307242

I've again used two identical Sarge systems, both using UTF-8.

Using rsync (over ssh), syncing (or listing the contents) from one system to the other, non-ASCII characters get replaced with numerical values like '\303\245', e.g.:

    user@system1:~$ rsync system2:~/test_åäö_test
    drwxr-xr-x 72 2005/09/25 01:39:30 test_\303\245\303\244\303\266_test

The changelog states:

--------------------------------------------------------------------------
rsync (2.6.5-1) unstable; urgency=low

  * Now should handle locale-specific characters better in logging output
    (i.e. the correct chars should be displayed, not '?').
--------------------------------------------------------------------------

This statement is obviously not correct. The '?' has just been replaced with a numerical value instead. (Almost as useless.)

Is this something that's being worked on upstream? Is there a workaround? All scripts and programs that depend upon the output have been almost useless for several months now.

-- System Information:
Debian Release: 3.1
Architecture: i386 (i686)
Kernel: Linux 2.6.8-2-686-smp
Locale: LANG=sv_SE.UTF-8, LC_CTYPE=sv_SE.UTF-8 (charmap=UTF-8)

Versions of packages rsync depends on:
ii libc6 2.3.2.ds1-22 GNU C Library: Shared libraries an
ii libpopt0 1.7-5 lib for parsing cmdline parameters

-- no debconf information

--
Configure bugmail: https://bugzilla.samba.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug, or are watching the QA contact.
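For readers following the report: the numbers in the listing are the octal values of the individual UTF-8 bytes of each character (å is the byte pair 0xC3 0xA5, i.e. \303\245). The sketch below reproduces that kind of escaping. It is only an illustration, not rsync's actual code; the function name escape_high_bit and its signature are assumptions made for this example.

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical helper (not taken from rsync): copy `name` into `out`,
 * replacing every byte that has the high bit set with a backslash
 * followed by three octal digits. ASCII bytes are copied unchanged.
 * Returns the number of bytes written (excluding the NUL terminator),
 * or -1 if `out` is too small. */
int escape_high_bit(const char *name, char *out, size_t outsize)
{
    assert(name != NULL && out != NULL);
    if (outsize == 0)
        return -1;

    size_t used = 0;
    for (const unsigned char *p = (const unsigned char *)name; *p; p++) {
        size_t need = (*p & 0x80) ? 4 : 1;

        /* Always leave room for the terminating NUL. */
        if (used + need + 1 > outsize)
            return -1;

        if (*p & 0x80)
            snprintf(out + used, 5, "\\%03o", (unsigned)*p);
        else
            out[used] = (char)*p;
        used += need;
    }
    out[used] = '\0';
    return (int)used;
}
```

Feeding it the UTF-8 bytes of test_åäö_test yields the same string that appears in the listing above, which shows that the name was escaped byte by byte rather than decoded as multibyte characters.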
bugzilla-daemon@dp3.samba.org
2006-Jan-17 04:04 UTC
DO NOT REPLY [Bug 3299] rsync: now replaces non-ASCII character with numerical values
https://bugzilla.samba.org/show_bug.cgi?id=3299

wayned@samba.org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Severity|major                       |normal
             Status|NEW                         |ASSIGNED

------- Comment #1 from wayned@samba.org 2006-01-16 21:04 MST -------
The changelog statement you cited is correct for locales that don't use multibyte encodings (UTF-8 is a multibyte encoding, so it is not covered). For instance, rsync outputs all the extended characters from ISO-8859-1 without any mangling.

I've been considering how best to add multibyte support to rsync, and I think that I can leverage the way iconv() works to have it tell me if characters are valid in the current locale. A patch that does this (along with adding filename conversion support) is here:

http://opencoder.net/iconv.diff

This is still a young patch, so be careful if you decide to give it a try. The patch applies to the latest CVS source. See the diff for build and usage instructions.
bugzilla-daemon@dp3.samba.org
2006-Feb-06 18:09 UTC
DO NOT REPLY [Bug 3299] rsync: now replaces non-ASCII character with numerical values
https://bugzilla.samba.org/show_bug.cgi?id=3299

wayned@samba.org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|ASSIGNED                    |RESOLVED
         Resolution|                            |FIXED

------- Comment #2 from wayned@samba.org 2006-02-06 11:08 MST -------
The CVS version now handles multibyte locales as long as the local system has iconv() (rsync uses an identity conversion to determine whether the characters in a name are valid in the current character set).
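The identity-conversion idea described in this comment can be sketched roughly as follows: open an iconv() descriptor that converts a character set to itself and treat a conversion error as "this name contains bytes that are not valid in that character set". This is a minimal sketch under stated assumptions, not the code that went into CVS; the function name name_is_valid_in_charset, its return convention, and the scratch buffer size are all invented for the example. It also assumes the platform's iconv() actually validates input when source and target sets are the same.

```c
#include <assert.h>
#include <errno.h>
#include <iconv.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper (not rsync's actual function): returns 1 if every
 * byte sequence in `name` is valid in `charset`, 0 if an invalid or
 * truncated sequence is found, and -1 if iconv cannot be set up. */
int name_is_valid_in_charset(const char *name, const char *charset)
{
    assert(name != NULL && charset != NULL);

    /* Identity conversion: same character set on both sides. */
    iconv_t cd = iconv_open(charset, charset);
    if (cd == (iconv_t)-1)
        return -1;

    char scratch[256];            /* converted output is discarded */
    char *in = (char *)name;      /* iconv() does not modify the input bytes */
    size_t inleft = strlen(name);
    int valid = 1;

    while (inleft > 0) {
        char *out = scratch;
        size_t outleft = sizeof scratch;

        if (iconv(cd, &in, &inleft, &out, &outleft) == (size_t)-1) {
            if (errno == E2BIG)
                continue;         /* scratch buffer full: reset it and go on */
            valid = 0;            /* EILSEQ (invalid) or EINVAL (truncated) */
            break;
        }
    }

    iconv_close(cd);
    return valid;
}
```

In a real program the character set name would normally come from the locale, for example nl_langinfo(CODESET) after a call to setlocale(LC_CTYPE, ""); it is passed in explicitly here so the sketch does not depend on which locales are installed. On systems where iconv lives in a separate library, linking with -liconv may be required.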
bugzilla-daemon@jump.samba.org
2006-Feb-07 16:41 UTC
DO NOT REPLY [Bug 3299] rsync: now replaces non-ASCII character with numerical values
https://bugzilla.samba.org/show_bug.cgi?id=3299

------- Comment #3 from wayned@samba.org 2006-02-07 05:33 MST -------
I should also mention that there is now an option that tells rsync that you want it to pass through all high-bit characters unescaped (instead of trying to escape only the ones that are invalid in the current locale): --8-bit-output (-8).