José García Juanino
2016-Jul-27 09:19 UTC
sed command does not behave equal from 10.3 to 11.0
On 27 July 2016 at 11:01, Matthew D. Fuller <fullermd at over-yonder.net> wrote:
> On Wed, Jul 27, 2016 at 09:45:23AM +0100 I heard the voice of
> krad, and lo! it spake thus:
>> are you sure you aren't hitting a port or something?
>
> Locale dependant.
>
> % echo "abc_ABC.def" | env LANG=C sed -e 's/[^A-Z0-9]//g'
> ABC
>
> % echo "abc_ABC.def" | env LANG=en_US.UTF-8 sed -e 's/[^A-Z0-9]//g'
> bcABCdef
>
> (pre-branch -CURRENT)

The issue is that, under the same locale, the output is not the same
in 10.3 as in 11.0. It sounds like a bug to me ...
Hi.

There were some collation-related changes (*1) between 10.3 and 11,
so the results can change even with the same locale.

*1: For example, r302512.
    https://lists.freebsd.org/pipermail/svn-src-head/2016-July/088919.html

But I cannot understand why characters in the ASCII range are affected
under UTF-8 encoding.

On Wed, 27 Jul 2016 11:19:06 +0200
José García Juanino <jjuanino at gmail.com> wrote:

> On 27 July 2016 at 11:01, Matthew D. Fuller <fullermd at over-yonder.net> wrote:
> > On Wed, Jul 27, 2016 at 09:45:23AM +0100 I heard the voice of
> > krad, and lo! it spake thus:
> >> are you sure you aren't hitting a port or something?
> >
> > Locale dependant.
> >
> > % echo "abc_ABC.def" | env LANG=C sed -e 's/[^A-Z0-9]//g'
> > ABC
> >
> > % echo "abc_ABC.def" | env LANG=en_US.UTF-8 sed -e 's/[^A-Z0-9]//g'
> > bcABCdef
> >
> > (pre-branch -CURRENT)
>
> The issue is that, under the same locale, the output is not the same
> in 10.3 as 11.0. It sounds to me a bug ...

--
Tomoaki AOKI    junchoon at dec.sakura.ne.jp
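
For scripts that must give the same output on both releases, one possible workaround (a sketch, not something proposed in the thread) is to stop relying on how `A-Z` collates. POSIX leaves range expressions unspecified outside the POSIX locale, so the two usual options are to pin the C locale for that one command, or to use character classes instead of ranges. The variable names below are only illustrative; the input string and the sed expression are the ones quoted above.

```shell
#!/bin/sh
# Sample input taken from the thread.
input='abc_ABC.def'

# Option 1: pin the C locale for this one command, so A-Z and 0-9 are
# plain ASCII ranges regardless of the caller's LANG.
pinned=$(printf '%s\n' "$input" | env LC_ALL=C sed -e 's/[^A-Z0-9]//g')

# Option 2: use POSIX character classes, which select characters by
# class membership instead of by position in the collation order.
classes=$(printf '%s\n' "$input" | sed -e 's/[^[:upper:][:digit:]]//g')

printf '%s\n' "$pinned" "$classes"
```

Option 1 reproduces the `LANG=C` result shown in the thread (`ABC`). Option 2 keeps the caller's locale, which matters if the input may contain non-ASCII uppercase letters that should also be preserved.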