thr3ads.net - similar to: "Unicode JSON Support"

Displaying 20 results from an estimated 40000 matches similar to: "Unicode JSON Support"

Parsing and deparsing of escaped unicode characters

2014 Jul 28

Parsing and deparsing of escaped unicode characters

In both R and JSON (and many other languages), unicode characters can be escaped using a backslash followed by a lowercase "u" and a 4 digit hex code. However when deparsing a character vector in R on Windows, the non-latin characters get escaped as "<U+" followed by their 4 digit hex code and ">": > x <- "I like \u5BFF\u53F8" > cat(x) I like

Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints

2024 Jan 09

Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints

On Mon, Jan 08, 2024 at 02:01:46PM +0100, Robert Stepanek wrote: > Removing the whole block will cause word-breaker to not correctly > handle halfwidth Katakana, such as "??????????" which it would treat > as a single term, whereas it should be two: ??????and ????). > > My pull request causes word-breaker to only handle halfwidth Katakana > and Hangul codepoints as

[Repost, with Formatting] Trying to understand unicode character entry, goes into postgres DB backing rails, saved to yaml as \xc4\x81

2008 Jan 05

[Repost, with Formatting] Trying to understand unicode character entry, goes into postgres DB backing rails, saved to yaml as \xc4\x81

Apologies on unformatted send previously, i hit Enter and the web UI posted, to my chagrin. 1. Examine the Unicode standard''s code page collection for "Latin small letter a with macron". 2. Nets U0100.pdf 3. "Latin small letter a with macron" appears on chart as 0101. This is a hexidemial number which points to U+0101 as its code point. Converting 0101 to decimal

Trying to understand unicode character entry, goes into postgres DB backing rails, saved to yaml as \

2008 Jan 05

Trying to understand unicode character entry, goes into postgres DB backing rails, saved to yaml as \

Examine the Unicode standard''s code page collection for "Latin small letter a with macron". Nets U0100.pdf "Latin small letter a with macron" appears on chart as 0101. This is a hexidemial number which points to U+0101 as its code point. Converting 0101 to decimal gets you 257, this is the same as the HTML entity code. HTML code point is 257. That is &257; gives

Unicode Support

2012 Oct 22

Unicode Support

Hi, I have a problem about the Latin letters in R. I try to get data from facebook to oracle and while R is reading the data, it cannot write some letters like "ü,ş,ğ" etc and just mark as "?". I know these are the Latin letters and R cannot read it but is there any tool or function to convert these letters to "u,s,g"? Thanks so lot [[alternative HTML version

No subject

2003 Dec 01

No subject

8. You have join to this mail a text file with the few characters that don't = match. On Swat, it's easier. = I use Swat from my win98 box. When I write a share comment with some chara= cters, there are not translated properly inside smb.conf file. = When I read my smb.conf file, I noticed that for example a cp1252 characte= r like (0xC1) has been changed onto an ISO8859-15 character

Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints

2024 Jan 08

Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints

On Sun, Jan 7, 2024, at 7:45 PM, Olly Betts wrote: > I've restarted trac. I now created a pull request: https://github.com/xapian/xapian/pull/329 Should I create a trac issue, too? > Assuming the latter is valid, just removing this block (or removing the > parts of it which are Lu or Ll) should fix the problem as then > tokenisation will switch mode - I tried this and it fixes

Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints

2024 Jan 04

Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints

I think I found a bug in Xapian 1.5 when using FLAG_WORD_BREAKS for input that contains characters in Unicode Halfwidth and Fullwidth Forms (https://unicode.org/charts/PDF/UFF00.pdf). Since I am undecided yet if and how to fix this in Xapian I haven't come up with a pull request. Because trac currently is offline, I could not file a bug. I hope it's OK to post my analysis here first,

WxRuby2 Unicode

2005 Nov 28

WxRuby2 Unicode

Hi Please find some bits n pieces for CVS for Wxruby2''s unicode support 1) a patch for typemap.i (typemap_unicode.patch) - this makes methods that return strings (eg TextCtrl#get_value) return them as proper UTF8 strings. Currently, if the strings contain non-latin characters, an empty string is returned. This should only affect unicode builds, according to the method docs. 2) a

How to make UNICODE characters to display correctly?

2008 Jul 24

How to make UNICODE characters to display correctly?

I run WBridge5 (French version) under wine-1.1.1 on FreeBSD-7.0. It runs ok except for every UNICODE character is replaced by the little box. Also in some message boxes symbols that obviously should be some French characters that aren't part of normal Latin alphabet are replaced with Chinese characters. How to make UNICODE to display correctly? Thank you, Yuri

5. Re: Leer datos de Unicode (Juan JosŽé Vidal Agust’ín)

2010 Apr 23

5. Re: Leer datos de Unicode (Juan JosŽé Vidal Agust’ín)

Estimados: Yo tengo un archivo de texto Unicode y no funciona read.table con encoding Latin-1 o UTF-8 estando en un entorno Windows. Si no es con ninguno de los anteriores, ¿como lo importo? ¿Cómo puedo saber que codificación tiene? Gracias, Sebastián. El día 22 de abril de 2010 07:00, <r-help-es-request en r-project.org> escribió: > Envíe los mensajes para la lista R-help-es a

[Bug 1726] New: invalid json generated by ipset list -output json

2023 Dec 05

[Bug 1726] New: invalid json generated by ipset list -output json

https://bugzilla.netfilter.org/show_bug.cgi?id=1726 Bug ID: 1726 Summary: invalid json generated by ipset list -output json Product: ipset Version: unspecified Hardware: x86_64 OS: Debian GNU/Linux Status: NEW Severity: trivial Priority: P5 Component: default Assignee:

[ANNOUNCE] libX11 1.6.3

2015 Mar 09

[ANNOUNCE] libX11 1.6.3

This release of libX11 looks bigger than it is, due to a lot of spec/doc cleanup work that doesn't affect the code itself. There is still a good deal of bug fixes, code cleanup, locale improvements, and compose key table additions, including new UTF-8 compose sequences for: <Multi_key> <R> <equal> : "<U+20B9>" U20b9 # INDIAN RUPEE SIGN <Multi_key>

special latin1 do not print as glyphs in current devel on windows

2017 Aug 01

special latin1 do not print as glyphs in current devel on windows

You seem confused about Latin-1: those characters are not in Latin-1. (MicroSoft code pages are a proprietary encoding, some code pages such as CP1252 being extensions to Latin-1.) You have not given the 'at a minimum information' asked for in the posting guide so we have no way to reproduce this, and without showing us the output on your system, we have no idea what you saw. [As a

Rails DB defaults to utf8 for mysql -- but unicode for postgresql

2011 May 20

Rails DB defaults to utf8 for mysql -- but unicode for postgresql

Hi, Creating a new Rails application with -d postgresql sets the encoding in the database.yml to unicode Creating a new Rails with -d mysql sets the encoding to utf8 Any ideas why this difference? I''ve been having problems with encoding due to localization, accents, json and what no. I thought I was using utf8 and not unicode. Would this have any impact? If this would be an issue?

Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints

2024 Jan 10

Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints

On Tue, Jan 9, 2024, at 3:28 AM, Olly Betts wrote: > Thanks, that looks good - now merged. Thanks! > Did you already check the other ranges for cased letters? I can but if > you have already there's not much point. I did not. If you find time, that'd be great. Otherwise I can make room for it in the next days. > > The fullwidth "????? ??????" tests suggests to

string comparison functions

2007 Apr 03

string comparison functions

Hi, What rules are used when Wine libraries compare Unicode strings? I need to know when ascii characters can match their approximate Unicode equivalents. For example: 'U+1D49 MODIFIER LETTER SMALL E' matches 'U+0065 LATIN SMALL LETTER E' I am debugging IE javascript security issues. IE6 on Wine (Linux Debian) matches more equivalents than IE6 on Win98 and WinXP. -- Tomas

Question about unicode characters in tcltk

2007 Aug 14

Question about unicode characters in tcltk

hello list, Can someone help me figure out why the following code doesn't work? I'm trying to but both Greek letters and subscripts into a tcltk menu. The code creates all the mu's, and the 1 and 2 subscripts, but it won't create the 0. Is there a certain set of characters that R won't recognize the unicode for? Or am I input the \u2080 incorrectly? library(tcltk) m

DO NOT REPLY [Bug 5554] New: File has vanished for files with foreign/unicode characters

2008 Jun 21

DO NOT REPLY [Bug 5554] New: File has vanished for files with foreign/unicode characters

https://bugzilla.samba.org/show_bug.cgi?id=5554 Summary: File has vanished for files with foreign/unicode characters Product: rsync Version: 2.6.9 Platform: x86 OS/Version: Windows XP Status: NEW Severity: normal Priority: P3 Component: core AssignedTo: wayned@samba.org

Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints

2024 Jan 07

Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints

On Thu, Jan 04, 2024 at 05:50:22PM +0100, Robert Stepanek wrote: > Since I am undecided yet if and how to fix this in Xapian I haven't > come up with a pull request. Because trac currently is offline, I > could not file a bug. I hope it's OK to post my analysis here first, > I'll be happy to follow up reporting that bug proper later (should we > conclude that it actually

similar to: Unicode JSON Support