similar to: Unicode JSON Support

Displaying 20 results from an estimated 40000 matches similar to: "Unicode JSON Support"

2014 Jul 28
1
Parsing and deparsing of escaped unicode characters
In both R and JSON (and many other languages), unicode characters can be escaped using a backslash followed by a lowercase "u" and a 4 digit hex code. However when deparsing a character vector in R on Windows, the non-latin characters get escaped as "<U+" followed by their 4 digit hex code and ">": > x <- "I like \u5BFF\u53F8" > cat(x) I like
2024 Jan 09
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
On Mon, Jan 08, 2024 at 02:01:46PM +0100, Robert Stepanek wrote: > Removing the whole block will cause word-breaker to not correctly > handle halfwidth Katakana, such as "??????????" which it would treat > as a single term, whereas it should be two: ??????and ????). > > My pull request causes word-breaker to only handle halfwidth Katakana > and Hangul codepoints as
2008 Jan 05
0
[Repost, with Formatting] Trying to understand unicode character entry, goes into postgres DB backing rails, saved to yaml as \xc4\x81
Apologies on unformatted send previously, i hit Enter and the web UI posted, to my chagrin. 1. Examine the Unicode standard''s code page collection for "Latin small letter a with macron". 2. Nets U0100.pdf 3. "Latin small letter a with macron" appears on chart as 0101. This is a hexidemial number which points to U+0101 as its code point. Converting 0101 to decimal
2008 Jan 05
0
Trying to understand unicode character entry, goes into postgres DB backing rails, saved to yaml as \
Examine the Unicode standard''s code page collection for "Latin small letter a with macron". Nets U0100.pdf "Latin small letter a with macron" appears on chart as 0101. This is a hexidemial number which points to U+0101 as its code point. Converting 0101 to decimal gets you 257, this is the same as the HTML entity code. HTML code point is 257. That is &257; gives
2012 Oct 22
0
Unicode Support
Hi, I have a problem about the Latin letters in R. I try to get data from facebook to oracle and while R is reading the data, it cannot write some letters like "ü,ş,ğ" etc and just mark as "?". I know these are the Latin letters and R cannot read it but is there any tool or function to convert these letters to "u,s,g"? Thanks so lot [[alternative HTML version
2003 Dec 01
0
No subject
8. You have join to this mail a text file with the few characters that don't = match. On Swat, it's easier. = I use Swat from my win98 box. When I write a share comment with some chara= cters, there are not translated properly inside smb.conf file. = When I read my smb.conf file, I noticed that for example a cp1252 characte= r like (0xC1) has been changed onto an ISO8859-15 character
2024 Jan 08
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
On Sun, Jan 7, 2024, at 7:45 PM, Olly Betts wrote: > I've restarted trac. I now created a pull request: https://github.com/xapian/xapian/pull/329 Should I create a trac issue, too? > Assuming the latter is valid, just removing this block (or removing the > parts of it which are Lu or Ll) should fix the problem as then > tokenisation will switch mode - I tried this and it fixes
2024 Jan 04
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
I think I found a bug in Xapian 1.5 when using FLAG_WORD_BREAKS for input that contains characters in Unicode Halfwidth and Fullwidth Forms (https://unicode.org/charts/PDF/UFF00.pdf). Since I am undecided yet if and how to fix this in Xapian I haven't come up with a pull request. Because trac currently is offline, I could not file a bug. I hope it's OK to post my analysis here first,
2005 Nov 28
1
WxRuby2 Unicode
Hi Please find some bits n pieces for CVS for Wxruby2''s unicode support 1) a patch for typemap.i (typemap_unicode.patch) - this makes methods that return strings (eg TextCtrl#get_value) return them as proper UTF8 strings. Currently, if the strings contain non-latin characters, an empty string is returned. This should only affect unicode builds, according to the method docs. 2) a
2008 Jul 24
1
How to make UNICODE characters to display correctly?
I run WBridge5 (French version) under wine-1.1.1 on FreeBSD-7.0. It runs ok except for every UNICODE character is replaced by the little box. Also in some message boxes symbols that obviously should be some French characters that aren't part of normal Latin alphabet are replaced with Chinese characters. How to make UNICODE to display correctly? Thank you, Yuri
2010 Apr 23
3
5. Re: Leer datos de Unicode (Juan JosŽé Vidal Agust’ín)
Estimados: Yo tengo un archivo de texto Unicode y no funciona read.table con encoding Latin-1 o UTF-8 estando en un entorno Windows. Si no es con ninguno de los anteriores, ¿como lo importo? ¿Cómo puedo saber que codificación tiene? Gracias, Sebastián. El día 22 de abril de 2010 07:00, <r-help-es-request en r-project.org> escribió: > Envíe los mensajes para la lista R-help-es a
2023 Dec 05
3
[Bug 1726] New: invalid json generated by ipset list -output json
https://bugzilla.netfilter.org/show_bug.cgi?id=1726 Bug ID: 1726 Summary: invalid json generated by ipset list -output json Product: ipset Version: unspecified Hardware: x86_64 OS: Debian GNU/Linux Status: NEW Severity: trivial Priority: P5 Component: default Assignee:
2015 Mar 09
0
[ANNOUNCE] libX11 1.6.3
This release of libX11 looks bigger than it is, due to a lot of spec/doc cleanup work that doesn't affect the code itself. There is still a good deal of bug fixes, code cleanup, locale improvements, and compose key table additions, including new UTF-8 compose sequences for: <Multi_key> <R> <equal> : "<U+20B9>" U20b9 # INDIAN RUPEE SIGN <Multi_key>
2017 Aug 01
0
special latin1 do not print as glyphs in current devel on windows
You seem confused about Latin-1: those characters are not in Latin-1. (MicroSoft code pages are a proprietary encoding, some code pages such as CP1252 being extensions to Latin-1.) You have not given the 'at a minimum information' asked for in the posting guide so we have no way to reproduce this, and without showing us the output on your system, we have no idea what you saw. [As a
2011 May 20
2
Rails DB defaults to utf8 for mysql -- but unicode for postgresql
Hi, Creating a new Rails application with -d postgresql sets the encoding in the database.yml to unicode Creating a new Rails with -d mysql sets the encoding to utf8 Any ideas why this difference? I''ve been having problems with encoding due to localization, accents, json and what no. I thought I was using utf8 and not unicode. Would this have any impact? If this would be an issue?
2024 Jan 10
2
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
On Tue, Jan 9, 2024, at 3:28 AM, Olly Betts wrote: > Thanks, that looks good - now merged. Thanks! > Did you already check the other ranges for cased letters? I can but if > you have already there's not much point. I did not. If you find time, that'd be great. Otherwise I can make room for it in the next days. > > The fullwidth "????? ??????" tests suggests to
2007 Apr 03
1
string comparison functions
Hi, What rules are used when Wine libraries compare Unicode strings? I need to know when ascii characters can match their approximate Unicode equivalents. For example: 'U+1D49 MODIFIER LETTER SMALL E' matches 'U+0065 LATIN SMALL LETTER E' I am debugging IE javascript security issues. IE6 on Wine (Linux Debian) matches more equivalents than IE6 on Win98 and WinXP. -- Tomas
2007 Aug 14
2
Question about unicode characters in tcltk
hello list, Can someone help me figure out why the following code doesn't work? I'm trying to but both Greek letters and subscripts into a tcltk menu. The code creates all the mu's, and the 1 and 2 subscripts, but it won't create the 0. Is there a certain set of characters that R won't recognize the unicode for? Or am I input the \u2080 incorrectly? library(tcltk) m
2008 Jun 21
1
DO NOT REPLY [Bug 5554] New: File has vanished for files with foreign/unicode characters
https://bugzilla.samba.org/show_bug.cgi?id=5554 Summary: File has vanished for files with foreign/unicode characters Product: rsync Version: 2.6.9 Platform: x86 OS/Version: Windows XP Status: NEW Severity: normal Priority: P3 Component: core AssignedTo: wayned@samba.org
2024 Jan 07
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
On Thu, Jan 04, 2024 at 05:50:22PM +0100, Robert Stepanek wrote: > Since I am undecided yet if and how to fix this in Xapian I haven't > come up with a pull request. Because trac currently is offline, I > could not file a bug. I hope it's OK to post my analysis here first, > I'll be happy to follow up reporting that bug proper later (should we > conclude that it actually