search for: inttoutf8

Displaying 20 results from an estimated 25 matches for "inttoutf8".

2008 Sep 19
1
intToUtf8
Hi there, any explanation for this? > intToUtf8(66) Error in intToUtf8(66) : argument 'x' must be an integer vector > intToUtf8(c(66,55)) Error in intToUtf8(c(66, 55)) : argument 'x' must be an integer vector > intToUtf8(c(66,55),multiple=TRUE) Error in intToUtf8(c(66, 55)) : argument 'x' must be an integer vector...
2009 Nov 16
3
R crash with intToUtf8 on huge vectors (PR#14068)
Full_Name: George Russell Version: 2.10.0 OS: Windows XP Professional Version 2002 Service Pack 2 Submission from: (NULL) (217.111.3.131) Typing the following command into R --vanilla causes R to crash: k <- intToUtf8(rep(1e3,1e7)) This is the output of sessionInfo(): R version 2.10.0 (2009-10-26) i386-pc-mingw32 locale: [1] LC_COLLATE=German_Germany.1252 LC_CTYPE=German_Germany.1252 [3] LC_MONETARY=German_Germany.1252 LC_NUMERIC=C [5] LC_TIME=German_Germany.1252 attached base pac...
2009 Dec 10
1
Antwort: Re: Crash with Unicode and sub (PR#14114)
...to come to a stop using SUSE + WINE. Is it possible to run that lot on top of valgrind? Of course, it will probably take all day ... If not, I have a clue which might help. The problem seems to lie in the "sub" routine. In the original report I used -- cut here -- gctorture() u <- intToUtf8(c(rep(1e3,1e2),32,c(rep(1e3,1e2)))) v <- rep(u,1e2) v <- sub(" ","",v) v %in% "" -- cut here -- I've tried reducing this a bit more. Replacing intToUtf8 with a direct assignment writing out the string with Unicode escapes seems to make no difference. The %...
2009 Dec 07
3
Crash with Unicode and sub (PR#14114)
Full_Name: George Russell Version: 2.10.0 OS: Windows XP Version 2002 SP 2 Submission from: (NULL) (217.111.3.131) The following typed into R --vanilla induces a crash: -- cut here -- gctorture() u <- intToUtf8(c(rep(1e3,1e2),32,c(rep(1e3,1e2)))) v <- rep(u,1e2) v <- sub(" ","",v) v %in% "" -- cut here -- sessionInfo() says: -- cut here -- R version 2.10.0 (2009-10-26) i386-pc-mingw32 locale: [1] LC_COLLATE=German_Germany.1252 LC_CTYPE=German_Germany.1252 [3] L...
2009 Sep 17
0
Antwort: Re: Memory-management crash with UTF-8 on Windows (PR#13956)
Dear Professor Ripley, a. I apologise for not including the file. I sent the E-Mail to=20 R-bugs at r-project.org with an attachment, but the attachment evidently got=20 lost. However the text of the file is as follows: --- CUT HERE ---- gctorture() characters <- intToUtf8(200:300) indices <- rep(sequence(5),10) substrings <- substring(characters,indices,indices + 5) factor(substrings) --- CUT HERE --- b. There is no Unicode file involved, the Unicode comes from the=20 "intToUtf8" function. c. The operating system is Windows XP Professional (2002), S...
2017 Nov 30
2
R 3.4.3 is released
...etting when R is configured with --with-internal-tzcode (the default on macOS). * R CMD javareconf has been updated to recognize the use of a Java 9 SDK on macOS. BUG FIXES: * raw(0) & raw(0) and raw(0) | raw(0) again return raw(0) (rather than logical(0)). * intToUtf8() converts integers corresponding to surrogate code points to NA rather than invalid UTF-8, as well as values larger than the current Unicode maximum of 0x10FFFF. (This aligns with the current RFC3629.) * Fix calling of methods on S4 generics that dispatch on ... when...
2017 Nov 30
2
R 3.4.3 is released
...etting when R is configured with --with-internal-tzcode (the default on macOS). * R CMD javareconf has been updated to recognize the use of a Java 9 SDK on macOS. BUG FIXES: * raw(0) & raw(0) and raw(0) | raw(0) again return raw(0) (rather than logical(0)). * intToUtf8() converts integers corresponding to surrogate code points to NA rather than invalid UTF-8, as well as values larger than the current Unicode maximum of 0x10FFFF. (This aligns with the current RFC3629.) * Fix calling of methods on S4 generics that dispatch on ... when...
2017 Dec 01
0
R 3.4.3 is released
...(the default on macOS). > > * R CMD javareconf has been updated to recognize the use of a Java > 9 SDK on macOS. > > BUG FIXES: > > * raw(0) & raw(0) and raw(0) | raw(0) again return raw(0) (rather > than logical(0)). > > * intToUtf8() converts integers corresponding to surrogate code > points to NA rather than invalid UTF-8, as well as values larger > than the current Unicode maximum of 0x10FFFF. (This aligns with > the current RFC3629.) > > * Fix calling of methods on S4 generics tha...
2011 Nov 11
1
Generating the Ctrl-M character
Dear R-helpers, I want to append a Ctrl-M character to a string and then save it to a text file. mystring<-"This is a test." # How do I add a Ctrl-M to it in the end ?? cat(mystring,file="testfile") Many thanks, Ashim [[alternative HTML version deleted]]
2020 Apr 04
0
Possible Bug In Validation of UTF-8 Sequences
As per `?intToUtf8`, and in the comments to `valid_utf8`[1], R intends to prevent illegal UTF-8 such as UTF-8 encoded UTF-16 surrogate pairs.? `R_nchar`, invoked via `base::nchar`, explicitly validates UTF-8 strings[2], but allows the surrogate: ??? > Encoding('\ud800') ??? [1] "UTF-8" ??? >...
2009 Nov 25
1
Eliminating 'Unprintable ASCII' characters
Hi all, I have a csv file containing words with *UNPRINTABLE ASCII* characters (described in the following table). Are there any viable method in eliminating these characters? I realise that *EXTENDED ASCII* characters (i.e , ¡, ¢, £, ¤ etc) can be removed or replaced via *"gsub"* or *"gregexpr"* functions. But am not certain with the *UNPRINTABLE ASCII* characters. Your
2008 Apr 15
2
a question of alphabetical order
Hi all, In Spanish vowels with accent like ?, ?, ... doesn't affect to the alphabetical order of vector of strings. I mean, a or ? don't matter for establishing the alphabetical order. Nevertheless, while working with R order, here is what I get. Given a file transport.txt medio#variable avi?n#34 barco#33 bicicleta#3 ?ngulo#37 cami?n#54 coche#23 tren#67 > toPlot <-
2009 Dec 14
0
R 2.10.1 is released
...For ncp less than but close to 80, pchisq() and qchisq() are more accurate for probablilities very close to 1 (a series expansion was truncated slightly too early). pchisq(x, df, ncp) can no longer return values just larger than one for large values of ncp. o intToUtf8() could fail when asked to produce 10Mb or more strings, something it was never intended to do: unfortunately Windows crashed R (other OSes reported a lack of resources). (PR#14068) o chisq.test() could fail when given argument 'x' or 'y' which...
2009 Dec 14
0
R 2.10.1 is released
...For ncp less than but close to 80, pchisq() and qchisq() are more accurate for probablilities very close to 1 (a series expansion was truncated slightly too early). pchisq(x, df, ncp) can no longer return values just larger than one for large values of ncp. o intToUtf8() could fail when asked to produce 10Mb or more strings, something it was never intended to do: unfortunately Windows crashed R (other OSes reported a lack of resources). (PR#14068) o chisq.test() could fail when given argument 'x' or 'y' which...
2010 Apr 22
2
R2.11.0 - rasterImage() and barplot fill-patterns
...t not 'wget', but also for some > hard-to-access URLs. > > o In Rd, \eqn and \deqn will render in HTML (and convert to text) > upper- and lower-case Greek letters (entered as \alpha ...), > \ldots, \dots, \ge and \le. > > o utf8ToInt() and intToUtf8() now map NA inputs to NA outputs. > > o file() has a new argument 'raw' which may help if it is used > with something other than a regular file, e.g. a character device. > > o New function strtoi(), a wrapper for the C function strtol. > > o as.oc...
2008 Oct 20
2
R 2.8.0 is released
...er. o help.search() now assumes that non-ASCII items are in latin1 if that makes sense (all known examples on CRAN are). o HoltWinters() and decompose() use a (statistically) more efficient computation for seasonal fits (they used to waste one period). o intToUtf8() and intToBits() now accept numeric vectors, truncating them to integers. o is.unsorted() gains an argument 'strictly'. It now works for classed objects with a >= or > method (as incorrectly documented earlier). o library() no longer warns about...
2008 Oct 20
2
R 2.8.0 is released
...er. o help.search() now assumes that non-ASCII items are in latin1 if that makes sense (all known examples on CRAN are). o HoltWinters() and decompose() use a (statistically) more efficient computation for seasonal fits (they used to waste one period). o intToUtf8() and intToBits() now accept numeric vectors, truncating them to integers. o is.unsorted() gains an argument 'strictly'. It now works for classed objects with a >= or > method (as incorrectly documented earlier). o library() no longer warns about...
2010 Apr 22
0
R 2.11.0 is released
...which have 'curl' but not 'wget', but also for some hard-to-access URLs. o In Rd, \eqn and \deqn will render in HTML (and convert to text) upper- and lower-case Greek letters (entered as \alpha ...), \ldots, \dots, \ge and \le. o utf8ToInt() and intToUtf8() now map NA inputs to NA outputs. o file() has a new argument 'raw' which may help if it is used with something other than a regular file, e.g. a character device. o New function strtoi(), a wrapper for the C function strtol. o as.octmode() and as.hexmode() now...
2010 Apr 22
0
R 2.11.0 is released
...which have 'curl' but not 'wget', but also for some hard-to-access URLs. o In Rd, \eqn and \deqn will render in HTML (and convert to text) upper- and lower-case Greek letters (entered as \alpha ...), \ldots, \dots, \ge and \le. o utf8ToInt() and intToUtf8() now map NA inputs to NA outputs. o file() has a new argument 'raw' which may help if it is used with something other than a regular file, e.g. a character device. o New function strtoi(), a wrapper for the C function strtol. o as.octmode() and as.hexmode() now...
2018 Apr 23
0
R 3.5.0 is released
...) and glm.fit get the same singular.ok = TRUE argument that lm() has had forever. As a consequence, in glm(*, method = <your_own>), user specified methods need to accept a singular.ok argument as well. * aspell() gains a filter for Markdown (.md and .Rmd) files. * intToUtf8(multiple = FALSE) gains an argument to allow surrogate pairs to be interpreted. * The maximum number of DLLs that can be loaded into R e.g. _via_ dyn.load() has been increased up to 614 when the OS limit on the number of open files allows. * Sys.timezone() on a Unix-alik...