Displaying 20 results from an estimated 26 matches for "inttoutf8".
2008 Sep 19
1
intToUtf8
Hi there,
any explanation for this?
> intToUtf8(66)
Error in intToUtf8(66) : argument 'x' must be an integer vector
> intToUtf8(c(66,55))
Error in intToUtf8(c(66, 55)) : argument 'x' must be an integer vector
> intToUtf8(c(66,55),multiple=TRUE)
Error in intToUtf8(c(66, 55)) : argument 'x' must be an integer vector...
2009 Nov 16
3
R crash with intToUtf8 on huge vectors (PR#14068)
Full_Name: George Russell
Version: 2.10.0
OS: Windows XP Professional Version 2002 Service Pack 2
Submission from: (NULL) (217.111.3.131)
Typing the following command into R --vanilla causes R to crash:
k <- intToUtf8(rep(1e3,1e7))
This is the output of sessionInfo():
R version 2.10.0 (2009-10-26)
i386-pc-mingw32
locale:
[1] LC_COLLATE=German_Germany.1252 LC_CTYPE=German_Germany.1252
[3] LC_MONETARY=German_Germany.1252 LC_NUMERIC=C
[5] LC_TIME=German_Germany.1252
attached base pac...
2009 Dec 10
1
Antwort: Re: Crash with Unicode and sub (PR#14114)
...to come to a stop using SUSE + WINE. Is it
possible to run that lot on top of valgrind? Of course, it will probably
take all day ...
If not, I have a clue which might help. The problem seems to lie in the
"sub" routine. In the original report I used
-- cut here --
gctorture()
u <- intToUtf8(c(rep(1e3,1e2),32,c(rep(1e3,1e2))))
v <- rep(u,1e2)
v <- sub(" ","",v)
v %in% ""
-- cut here --
I've tried reducing this a bit more. Replacing intToUtf8 with a direct
assignment writing out the string with Unicode escapes seems to make no
difference. The %...
2009 Dec 07
3
Crash with Unicode and sub (PR#14114)
Full_Name: George Russell
Version: 2.10.0
OS: Windows XP Version 2002 SP 2
Submission from: (NULL) (217.111.3.131)
The following typed into R --vanilla induces a crash:
-- cut here --
gctorture()
u <- intToUtf8(c(rep(1e3,1e2),32,c(rep(1e3,1e2))))
v <- rep(u,1e2)
v <- sub(" ","",v)
v %in% ""
-- cut here --
sessionInfo() says:
-- cut here --
R version 2.10.0 (2009-10-26)
i386-pc-mingw32
locale:
[1] LC_COLLATE=German_Germany.1252 LC_CTYPE=German_Germany.1252
[3] L...
2024 Jun 01
2
Tools to modify highlighted areas in pdf documents?
...L = isNL & (x[which(isNL) + 1] == 10);
NL = NL + sum(isNL);
}
close(con)
idP = which(isNL)
idS = 935; # will vary with pdf and Annotations and ...;
nLast = 4; # usually 2 chunks
idx = idP[seq(idS, length.out = nLast)]
# Check: Right position?
# tmp = x[seq(idx[1] + 2, idx[1 + 2] - 1)]
# intToUtf8(tmp)
tmp = inflate(x[seq(idx[1] + 2, idx[nLast] - 1)])
intToUtf8(tmp$output)
# Output of inflate: an Example
# "/GS gs .56078434 .87058824 .97647059 rg\n
# 337.298 183.836 m 364.322 183.836 l 364.322 171.83 l 337.298 171.83 l h f\n"
# Note: /BBox[ 337.298 171.83 364.322 183.836]
The r...
2009 Sep 17
0
Antwort: Re: Memory-management crash with UTF-8 on Windows (PR#13956)
Dear Professor Ripley,
a. I apologise for not including the file. I sent the E-Mail to=20
R-bugs at r-project.org with an attachment, but the attachment evidently got=20
lost. However the text of the file is as follows:
--- CUT HERE ----
gctorture()
characters <- intToUtf8(200:300)
indices <- rep(sequence(5),10)
substrings <- substring(characters,indices,indices + 5)
factor(substrings)
--- CUT HERE ---
b. There is no Unicode file involved, the Unicode comes from the=20
"intToUtf8" function.
c. The operating system is Windows XP Professional (2002), S...
2017 Nov 30
2
R 3.4.3 is released
...etting when R is configured with --with-internal-tzcode
(the default on macOS).
* R CMD javareconf has been updated to recognize the use of a Java
9 SDK on macOS.
BUG FIXES:
* raw(0) & raw(0) and raw(0) | raw(0) again return raw(0) (rather
than logical(0)).
* intToUtf8() converts integers corresponding to surrogate code
points to NA rather than invalid UTF-8, as well as values larger
than the current Unicode maximum of 0x10FFFF. (This aligns with
the current RFC3629.)
* Fix calling of methods on S4 generics that dispatch on ... when...
2017 Nov 30
2
R 3.4.3 is released
...etting when R is configured with --with-internal-tzcode
(the default on macOS).
* R CMD javareconf has been updated to recognize the use of a Java
9 SDK on macOS.
BUG FIXES:
* raw(0) & raw(0) and raw(0) | raw(0) again return raw(0) (rather
than logical(0)).
* intToUtf8() converts integers corresponding to surrogate code
points to NA rather than invalid UTF-8, as well as values larger
than the current Unicode maximum of 0x10FFFF. (This aligns with
the current RFC3629.)
* Fix calling of methods on S4 generics that dispatch on ... when...
2017 Dec 01
0
R 3.4.3 is released
...(the default on macOS).
>
> * R CMD javareconf has been updated to recognize the use of a Java
> 9 SDK on macOS.
>
> BUG FIXES:
>
> * raw(0) & raw(0) and raw(0) | raw(0) again return raw(0) (rather
> than logical(0)).
>
> * intToUtf8() converts integers corresponding to surrogate code
> points to NA rather than invalid UTF-8, as well as values larger
> than the current Unicode maximum of 0x10FFFF. (This aligns with
> the current RFC3629.)
>
> * Fix calling of methods on S4 generics tha...
2011 Nov 11
1
Generating the Ctrl-M character
Dear R-helpers,
I want to append a Ctrl-M character to a string and then save it to a text
file.
mystring<-"This is a test."
# How do I add a Ctrl-M to it in the end ??
cat(mystring,file="testfile")
Many thanks,
Ashim
[[alternative HTML version deleted]]
2020 Apr 04
0
Possible Bug In Validation of UTF-8 Sequences
As per `?intToUtf8`, and in the comments to `valid_utf8`[1], R
intends to prevent illegal UTF-8 such as UTF-8 encoded
UTF-16 surrogate pairs.? `R_nchar`, invoked via `base::nchar`,
explicitly validates UTF-8 strings[2], but allows the surrogate:
??? > Encoding('\ud800')
??? [1] "UTF-8"
??? >...
2009 Nov 25
1
Eliminating 'Unprintable ASCII' characters
Hi all,
I have a csv file containing words with *UNPRINTABLE ASCII* characters
(described in the following table).
Are there any viable method in eliminating these characters?
I realise that *EXTENDED ASCII* characters (i.e , ¡, ¢, £, ¤ etc) can be
removed or replaced via *"gsub"* or *"gregexpr"* functions. But am not
certain with the *UNPRINTABLE ASCII* characters.
Your
2008 Apr 15
2
a question of alphabetical order
Hi all,
In Spanish vowels with accent like ?, ?, ... doesn't affect to the
alphabetical order of vector of strings. I mean, a or ? don't matter for
establishing the alphabetical order.
Nevertheless, while working with R order, here is what I get.
Given a file transport.txt
medio#variable
avi?n#34
barco#33
bicicleta#3
?ngulo#37
cami?n#54
coche#23
tren#67
> toPlot <-
2009 Dec 14
0
R 2.10.1 is released
...For ncp less than but close to 80, pchisq() and qchisq() are
more accurate for probablilities very close to 1 (a series
expansion was truncated slightly too early).
pchisq(x, df, ncp) can no longer return values just larger
than one for large values of ncp.
o intToUtf8() could fail when asked to produce 10Mb or more
strings, something it was never intended to do: unfortunately
Windows crashed R (other OSes reported a lack of resources).
(PR#14068)
o chisq.test() could fail when given argument 'x' or 'y' which...
2009 Dec 14
0
R 2.10.1 is released
...For ncp less than but close to 80, pchisq() and qchisq() are
more accurate for probablilities very close to 1 (a series
expansion was truncated slightly too early).
pchisq(x, df, ncp) can no longer return values just larger
than one for large values of ncp.
o intToUtf8() could fail when asked to produce 10Mb or more
strings, something it was never intended to do: unfortunately
Windows crashed R (other OSes reported a lack of resources).
(PR#14068)
o chisq.test() could fail when given argument 'x' or 'y' which...
2010 Apr 22
2
R2.11.0 - rasterImage() and barplot fill-patterns
...t not 'wget', but also for some
> hard-to-access URLs.
>
> o In Rd, \eqn and \deqn will render in HTML (and convert to text)
> upper- and lower-case Greek letters (entered as \alpha ...),
> \ldots, \dots, \ge and \le.
>
> o utf8ToInt() and intToUtf8() now map NA inputs to NA outputs.
>
> o file() has a new argument 'raw' which may help if it is used
> with something other than a regular file, e.g. a character device.
>
> o New function strtoi(), a wrapper for the C function strtol.
>
> o as.oc...
2008 Oct 20
2
R 2.8.0 is released
...er.
o help.search() now assumes that non-ASCII items are in latin1
if that makes sense (all known examples on CRAN are).
o HoltWinters() and decompose() use a (statistically) more
efficient computation for seasonal fits (they used to waste
one period).
o intToUtf8() and intToBits() now accept numeric vectors,
truncating them to integers.
o is.unsorted() gains an argument 'strictly'. It now works for
classed objects with a >= or > method (as incorrectly
documented earlier).
o library() no longer warns about...
2008 Oct 20
2
R 2.8.0 is released
...er.
o help.search() now assumes that non-ASCII items are in latin1
if that makes sense (all known examples on CRAN are).
o HoltWinters() and decompose() use a (statistically) more
efficient computation for seasonal fits (they used to waste
one period).
o intToUtf8() and intToBits() now accept numeric vectors,
truncating them to integers.
o is.unsorted() gains an argument 'strictly'. It now works for
classed objects with a >= or > method (as incorrectly
documented earlier).
o library() no longer warns about...
2010 Apr 22
0
R 2.11.0 is released
...which have 'curl' but not 'wget', but also for some
hard-to-access URLs.
o In Rd, \eqn and \deqn will render in HTML (and convert to text)
upper- and lower-case Greek letters (entered as \alpha ...),
\ldots, \dots, \ge and \le.
o utf8ToInt() and intToUtf8() now map NA inputs to NA outputs.
o file() has a new argument 'raw' which may help if it is used
with something other than a regular file, e.g. a character device.
o New function strtoi(), a wrapper for the C function strtol.
o as.octmode() and as.hexmode() now...
2010 Apr 22
0
R 2.11.0 is released
...which have 'curl' but not 'wget', but also for some
hard-to-access URLs.
o In Rd, \eqn and \deqn will render in HTML (and convert to text)
upper- and lower-case Greek letters (entered as \alpha ...),
\ldots, \dots, \ge and \le.
o utf8ToInt() and intToUtf8() now map NA inputs to NA outputs.
o file() has a new argument 'raw' which may help if it is used
with something other than a regular file, e.g. a character device.
o New function strtoi(), a wrapper for the C function strtol.
o as.octmode() and as.hexmode() now...