thr3ads.net - similar to: "Error in readChar(): invalid UTF-8 input"

Displaying 20 results from an estimated 8000 matches similar to: "Error in readChar(): invalid UTF-8 input"

2002 Apr 26

Memory "leak" in readChar (PR#1483)

Full_Name: Hugh C. Pumphrey Version: 1.4.1 OS: Linux (Debian Woody) Submission from: (NULL) (129.215.133.170) The function readChar() appears to have some type of problem with memory allocation. I don't know if "memory leak" is the correct term but if one uses readChar() many times, the R binary grows in size until it eats all your memory and swap space. The code enclosed below

[External] readChar() could read the whole file by default?

2024 Jan 26

[External] readChar() could read the whole file by default?

On Fri, 26 Jan 2024, Michael Chirico wrote: > I am curious why readLines() has a default (n=-1L) to read the full > file while readChar() has no default for nchars= (i.e., readChar(file) > is an error). Is there a technical reason for this? > > I often[1] see code like paste(readLines(f), collapse="\n") which > would be better served by readChar(), especially given

[External] readChar() could read the whole file by default?

2024 Jan 29

[External] readChar() could read the whole file by default?

My opinion is that the proposed feature would be greatly appreciated by users. I had always wondered if I was the only one doing paste(readLines(f), collapse="\n") all the time. It would be great to have the proposed, more straightforward way to read the whole file as a string: readChar("my_file.txt", -1) or even better readChar("my_file.txt") Thanks for your detailed

readChar() could read the whole file by default?

2024 Jan 26

readChar() could read the whole file by default?

I am curious why readLines() has a default (n=-1L) to read the full file while readChar() has no default for nchars= (i.e., readChar(file) is an error). Is there a technical reason for this? I often[1] see code like paste(readLines(f), collapse="\n") which would be better served by readChar(), especially given issues with the global string cache I've come across[2]. But lacking the

reading VERY large binary files

2006 Nov 07

reading VERY large binary files

Hello, I am trying to read in elements out of a very large binary file ... the total file is 4 gigs. I want to select rows out of the file, and the current procedure I run works but is prohibitively slow (takes more than a day to run and still won't complete). Is there any faster way to accomplish this? My current procedure looks like this: readHH <- function(file_name,

Typo fix for readBin.Rd

2006 Jun 02

Typo fix for readBin.Rd

Hi, The man page for readBin has a small typo: --- a/src/library/base/man/readBin.Rd +++ b/src/library/base/man/readBin.Rd @@ -58,7 +58,7 @@ writeBin(object, con, size = NA, endian \code{readBin} and \code{writeBin} read and write C-style zero-terminated character strings. Input strings are limited to 10000 - characters. \code{\link{readChar}} and \code{\code{writeChar}} +

Importing binary data

2004 Jun 01

Importing binary data

Hi everybody! I've a large dataset, about 2 Mio entries of the format which I would like to import into a frame: <integer><integer><float><string><float><string><string> Because to the huge data amount I've choosen a binary format instead of a text format when exporting from Matlab. My import function is attached below. It works fine for only

Clarification for readChar man page

2007 Jun 14

Clarification for readChar man page

Hi, Here's a patch to the readChar manual page (R-trunk as of today) that better clarifies readChar's return value. It could use some work as I'd also like to add some text about using nchar() to find the length of the string that readchar() returns, but I'm unsure which of type="bytes" or type="chars" to mention. Is it type="chars"? Index:

Buffering in R 3.5 connections causes incorrect data in readChar

2018 May 26

Buffering in R 3.5 connections causes incorrect data in readChar

I noticed an issue where readChar does not return the correct value after a call to readline. It appears that readChar is not aware of the buffering, so it reads from the end of the buffer, rather than the current position in the file. This is a significant change of behavior from R-3.4.4. Below is a test case that I used to home in on the problem. --- p<-"test2.txt"

Buffering in R 3.5 connections causes incorrect data in readChar

2018 May 29

Buffering in R 3.5 connections causes incorrect data in readChar

Tomas, Thank you for the explanation. I see in the documentation: "These functions are intended to be used with binary-mode connections." So I see how using it on a text connection is undefined, and not a bug. An error or warning when attempting to use a it on a text connection would be helpful considering how the behavior has changed in R-3.5. On Tue, May 29, 2018 at 3:09 AM, Tomas

handling nulls while reading mainframe file

2010 May 05

handling nulls while reading mainframe file

Hi, I am trying to read in records from an old main frame file. Each record is 640 bytes. I had some success using the following statement: iconv(readChar(con, nchars=640, useBytes = TRUE),from="IBM037",to="",sub="#") What I have found, however, is that ocassionally a record will contain a Null halfway through, which causes readChar() not to read the rest of the

Wavelets correlation test

2003 Feb 25

Wavelets correlation test

Hello, I use wavethresh packages to perform wavelet analysis. In particular, I would like to compare 2 signals (vectors) after a wavelet decomposition. I would like to use cor.test function, but this function acts on the entire vector values. I plan to perform a cor.test on each level of the wavelet decomposition, say N. So I will have at the end of a first step N results of cor.test. How can

Help with readBin

2012 May 03

Help with readBin

I'm trying to read a binary file created by a fortran code using readBin and readChar. Everything reads fine (integers and strings) except for double precision numbers, they are read as huge or very small number (1E-250,...). I tried various endianness, swap, But nothing has worked so far. I also tried on R 64 bit for linux and windows (R 2.14) and R 2.11 on windows XP 32 bit. Any help would

slightly speeding up readChar()

2011 Aug 04

slightly speeding up readChar()

Hi, I was trying to have R read files faster with readChar(). That was before I noticed that readChar() is not that bad! In any case, below I suggest a few simple changes that will make readChar slightly faster. I followed readChar(useBytes=T), and tried to identify all O(N) operations, where N is the size of the file. The assumption is that for LARGE files we want to avoid any O(N) operations,

scan() vs readChar() speed

2012 Apr 01

scan() vs readChar() speed

Dear list, I am trying to find a fast solution to read moderately large (1 -- 10 million entries) text files containing only tab-delimited numeric values. My test file is the following, nr <- 1000 nc <- 5000 m <- matrix(round(rnorm(nr*nc),3),nr=nr) write.table(m, file = "a.txt", append=FALSE, row.names = FALSE, col.names = FALSE) scan() is faster than

ESRI shape file import and time-space models

2003 Feb 13

ESRI shape file import and time-space models

Dear R user, I am running R under Windows 2000. I am looking for a routine for importing - shape files (ESRI) into R - dbase files (FOXPRO) into R and I am looking for time-space models for description and prediction of Bernoulli-, Binomial- and Poissonvaraibles. Thank's a lot for a reply. Sincerely yours, Ekkehardt Altpeter Swiss Federal Office of Public

Possible bugs in 'seek' and 'readBin'

2006 Oct 21

Possible bugs in 'seek' and 'readBin'

I found that seek(..., origin = 'current', ...) and readBin(..., what = 'integer', ...) or 'int' do not work correctly. Did anyone have the same experience?

matrix log

2006 Jun 14

matrix log

Dear R users, Has anyone implemented a "matrix log" function in R similar to the function logm() in Matlab? I did a quick R site search and browsed the contributed packages to no avail. The octave function is far too simplistic and fails for the Matlab test matrix. Ideally, the code of Cheng, Higham, and Laub (2001) or something similar could be utilized. Just checking before I

readBin or writeBin adds extra nulls (PR#2333)

2002 Nov 29

readBin or writeBin adds extra nulls (PR#2333)

Full_Name: Ken Yap Version: 1.6.1 OS: Linux (SuSE 8.0) Submission from: (NULL) (129.78.64.5) I'm trying to copy a file using readBin and writeBin. (The reason is to be able to pipe PostScript or PDF output to a socket later, this is just an experiment.) I do: zz <- file("foo.ps", "rb") r <- readBin(zz, character(), 1000000) yy <- file("bar.ps",

Re: [R] File Reading Problem (PR#4043)

2003 Sep 01

Re: [R] File Reading Problem (PR#4043)

On Mon, 1 Sep 2003 14:26:43 -0700, "Jiming Yu" <jimingyu@princeton.edu> wrote: >Dear all, > I am trying to read characters byte by byte(in their ASCII codes) from a >file I was going to suggest using readBin, but there seems to be a bug: > con <- file('c:/test.txt','rb') > readBin(con,'c',15,1) stack imbalance in internal readBin, 9

similar to: Error in readChar(): invalid UTF-8 input