Displaying 20 results from an estimated 7000 matches similar to: "reading and parsing gzipped files"
2010 Jun 22
4
seek() and gzfile() on 32-bit R2.12.0 in linux
I have installed both 32-bit and 64-bit versions of R2.12.0 (2010-06-15
r52300) on my Ubuntu 10.04 64-bit system. I observe the following behavior
when running the examples from base::connections. There appears to be a
problem with seek() on a .gz file when using a 32-bit installation of
R2.12.0, but the problem doesn't appear in the 64-bit installation. I
realize that seek() has been
2011 Sep 23
2
Issue with seek() on gzipped connections in R-devel
Dear all,
In R-devel (2011-09-23 r57050), I'm running into a serious problem
with seek()ing on connections opened with gzfile(). A warning is
generated and the file position does not seek to the requested
location. It doesn't seem to occur all the time - I tried to create a
small example file to illustrate it, but the problem didn't occur.
However, it can be seen with a file I use for
2015 Nov 06
2
corrupt PACKAGES.gz?
Is it just me, or did a corrupt PACKAGES.gz file get installed in the
bin/windows/contrib/3.2 directory of CRAN mirrors recently? gzfile()
complains about it and Cygwin's gzip cannot decompress it. I tried the
following
repos <- "https://cran.rstudio.com"
v <- "3.2"
pkgs.gz <- paste(sep="/", repos, "bin/windows/contrib", v,
2009 Feb 08
1
compressing data without writing output to file
This might seem like a strange question but is there any way to compress an
R object (such as a matrix) and know its resulting size in bytes ?
Clearly, I could implement this in the following way (if x is my matrix):
zz <- gzfile(fname,"w");
write.table(x,zz);
close(zz);
file.info(fname)[,"size"];
However, I need to do this for hundreds of thousands
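For the question above, a minimal sketch of measuring a compressed size entirely in memory, assuming a current R version where `serialize()` and `memCompress()` are available in base R. The matrix `x` and the variable names are hypothetical. Note that this compresses the serialized binary form of the object rather than the text that `write.table()` would produce, so the byte count will not match the size of a file written through `gzfile()`.

```r
# Hypothetical example matrix; any R object can be handled the same way.
x <- matrix(rnorm(1000), nrow = 100)

# serialize() with connection = NULL returns a raw vector instead of writing a file.
raw_bytes <- serialize(x, connection = NULL)

# memCompress() compresses the raw vector in memory; nothing touches the disk.
gz_bytes <- memCompress(raw_bytes, type = "gzip")

# Size of the compressed representation, in bytes.
compressed_size <- length(gz_bytes)
```

If the text representation matters, the same idea should work by compressing the captured output of `write.table()` instead of the serialized object.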
2010 Jun 06
1
R CMD in 2.11.1 error
Hi,
I have a set of user-defined packages that I have had installed in 2.8.1 for quite some time. I installed 2.11.0 and 2.11.1 a couple of days ago while still keeping the 2.8.1 version on my system. For the packages, package.skeleton works fine in R 2.11. However, a problem occurs when I run Rcmd check/install. The install.out is shown below.
I further tested that I reran R-2.8.1/bin/R CMD check in
2011 Jan 21
1
Reading gz compressed csv file - 'incomplete line found'
Hi all,
I am trying to download, decompress and read a csv file. My code:
myurl <-
"ftp://ftp.ncbi.nih.gov/pub/geo/DATA/supplementary/series/GSE24729/GSE24729_MitoNuclear_suppl_male_stats.csv.gz"
#
myfile <- "GSE24729_MitoNuclear_suppl_male_stats.csv.gz"
#
download.file(myurl, destfile=myfile, mode="w")
#
mycon <- gzcon(gzfile(myfile,
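As a rough illustration of the reading step only, here is a minimal sketch that builds a small gzipped CSV locally and reads it back through a single `gzfile()` connection. The path and data are hypothetical stand-ins, not the GEO file from the post, and this is not a diagnosis of the error in the title. Separately, the `download.file()` documentation describes `mode = "wb"` as the binary mode, which matters on Windows for files such as `.gz`.

```r
# Build a small gzipped CSV locally so the sketch runs without network access
# (hypothetical path and data, not the file from the post).
path <- tempfile(fileext = ".csv.gz")
con <- gzfile(path, "w")
write.csv(data.frame(id = 1:3, value = c(0.5, 1.5, 2.5)), con, row.names = FALSE)
close(con)

# read.csv() opens the unopened gzfile() connection itself and closes it when done,
# so no extra gzcon() wrapper is used here.
dat <- read.csv(gzfile(path))
```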
2005 Aug 25
1
how to close connections?
Hi everybody,
I am using gzipped files quite extensively and cannot figure out how I
can close (or reuse) connections.
Example:
> for(i in 1:300) {cat(i, "\n"); a <- readLines(gzfile("file.gz"), n=1) }
1
2
...
46
47
Error in gzfile("~/tyyq/andmebaasiq/ETU/ETU1997.tsv.gz") :
all connections are in use
To use just gzfile without any open/close should
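One common pattern for the loop above is to keep the connection in a variable and close it explicitly, so that each iteration releases its connection before the next one starts. The following is only a sketch under that assumption; the temporary file stands in for the poster's `file.gz`, and the variable names are hypothetical.

```r
# Create a small gzipped file so the sketch is self-contained (hypothetical path).
path <- tempfile(fileext = ".gz")
con <- gzfile(path, "w")
writeLines(c("first line", "second line"), con)
close(con)

first_lines <- character(300)
for (i in 1:300) {
  con <- gzfile(path, "r")                  # open the connection explicitly
  first_lines[i] <- readLines(con, n = 1)   # read one line from it
  close(con)                                # close and destroy it before the next iteration
}
```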
2007 Jul 03
1
bug in closing gzfile-opened connections?
Hi,
I am making multiple calls to gzfile() via read.table(), e.g.
> x <- read.table( gzfile( "xxx.gz" ) )
After I do this many times (I haven't counted, but probably between 50 and
100 times) I get the error message:
Error in open.connection(file, "r") : unable to open connection
In addition: Warning message:
cannot open compressed file 'xxx.gz'
however, I
2002 Dec 05
1
writing to gzfile: segmentation fault (PR#2347)
Full_Name: Vadim Ogranovich
Version: Version 1.6.0 (2002-10-01)
OS: Red Hat 7.1
Submission from: (NULL) (209.99.241.1)
The following sequence of commands crashes my R session. The first weirdness
happens after the second command that appears not to change the "foo.gz" file,
no error generated.
> con <- gzfile("foo.gz", open="w"); cat("goo\n",
2002 Dec 04
2
problem with load('http://....') (PR#2344)
Full_Name: Frank Harrell
Version: 1.6.1
OS: RedHat 8.0 Linux
Submission from: (NULL) (128.143.108.90)
I get an error when trying to load a URL that contains a file that was saved
using save(object, compress=TRUE):
> load('http://hesweb1.med.virginia.edu/biostat/s/data/sav/kprats.sav')
Error in gzfile(file, "rb") : unable to open connection
In addition: Warning message:
2008 Mar 07
2
Problems installing packages using the inbuilt facility: "Error in gzfile(file, "r") : unable to open connection"
Hi
I have been trawling the web, FAQs, and R manuals for help on the following issue, but have failed and was wondering if anyone has a solution to the following problem:
After having installed R 2.6.2 for Windows (binary), I tried to install various packages. Every time I try loading a package (any package) via the built-in menu, I run into the following error message.
>
2013 May 08
1
getting corrupted data when using readBin() after seek() on a gzfile connection
Hi,
I'm running into more issues when reading data from a gzfile connection.
If I read the data sequentially with successive calls to readBin(), the
data I get looks ok. But if I call seek() between the successive calls
to readBin(), I get corrupted data.
Here is a (hopefully) reproducible example. See my sessionInfo() at the
end (I'm not on Windows, where, according to the man page,
2005 Dec 09
1
local source packages install from within R session - cross-platform
I realize that others have struggled with this issue, i.e.
http://tolstoy.newcastle.edu.au/~rking/R/help/05/01/9826.html
I am on OS X 10.4 with R 2.2, and am (perhaps foolishly) also on this quest...
I would like to be able to install downloaded source (tar.gz'd) files from
within an R session, and have it work in a cross-platform way. I am often not
connected to the internet and have libraries
2018 May 10
2
readLines() behaves differently for gzfile connection
When I read a .gz file with readLines() in 3.4.3, it returns text (and a
warning). In 3.5.0, it gives a warning, but no text. Is this expected
behavior or a bug?
3.4.3:
> source_file = "1k_annotation.gz"
> readfile_con <- gzfile(source_file, "r")
> readLines(readfile_con, n = 5)
[1] "#chr\tpos\tref\talt\t
<truncated output here>
Warning message:
In
2009 Oct 27
2
Why I get this error? Error in close.connection(f) : invalid connection
I don't understand why I cannot close 'f'. This may be very simple,
but I don't see why. Could somebody let me know?
$ cat gzfile.csv
"","V1","V2","V3","V4","V5"
"1",1,5,9,13,17
"2",2,6,10,14,18
"3",3,7,11,15,19
"4",4,8,12,16,20
$ Rscript gzfile.R
> f =
2002 Dec 02
1
readLines() changes mode of connection
Hi,
It seems like reading a line from a gzfile() connection changes the mode of
the connection from text to binary (it also alters "can write", in case it
matters). The following transcript, produced on RedHat 7.1, demonstrates
this "feature" (note the evolution of file$text). Is this expected?
Thanks, Vadim
> file <- gzfile("foo.gz")
file <-
2004 Mar 15
1
gzfile & read.table on Win32
Hello ...
Are there any known problems or even gotchas to look out for when using a
gzfile connection in read.csv/read.table in Windows?
In the package PROcess, available at
www.bioconductor.org/repository/devel/package/html/PROcess.html
there are two files in the PROcess/inst/Test directory which are of the
extension *.csv.gz.
With both files, if I open up a gzfile connection, say:
vv <-
2007 Dec 19
1
unexpected behavior from gzfile and unz
I get unexpected behavior from "readLines()" and
"scan()" depending on how the file is opened with
"gzfile" or "unz". More specifically:
> file <- gzfile("file.gz")
> readLines(file,1)
[1] "a\tb\tc"
> readLines(file,1)
[1] "a\tb\tc"
> close(file)
It seems that the stream is rewound between calls to
readLines.
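The transcript above passes a connection that was created but never opened. The `readLines()` documentation says that such a connection is opened for the duration of each call and then closed, which would be consistent with every call starting from the first line. A minimal sketch, assuming that reading, opens the connection once so the position carries over between calls; the file contents and names are hypothetical.

```r
# Create a two-line gzipped file for the sketch (hypothetical contents).
path <- tempfile(fileext = ".gz")
con <- gzfile(path, "w")
writeLines(c("a\tb\tc", "1\t2\t3"), con)
close(con)

# Open the connection up front; successive reads then continue from the
# current position instead of starting over.
con <- gzfile(path, "r")
line1 <- readLines(con, 1)
line2 <- readLines(con, 1)
close(con)
```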
2018 May 10
1
readLines() behaves differently for gzfile connection
You bet - it's available on github at
https://github.com/UW-GAC/wgsaparsr/blob/master/tests/testthat/1k_annotation.gz
-Ben
On Thu, May 10, 2018 at 4:17 PM, Michael Lawrence <lawrence.michael at gene.com
> wrote:
> Would it be possible to get that file or a representative subset of it
> somewhere so that I can reproduce this?
>
> Thanks,
> Michael
>
> On Thu, May
2011 Oct 15
2
gctorture() and gzfile() don't get along.
Found the simplest way of seeing a bug I encountered doing "R CMD check --use-gct": Just launch R (with --vanilla), and do this:
> ?gctorture
# this works
> gctorture()
> ?gctorture
Error in gzfile(file, "rb") :
can only weakly reference/finalize reference objects
# this does not
It seems that when gctorture() is on, gzfile() doesn't work.