similar to: reading and parsing gzipped files

Displaying 20 results from an estimated 7000 matches similar to: "reading and parsing gzipped files"

2010 Jun 22
4
seek() and gzfile() on 32-bit R2.12.0 in linux
I have installed both 32-bit and 64-bit versions of R2.12.0 (2010-06-15 r52300) on my Ubuntu 10.04 64-bit system. I observe the following behavior when running the examples from base::connections. There appears to be a problem with seek() on a .gz file when using a 32-bit installation of R2.12.0, but the problem doesn't appear in the 64-bit installation. I realize that seek() has been
2011 Sep 23
2
Issue with seek() on gzipped connections in R-devel
Dear all, In R-devel (2011-09-23 r57050), I'm running into a serious problem with seek()ing on connections opened with gzfile(). A warning is generated and the file position does not seek to the requested location. It doesn't seem to occur all the time - I tried to create a small example file to illustrate it, but the problem didn't occur. However, it can be seen with a file I use for
2015 Nov 06
2
corrupt PACKAGES.gz?
Is it just me, or did a corrupt PACKAGES.gz file get installed in the bin/windows/contrib/3.2 directory of CRAN mirrors recently? gzfile() complains about it and Cygwin's gzip cannot decompress it. I tried the following repos <- "https://cran.rstudio.com" v <- "3.2" pkgs.gz <- paste(sep="/", repos, "bin/windows/contrib", v,
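A quick first check for a suspect file like this is whether it starts with the gzip magic bytes (0x1f 0x8b). The sketch below is not the poster's code; `is_gzip` and the sample files are hypothetical names introduced for illustration, and the check only inspects the header, so a truncated or otherwise damaged archive can still pass it.

```r
# Hypothetical helper: TRUE if the file begins with the gzip magic bytes.
is_gzip <- function(path) {
  magic <- readBin(path, what = "raw", n = 2L)   # first two bytes of the file
  identical(magic, as.raw(c(0x1f, 0x8b)))
}

# Sample files (assumed data): one gzipped, one plain text.
gz_path <- tempfile(fileext = ".gz")
con <- gzfile(gz_path, "w")
writeLines("Package: demo", con)
close(con)

plain_path <- tempfile()
writeLines("Package: demo", plain_path)

gz_ok    <- is_gzip(gz_path)
plain_ok <- is_gzip(plain_path)
```

Because gzfile() can also read uncompressed files, a header check like this helps distinguish "not gzipped at all" from "gzipped but damaged".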
2009 Feb 08
1
compressing data without writing output to file
This might seem like a strange question but is there any way to compress an R object (such as a matrix) and know its resulting size in bytes ? Clearly, I could implement this in the following way (if x is my matrix): zz <- gzfile(fname,"w"); write.table(x,zz); close(zz); file.info(fname)[,"size"]; However, I need to do this for hundreds of thousands
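One way to get a compressed size without touching the disk is to compress a raw vector in memory with memCompress(). This is a minimal sketch, not the approach from the thread: the matrix `x` is made-up data, and the sizes are for the serialized and text forms of the object, which may differ slightly from the size of a .gz file on disk because a file carries its own header.

```r
# Made-up example matrix; any R object would do.
x <- matrix(rep(1:10, 100), nrow = 100)

# Serialize to a raw vector in memory, then compress that vector.
raw_bytes  <- serialize(x, connection = NULL)
compressed <- memCompress(raw_bytes, type = "gzip")

size_serialized <- length(raw_bytes)
size_compressed <- length(compressed)

# To measure the text that write.table() would produce instead,
# capture its output and compress those bytes.
text_bytes <- charToRaw(paste(capture.output(write.table(x)), collapse = "\n"))
size_text_compressed <- length(memCompress(text_bytes, type = "gzip"))

# Round trip to confirm nothing was lost.
restored <- unserialize(memDecompress(compressed, type = "gzip"))
```

Since nothing is written to disk, this can be called in a loop over many objects without creating temporary files.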
2010 Jun 06
1
R CMD in 2.11.1 error
Hi, I have a set of user-defined packages which I have had installed in 2.8.1 for quite some time. I installed 2.11.0 and 2.11.1 a couple of days ago while still having the 2.8.1 version on my system. For the package, package.skeleton works fine in R 2.11. However, a problem arises when I run Rcmd check/install. The install.out is shown below. I further tested that I reran R-2.8.1/bin/R CMD check in
2011 Jan 21
1
Reading gz compressed csv file - 'incomplete line found'
Hi all, I am trying to download, decompress and read a csv file. My code: myurl <- "ftp://ftp.ncbi.nih.gov/pub/geo/DATA/supplementary/series/GSE24729/GSE24729_MitoNuclear_suppl_male_stats.csv.gz" # myfile <- "GSE24729_MitoNuclear_suppl_male_stats.csv.gz" # download.file(myurl, destfile=myfile, mode="w") # mycon <- gzcon(gzfile(myfile,
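For comparison, here is a minimal round trip that writes a gzipped CSV and reads it back through a single gzfile() connection. It is a sketch on made-up local data, not a diagnosis of the post above; the data frame and temporary path are assumptions. For a real download, one thing worth ruling out is the transfer mode, since download.file() should use mode = "wb" for binary files such as .gz archives.

```r
# Made-up data written to a temporary gzipped CSV.
df <- data.frame(id = 1:3, value = c(0.5, 1.5, 2.5))
path <- tempfile(fileext = ".csv.gz")

con <- gzfile(path, "w")
write.csv(df, con, row.names = FALSE)
close(con)

# For a remote file, fetch it in binary mode first, e.g.
# download.file(myurl, destfile = myfile, mode = "wb")

# A single gzfile() wrapper is enough; no extra gzcon() layer is needed.
back <- read.csv(gzfile(path))
```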
2005 Aug 25
1
how to close connections?
Hi everybody, I am using gzipped files quite extensively and cannot figure out how I can close (or reuse) connections. Example: > for(i in 1:300) {cat(i, "\n"); a <- readLines(gzfile("file.gz"), n=1) } 1 2 ... 46 47 Error in gzfile("~/tyyq/andmebaasiq/ETU/ETU1997.tsv.gz") : all connections are in use To use just gzfile without any open/close should
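A common way to avoid running out of connections in a loop like this is to keep a handle on each connection and close it explicitly. The sketch below assumes a small sample file created on the spot; `read_head` is a hypothetical helper name, not something from the thread.

```r
# Create a small gzipped file to read from (made-up sample data).
path <- tempfile(fileext = ".gz")
out <- gzfile(path, "w")
writeLines(c("first line", "second line", "third line"), out)
close(out)

# Hypothetical helper: read the first n lines and always release the connection.
read_head <- function(path, n = 1) {
  con <- gzfile(path, "r")   # open explicitly so we own the connection
  on.exit(close(con))        # closed even if readLines() fails
  readLines(con, n = n)
}

# Many iterations are fine because each connection is closed before the next opens.
for (i in 1:300) {
  first <- read_head(path)
}
```

Using on.exit() inside the helper keeps the cleanup next to the open call, so an error in the middle of the loop does not leak a connection.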
2007 Jul 03
1
bug in closing gzfile-opened connections?
Hi, I am making multiple calls to gzfile() via read.table(), e.g. > x <- read.table( gzfile( "xxx.gz" ) ) After i do this many times (I haven't counted, but probably between 50 and 100 times) I get the error message: Error in open.connection(file, "r") : unable to open connection In addition: Warning message: cannot open compressed file 'xxx.gz' however, I
2002 Dec 05
1
writing to gzfile: segmentation fault (PR#2347)
Full_Name: Vadim Ogranovich Version: Version 1.6.0 (2002-10-01) OS: Red Hat 7.1 Submission from: (NULL) (209.99.241.1) The following sequence of commands crashes my R session. The first weirdness happens after the second command that appears not to change the "foo.gz" file, no error generated. > con <- gzfile("foo.gz", open="w"); cat("goo\n",
2002 Dec 04
2
problem with load('http://....') (PR#2344)
Full_Name: Frank Harrell Version: 1.6.1 OS: RedHat 8.0 Linux Submission from: (NULL) (128.143.108.90) I get an error when trying to load a URL that contains a file that was saved using save(object, compress=TRUE): > load('http://hesweb1.med.virginia.edu/biostat/s/data/sav/kprats.sav') Error in gzfile(file, "rb") : unable to open connection In addition: Warning message:
2008 Mar 07
2
Problems installing packages using the inbuilt facility: "Error in gzfile(file, "r") : unable to open connection"
Hi I have been trawling the web, FAQs, and R manuals for help on the following issue, but have failed and was wondering if anyone has a solution to the following problem: After having installed R 2.6.2 for Windows (binary), I tried to install various packages. Every time I try loading a package (any package) via the built-in menu, I run into the following error message. >
2013 May 08
1
getting corrupted data when using readBin() after seek() on a gzfile connection
Hi, I'm running into more issues when reading data from a gzfile connection. If I read the data sequentially with successive calls to readBin(), the data I get looks ok. But if I call seek() between the successive calls to readBin(), I get corrupted data. Here is a (hopefully) reproducible example. See my sessionInfo() at the end (I'm not on Windows, where, according to the man page,
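Where seek() on a gzfile connection cannot be trusted, one workaround is to move forward by reading and discarding bytes. This is a sketch under stated assumptions rather than a fix for the reported bug: `skip_bytes` is a hypothetical helper, the test file is made-up data, and the approach only moves forward (going backward means reopening the connection).

```r
# Made-up binary data: integers 1..1000, 4 bytes each, gzip-compressed.
path <- tempfile(fileext = ".gz")
out <- gzfile(path, "wb")
writeBin(1:1000, out, size = 4L)
close(out)

# Hypothetical helper: advance n bytes by reading and discarding, not seek().
skip_bytes <- function(con, n, chunk = 65536L) {
  while (n > 0) {
    got <- length(readBin(con, "raw", n = min(n, chunk)))
    if (got == 0L) stop("reached end of file before skipping all bytes")
    n <- n - got
  }
  invisible(NULL)
}

con <- gzfile(path, "rb")
skip_bytes(con, 4L * 499L)                        # skip the first 499 integers
value <- readBin(con, "integer", n = 1L, size = 4L)
close(con)
```

Reading in chunks keeps memory use bounded even when the skipped region is large.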
2005 Dec 09
1
local source packages install from within R session - cross-platform
I realize that others have struggled with this issue...i.e. http://tolstoy.newcastle.edu.au/~rking/R/help/05/01/9826.html i am on os.x 10.4 w/ R2.2, and am (perhaps foolishly) also on this quest... i would like to be able to install downloaded source (tar.gz'd) files from within an R session, and have it work in a X-platform way..i am often not connected to the internet and have libraries
2018 May 10
2
readLines() behaves differently for gzfile connection
When I read a .gz file with readLines() in 3.4.3, it returns text (and a warning). In 3.5.0, it gives a warning, but no text. Is this expected behavior or a bug? 3.4.3: > source_file = "1k_annotation.gz" > readfile_con <- gzfile(source_file, "r") > readLines(readfile_con, n = 5) [1] "#chr\tpos\tref\talt\t <truncated output here> Warning message: In
2009 Oct 27
2
Why do I get this error? Error in close.connection(f) : invalid connection
I don't understand why I can not close 'f'. This may be very simple, but I don't see why. Could somebody let me know? $ cat gzfile.csv "","V1","V2","V3","V4","V5" "1",1,5,9,13,17 "2",2,6,10,14,18 "3",3,7,11,15,19 "4",4,8,12,16,20 $ Rscript gzfile.R > f =
2002 Dec 02
1
readLines() changes mode of connection
Hi, It seems like reading a line from a gzfile() connection changes the mode of the connection from text to binary (it also alters "can write", in case it matters). The following transcript, produced on RedHat 7.1, demonstrates this "feature" (note the evolution of file$text). Is this expected? Thanks, Vadim > file <- gzfile("foo.gz") file <-
2004 Mar 15
1
gzfile & read.table on Win32
Hello ... Are there any known problems or even gotchas to look out for when using a gzfile connection in read.csv/read.table in Windows? In the package PROcess, available at www.bioconductor.org/repository/devel/package/html/PROcess.html there are two files in the PROcess/inst/Test directory which are of the extension *.csv.gz. With both files, if I open up a gzfile connection, say: vv <-
2007 Dec 19
1
unexpected behavior from gzfile and unz
I get unexpected behavior from "readLines()" and "scan()" depending on how the file is opened with "gzfile" or "unz". More specifically: > file <- gzfile("file.gz") > readLines(file,1) [1] "a\tb\tc" > readLines(file,1) [1] "a\tb\tc" > close(file) It seems that the stream is rewound between calls to readLines.
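The rewinding follows from how readLines() treats a connection that is not yet open: it opens it for the duration of the call and closes it again. Opening the connection explicitly keeps the read position between calls. A minimal sketch, assuming a small sample file written here for the purpose:

```r
# Made-up two-line, tab-separated sample file.
path <- tempfile(fileext = ".gz")
out <- gzfile(path, "w")
writeLines(c("a\tb\tc", "1\t2\t3"), out)
close(out)

con <- gzfile(path)
open(con, "r")               # open once; the position now persists across calls
line1 <- readLines(con, 1)   # first line
line2 <- readLines(con, 1)   # second line, not the first again
close(con)
```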
2018 May 10
1
readLines() behaves differently for gzfile connection
You bet - it's available on github at https://github.com/UW-GAC/wgsaparsr/blob/master/tests/testthat/1k_annotation.gz -Ben On Thu, May 10, 2018 at 4:17 PM, Michael Lawrence <lawrence.michael at gene.com > wrote: > Would it be possible to get that file or a representative subset of it > somewhere so that I can reproduce this? > > Thanks, > Michael > > On Thu, May
2011 Oct 15
2
gctorture() and gzfile() don't get along.
Found the simplest way of seeing a bug I encountered doing "R CMD check --use-gct": Just launch R (with --vanilla), and do this: > ?gctorture # this works > gctorture() > ?gctorture Error in gzfile(file, "rb") : can only weakly reference/finalize reference objects # this does not It seems that when gctorture() is on, gzfile() doesn't work.