similar to: unexpected behavior from gzfile and unz

Displaying 20 results from an estimated 3000 matches similar to: "unexpected behavior from gzfile and unz"

2005 Aug 24
1
How to get the list of the files when you read zip file by gzfile or unz
Dear List, Does somebody know how to get the filename lists in the ziped when "gzfile" or "unz" command is used ? Thanks for your help. Shawn Lee -- The information contained in this communication and any atta...{{dropped}}
2024 Oct 25
1
readLines() and unz() and non-empty final line
Hi again, The unz connection is non-blocking by default. I checked do_unz which calls R_newunz which calls init_con and the only place in any of those functions that sets 'blocking' is init_con which sets it to FALSE: https://github.com/wch/r-source/blob/0c26529e807a9b1dd65f7324958c17bf72e1de1a/src/main/connections.c#L713 I'll open an issue on R-bugzilla and see if they're
2003 Jul 24
0
unz( "x.zip", "y.csv" ) != pipe( "unzip -p x.zip y.csv" )
Not sure this is a bug in R. Maybe its a bug in my understanding of unz(). The character 'b2' (hexadecimal) is in position 535 of line 1 of 'naughty.csv'. This character appears as superscript '2' and came to me in an EXCEL file that I converted to text in a comma separated ( *.csv ) format. The first line gets truncated by readLines after 534 characters using unz():
2003 Jul 11
2
unz()
I am having problems getting the unz() function to work as a connection to start reading a file... z <- unz("c:/temp/stoxx.zip", "close_tmi_components.txt", "r") readLines(z,2) yields the following problems: > z <- unz("c:/temp/stoxx.zip", "close_tmi_components.txt", "r") Error in unz("c:/temp/stoxx.zip",
2024 Oct 25
1
readLines() and unz() and non-empty final line
Hi, you could use "scan" instead, it seems to work fine also when wrapped around "unz". Or, alternatively, you could use "unzip" instead of "unz". It works as expected, i.e. reads the last incomplete line and throws a warning about this. So it seems to me that "unz" creates a non-blocking connection, whereas "unzip" creates a
2024 Oct 24
2
readLines() and unz() and non-empty final line
Dear list, I'm seeing a strange interaction with readLines() and unz() when reading a file without an empty final line. The final line gets dropped silently: > cat("hello", file = "hello.txt") > zip("hello.zip", "hello.txt") adding: hello.txt (stored 0%) > readLines(unz("hello.zip", "hello.txt")) character(0) The
2009 Jul 28
2
Looking for example of usage of function unz
I would greatly appreciate some example of correct usage of function unz. I have to download and uncompress the following web compressef file: ftp://ftp.sanger.ac.uk/pub/mirbase/targets/v5/arch.v5.txt.homo_sapiens.zip I tried the following command that does not work: Targets.rec <- readLines(zz <- unz("ftp://ftp.sanger.ac.uk/pub/mirbase/targets/v5/arch.v5.txt.homo_sapiens.zip"))
2008 Oct 09
1
Reading zipped data directly from an FTP url
Hi Sorry, I am clearly missing something here. I want to read this file directly: ftp://ftp.sanger.ac.uk/pub/mirbase/targets/v5/arch.v5.txt.gallus_gallus. zip I tried using read.table(gzfile("ftp://ftp.sanger.ac.uk/pub/mirbase/targets/v5/arch.v5 .txt.gallus_gallus.zip")) But I got an error: Error in open.connection(file, "r") : cannot open the connection In addition:
2024 Oct 24
3
readLines() and unz() and non-empty final line
But note: > zip("hello.zip", "hello.txt") updating: hello.txt (stored 0%) > readChar(unz("hello.zip","hello.txt"),100) [1] "hello" I leave it to you and other wiser heads to figure out. Cheers, Bert On Thu, Oct 24, 2024 at 8:57?AM Iris Simmons <ikwsimmo at gmail.com> wrote: > Hi Mikko, > > > I tried running a few
2018 May 10
2
readLines() behaves differently for gzfile connection
When I read a .gz file with readLines() in 3.4.3, it returns text (and a warning). In 3.5.0, it gives a warning, but no text. Is this expected behavior or a bug? 3.4.3: > source_file = "1k_annotation.gz" > readfile_con <- gzfile(source_file, "r") > readLines(readfile_con, n = 5) [1] "#chr\tpos\tref\talt\t <truncated output here> Warning message: In
2008 Apr 20
1
How to read last (incomplete) line from gzfile()?
Hi, I have some text files that do not have trailing \n on the last (incomplete) line; how do I read in the last line? e.g. here is a test case: [linux + R version 2.6.1 (2007-11-26) ] z <- gzfile("short.gz", open="w") cat("7\n5\n3", file=z) close(z) z <- gzfile('short.gz') readLines(z) [1] "7" "5" readLines would indicate that
2018 May 10
1
readLines() behaves differently for gzfile connection
You bet - it's available on github at https://github.com/UW-GAC/wgsaparsr/blob/master/tests/testthat/1k_annotation.gz -Ben On Thu, May 10, 2018 at 4:17 PM, Michael Lawrence <lawrence.michael at gene.com > wrote: > Would it be possible to get that file or a representative subset of it > somewhere so that I can reproduce this? > > Thanks, > Michael > > On Thu, May
2002 Dec 05
1
writing to gzfile: segmentation fault (PR#2347)
Full_Name: Vadim Ogranovich Version: Version 1.6.0 (2002-10-01) OS: Red Hat 7.1 Submission from: (NULL) (209.99.241.1) The following sequence of commands crashes my R session. The first weirdness happens after the second command that appears not to change the "foo.gz" file, no error generated. > con <- gzfile("foo.gz", open="w"); cat("goo\n",
2004 Mar 15
1
gzfile & read.table on Win32
Hello ... Are there any known problems or even gotchas to look out for when using a gzfile connection in read.csv/read.table in Windows? In the package PROcess, available at www.bioconductor.org/repository/devel/package/html/PROcess.html there are two files in the PROcess/inst/Test directory which are of the extension *.csv.gz. With both files, if I open up a gzfile connection, say: vv <-
2006 Nov 14
2
gzfile with multiple entries in the archive
If I open a tgz archive with gzfile and then parse it using readLines I miss the initial line of each member of the archive - and also the name of the file although the archive otherwise complete (but useless!). Is there any way within R to extract both the list of files in a tgz archive and to extract any one of these files? Clearly I can use zcat and tar on Linux, but I need this to work
2007 Jul 03
1
bug in closing gzfile-opened connections?
Hi, I am making multiple calls to gzfile() via read.table(), e.g. > x <- read.table( gzfile( "xxx.gz" ) ) After i do this many times (I haven't counted, but probably between 50 and 100 times) I get the error message: Error in open.connection(file, "r") : unable to open connection In addition: Warning message: cannot open compressed file 'xxx.gz' however, I
2013 May 08
1
getting corrupted data when using readBin() after seek() on a gzfile connection
Hi, I'm running into more issues when reading data from a gzfile connection. If I read the data sequentially with successive calls to readBin(), the data I get looks ok. But if I call seek() between the successive calls to readBin(), I get corrupted data. Here is a (hopefully) reproducible example. See my sessionInfo() at the end (I'm not on Windows, where, according to the man page,
2011 Oct 15
2
gctorture() and gzfile() doesn't get along.
Found the simpliest way of seeing I bug I encountered doing "R CMD check --use-gct": Just launch R (with --vanilla), and do this: > ?gctorture # this work > gctorture() > ?gctorture Error in gzfile(file, "rb") : can only weakly reference/finalize reference objects # this does not It seems that when gctorture() is on gzfile() doesn't work.
2012 Jan 31
2
how to unzip files in R
Hi, I have downloaded a bunch of bz2 files. I wonder if R will be able to unzip them in a batch mode or even one at a time? I was looking at the unz function. But it didn't work well. Say I have a bz2 file in H:/Temp/65502805_532.pair.bz2. Anyone has any suggestion? thanks John > setwd("H:\\Temp\\") > getwd() [1] "H:/Temp" >
2010 Sep 20
0
unz() ignores encoding argument
Hi! I'm trying to read individual files from a ZIP archive, using the unz() function. Some of the files contain non-ASCII characters and I'd like to avoid unpacking them in a temporary directory. My problem is that unz() seems to ignore the encoding="latin1" option I need to read the non-ASCII characters properly. I can't find a clear indication in the documentation that