thr3ads.net - similar to: "pipe"

Displaying 20 results from an estimated 40000 matches similar to: "pipe"

2002 Nov 22

Need help with pipe()

Hello. I have an R program that calls gawk (GNU Awk 3.06 for Windows) from within pipe() to preprocess a large file before it is read into a data frame with read.table(). I've recently upgraded from Win98SE to WinXP, and have also upgraded from R1.5.0 to R1.6.1 over the past month or so. This program worked before the upgrade(s), but now fails. I observe the following sort of behavior with

Reading gz compressed csv file - 'incomplete line found'

2011 Jan 21

Reading gz compressed csv file - 'incomplete line found'

Hi all, I am trying to download, decompress and read a csv file. My code: myurl <- "ftp://ftp.ncbi.nih.gov/pub/geo/DATA/supplementary/series/GSE24729/GSE24729_MitoNuclear_suppl_male_stats.csv.gz" # myfile <- "GSE24729_MitoNuclear_suppl_male_stats.csv.gz" # download.file(myurl, destfile=myfile, mode="w") # mycon <- gzcon(gzfile(myfile,

Read every second line from ASCII file

2007 Apr 30

Read every second line from ASCII file

Dear all, I have an ASCII file where records are separated by a blank. I would like to read those data; however, only the data in rows 1, 3, 5, 7, ... are important; the other lines (2,4,6,8,....) contain no useful information for me. So far I used awk/gawk to do it: gawk '{if ((FNR % 2) != 0) {print $0}}' infile.txt > outfile.txt What is the recommended way to accomplish this in R?

Reading large, non-tabular files

2011 Sep 14

Reading large, non-tabular files

Dear R-help, I have a very large ascii data file, of which I only want to read in selected lines (e.g. on fourth of the lines); determining which lines depends on the lines content. So far, I have found two approaches for doing this in R; 1) Read the file line by line using a repeat-loop and save the result in a temporary file or a variable, and 2) Read the entire file and filter/reshape it using

specifying header in sqldf() and RSQLite

2010 Apr 07

specifying header in sqldf() and RSQLite

I have a file that I'm planning on manipulating with sqldf(). The header for this file is malformed, and I was wondering if there's a way to specify the values of the header ahead of time. I can see there are T/F options for header in sqldf(... list(header=T, ...), but was wondering if there's a way to explicitly stated what the header should be. Thanks, Andrew [[alternative HTML

comma delimiter & comma in text

2006 Nov 29

comma delimiter & comma in text

Hi I have data like 1, A, 24, The Red House 2, A, 25, King's Home, by the Sea ... I'd like to read this in as three variables. I first tried temp <-read.csv(addresses, sep = "," ) it worked but line 2 was broken after King's Home, and by the Sea as placed in another line. and so i eneded up with more number of rows than in the data. when i tried temp

R in BATCH mode

2003 May 20

R in BATCH mode

Dear all, In R<=1.6.2 I usually used the following code (in DOS prompt) to run R in batch mode C:\documents> Rcmd BATCH myfile.R and I could see the results (including warning messages) in the file myfile.Rout In R.1.7.0 I'm experiencing the followings: (1) even if I type "Rcmd BATCH myfile.R myfile.Rout", no file myfile.Rout is created, but just a "&1" file

Reading large files quickly

2009 May 09

Reading large files quickly

I'm finding that readLines() and read.fwf() take nearly two hours to work through a 3.5 GB file, even when reading in large (100 MB) chunks. The unix command wc by contrast processes the same file in three minutes. Is there a faster way to read files in R? Thanks!

How can I find nonstandard or control characters in a large file?

2013 Dec 09

How can I find nonstandard or control characters in a large file?

I have a humongous csv file containing census data, far too big to read into RAM. I have been trying to extract individual columns from this file using the colbycol package. This works for certain subsets of the columns, but not for others. I have not yet been able to precisely identify the problem columns, as there are 731 columns and running colbycol on the file on my old slow machine takes

dir() and RegEx and gsub()

2005 Jun 09

dir() and RegEx and gsub()

Dear R-Users, I have two questions: a) in a directory there are 3 files: [1] "Data.~csv" "Kopie von Data.~csv" "VorlageTradefile.csv" The command "dir( fold, pattern = "\.csv" )" gives back *all* the 3 files With dir( fold, pattern = "\\.csv" ) I get back only VorlageTradefile.csv. I don't understand this behaviour,

loop, pipe connection, output objects

2006 Oct 13

loop, pipe connection, output objects

Hi all, I have the following -newbye- problem. Inside R, I am trying to process a file and creating from it many files. The file is organized in different columns, the second containing a code. I want to create as output objects, which contain only entries in a certain code range, and whose name contain the code itself. Here is my attempt indice <- (201:399) for(i in indice){ data.i <-

Z score

2012 Oct 24

Z score

Hi, I need to find the z-score of the data present in a speardsheet. The values needs to be calculated for each gene across the samples (refer the example). And, it should be a simple thing, but I am unable to do it right now ! The example re the structure of the spreadsheet is - # Example: MyFile <- read.csv( text= "Names,'Sample_1','Sample_2','Sample_3'

Large Data Set Help

2008 Aug 25

Large Data Set Help

I am attempting to perform some simple data manipulation on a large data set. I have a snippet of the whole data set, and my small snippet is 2GB in CSV. Is there a way I can read my csv, select a few columns, and write it to an output file in real time? This is what I do right now to a small test file: data <- read.csv('data.csv', header = FALSE) data_filter <- data[c(1,3,4)]

points rejected as lying outside the specified window

2013 Jan 29

points rejected as lying outside the specified window

Hello, I am using the following code to create ppp files from csv data and map shape files, but I am getting some errors which I have been unable to fix by searching them online: library(spatstat) library(maps) library(maptools) NYC2<-readShapePoly("nybb.shp") # this is a map of the NYC boroughs without waterways and no census tract divisions (but it does include lines separating

How to read data sequentially into R (line by line)?

2011 Oct 18

How to read data sequentially into R (line by line)?

I have a data set like this in one .txt file (cols separated by !): APE!KKU!684! APE!VAL!! APE!UASU!! APE!PLA!1! APE!E!10! APE!TPVA!17122009! APE!STAP!1! GG!KK!KK! APE!KKU!684! APE!VAL!! APE!UASU!! APE!PLA!1! APE!E!10! APE!TPVA!17122009! APE!STAP!1! GG!KK!KK! APE!KKU!684! APE!VAL!! APE!UASU!! APE!PLA!1! APE!E!10! APE!TPVA!17122009! APE!STAP!1! GG!KK!KK! it contains over 14 000 000 records. Now

how to run .r batch file

2002 Mar 28

how to run .r batch file

I am completely new to R, but have some bg in cs and stat. since R has an interactive programming environment, I thought I could write my command in a file and run it in batch mode in the Rgui. but it didnt work. I checked the doc. it said Rcmd BATCH inputfile. it didnt work either. Can someone help? thank you. thank you. also if you can shed some light on passing command line arguements (like

Wish list

2006 Jan 01

Wish list

This is my New Year wishlist for R features. One common thread is that I find I sometimes use languages other than R including javascript, Windows batch and gawk. Others have mentioned other languages too. It would be nice if, in those cases I could use R simplifying development into a single environment (viz. R). The following are not in any order. 1. Self Contained Executables Make it

read.table: mysterious line omissions

2009 Dec 20

read.table: mysterious line omissions

Hello again, I am simply trying to import a rectangular table of strings. The table's dimensions are 1990 x 2, yet my read.table() command can only find 362 of the rows (and they're not the first 362). I would've taken the time to figure out how to use scan, readLines, or some other tool that can read in character strings, and then parse and input to a table, but that seems like

Files with Missing Data

2010 Jul 02

Files with Missing Data

I'm a new R user so this is possibly a naive question. I'm trying to load an external CSV file into a dataframe using: df_name<-read.table("myfile.csv") myfile.csv should have 5 elements per row, though a percentage are missing the last two elements (the commas are present as placemarkers). However, R does not create the dataset but returns the message: line 1

reading row vectors from file

2005 Mar 03

reading row vectors from file

Hi, New to R, using version 2.0.1 (2004-11-15) on debian Linux (sid), kernel 2.6.8-2-686. I have data in files with separate vectors on each row of the file, e.g., $ cat /tmp/stats freq,0,1,2,3,4,5,6,7,8,9,16,17,18,19,20,... noise,49,47,48,48,50,47,48,47,46,50,48,54,49,47,49,... signal,99,0,100,0,0,100,0,100,100,0,100,101,100,0,0,... pctrcv,5,0,5,0,0,5,0,5,11,0,5,5,5,0,0,...

similar to: pipe