thr3ads.net - similar to: "Replace NAs in dataframe: what am I doing wrong"

Displaying 20 results from an estimated 40000 matches similar to: "Replace NAs in dataframe: what am I doing wrong"

2007 Nov 24

'Split' character

Dear R-users, The following code splits a very simple dataframe into a list, each element of the list being one line of the dataframe. You will see that the split function names each element of the list by using the content of a and b and merging them with a "." character. Is there a way to customize this character? a <- 1:10; b <- 21:30; mydata <- data.frame(a, b)
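A minimal sketch of one possible answer, assuming the list is built by splitting on both columns (the snippet is cut off before the split call, so this is a guess at the setup). interaction() has a sep argument that controls the joining character; the underscore and the names groups and rows are illustrative choices, not taken from the thread.

```r
a <- 1:10
b <- 21:30
mydata <- data.frame(a, b)

# interaction() joins the levels of a and b with the chosen separator;
# drop = TRUE keeps only the combinations that actually occur in the data.
groups <- interaction(mydata$a, mydata$b, sep = "_", drop = TRUE)

# One list element per row, named with "_" instead of the default ".".
rows <- split(mydata, groups)

names(rows)
```

Any single string can be passed as sep, so the same pattern works for other separators.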

Reading gz compressed csv file - 'incomplete line found'

2011 Jan 21

Hi all, I am trying to download, decompress and read a csv file. My code: myurl <- "ftp://ftp.ncbi.nih.gov/pub/geo/DATA/supplementary/series/GSE24729/GSE24729_MitoNuclear_suppl_male_stats.csv.gz" # myfile <- "GSE24729_MitoNuclear_suppl_male_stats.csv.gz" # download.file(myurl, destfile=myfile, mode="w") # mycon <- gzcon(gzfile(myfile,

How to handle "~" character after csv importation

2008 Aug 22

Dear R users, I have to import some csv files in which column headers contain the character "~". After the following import call, the character seems to be replaced by dots in the column names of my data frame. Plus, I cannot query names(mydata) to find the index of the column whose header should contain "~" or "." > mydata <-
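A hedged sketch of what is probably going on, assuming the import uses read.csv() or a related function (the original call is cut off). By default these functions pass headers through make.names(), which replaces characters such as "~" with "."; check.names = FALSE turns that off. The file contents and column names below are made up for illustration.

```r
# Hypothetical two-column file whose first header contains "~".
csv_text <- "conc~time,dose\n1.5,10\n2.5,20\n"

# check.names = FALSE keeps the headers exactly as written in the file.
mydata <- read.csv(text = csv_text, check.names = FALSE)

names(mydata)

# Locate the column by matching the literal "~" in its name.
tilde_col <- grep("~", names(mydata), fixed = TRUE)
```

Columns with non-syntactic names then have to be accessed with mydata[["conc~time"]] or backticks rather than a bare $ name.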

suggestions regarding reading in a messy file

2011 Jul 12

I have a file in stata format, which I have read in, and I am trying to create a text file. I have exported the data using various delimiters, but I'm unable to read it back in. I originally read in the file with: library(foreign) myData <- read.dta("mydata.dta") I then exported it with write.table using comma, tab, and exclamation marks as a delimiter. When I was unable to

select if + other questions

2007 Apr 26

Hi, I am trying to read a .txt file, do a couple of select if statements on my data, and then finally use the ?table function to get frequency counts on the data. Specifically, I am looking at answering the following question: What is the frequency of Grade 7 students in the province of Alberta who are smokers? I am having some problems: 1) I cannot get the column names to show up when print

creating NAs for some values only

2011 Feb 13

Hello, I have a data file, say, mydata 1,2,3,4,5,6,7 3,3,4,4,w,w,1 w,3,6,5,7,8,9 4,4,w,5,3,3,0 I want to replace some percentage of the values in "mydata" with NAs, but only those values that are NOT w's. I know how to apply the percentage thing here but don't know how to select those values that are not "w"s. So far, I was able to do it but the result replaces the w's

Replace selected columns of a dataframe with NA

2011 Jun 20

I am using the following command to replace all the missing values and assorted typos in a dataframe with NA: mydata[mydata>80]=NA The problem is that the first column contains values which should be more than 80, so really I want to do it just for mydata[,2:length(mydata)] I can't seem to re-write the code to fit: mydata[,2:length(mydata)>80]=NA # no error message, but doesn't
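A short sketch of one way to restrict the replacement to the later columns. In the attempt quoted above, `2:length(mydata) > 80` compares the column indices themselves with 80, which yields all FALSE, so no columns are selected and nothing changes. The example data and the name cols are assumptions for illustration.

```r
# Hypothetical data: the first column holds legitimate values above 80.
mydata <- data.frame(id = c(101, 102, 103),
                     x  = c(10, 95, 30),
                     y  = c(81, 20, 999))

cols <- 2:ncol(mydata)                  # every column except the first

# Build the logical mask from the selected columns only, then assign NA
# to the matching cells of that same subset.
mydata[cols][mydata[cols] > 80] <- NA
```

The first column is never part of the mask, so its values above 80 are left alone.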

as.numeric() generates NAs inside an apply call, but fine outside of it

2012 Jan 09

Hello- I have rather a messy SPSS file which I have imported to R, I've dput'd some of the columns at the end of this message. I wish to get rid of all the labels and have numeric values using as.numeric. The funny thing is it works like this: as.numeric(mydata[,2]) # generates correct numbers however, if I pass the whole dataframe at once like this: apply(mydata, 1:2, function(x)

Overlaying lattice graphs

2007 Jun 11

Hello, I apologize in advance if this question has already been posted on the list, although I could not find a relevant thread in the archives. I would like to overlay xyplots using different datasets for each plot. I typically work on the following data.frame (mydata) structure >mydata Drug Time Observed Predicted 1 A 0.05 10

How to do the same thing for all levels of a column?

2012 Jul 23

Dear all, I am an R beginner, and I am looking for a way to do the same thing for all levels of a column in a table. Basically, I have a bunch of protein sequences composed of different amino acid residues, and each residue is represented by an uppercase letter. I want to calculate the ratio of different amino acid residues at each position of the proteins. Here is an example table: Proteins

Overlaying lattice graphs (continued)

2007 Jun 21

Dear R Users, I recently posted an email on this list about the use of data.frame and overlaying multiple plots. Deepayan kindly indicated to me the panel.superposition command which worked perfectly in the context of the example I gave. I'd like to go a little bit further on this topic using a more complex dataset structure (actually the one I want to work on). >mydata Plot

replacing all NA's in a dataframe with zeros...

2007 Mar 15

I've seen how to replace the NA's in a single column within a data frame: > mydata$ncigs[is.na(mydata$ncigs)]<-0 But this is just one column... I have thousands of columns (!) that I need to do this for, and I can't figure out a way, outside of the dreaded loop, to replace all NA's in an entire data frame (all vars) without naming each var separately. Yikes. I'm racking my
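A minimal sketch of the usual loop-free idiom, assuming every column is numeric (factor columns would need 0 added as a level first). The example data frame is made up; only the ncigs column name comes from the question.

```r
# Hypothetical data frame with NAs scattered across columns.
mydata <- data.frame(ncigs = c(3, NA, 0),
                     age   = c(NA, 41, 29))

# is.na() on a data frame returns a logical matrix with the same shape,
# so a single assignment covers every column at once.
mydata[is.na(mydata)] <- 0
```

The same line works unchanged however many columns the data frame has.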

Creating a Model Matrix - keeping NAs

2001 Aug 12

I am wanting to create a model matrix and keep the NAs. stratmat <- model.matrix(myformula,mydata) Is there any way to do this? model.matrix doesn't have na.action as a parameter. Elsewhere I have made use of na.keep <- function(x){x}. Many thanks, Rachel Cunliffe

matlab/gauss code in R

2007 Jun 24

Hi all! I would like to import MATLAB or GAUSS code into R. Could you help me? Bye, Sebastián.

How to findout the name of a dataframe

2013 Feb 17

Let's say we have a dataframe mydata with column v1. If mydata$v1 is passed to a function, is there a way, then, to extract the name of the dataframe? What I now do is pass the name of the dataframe to the function, so passing two parameters. Maybe with mydata$v1 it is not possible, but with mydata['v1'] or mydata[,'v1'] it is? Thanks, Frans Marcelissen

padding specific missing values with NA to allow cbind

2013 Jun 10

Dear list Getting very frustrated with this simple-looking problem > m1 <- lm(x~y, data=mydata) > outliers <- abs(stdres(m1))>2 > plot(x~y, data=mydata) I would like to plot a simple x,y scatter plot with labels giving custom information displayed for the outliers only, i.e. I would like to define a column mydata$labels for the mydata dataframe so that the command >

Error in using nlevels in apply function

2007 Aug 06

Dear R users, I am currently trying to create my first personal function and use it with the apply function. The purpose of this function is to create a vector summarizing the number of levels in a given selection of data.frame columns. I tried to adapt the indexing method used by the nlevels function but it doesn't seem to work. I did not find anything useful in the archives so

dividing a dataframe column by different constants

2009 Sep 03

Dear R users, today I've got the following problem. Here is a dataframe as an example. There are some SAMPLES for which a CONCentration was recorded through TIME. The time during which the concentration was recorded is not always the same: 10 points for Sample A, 7 points for Sample B and 11 for Sample C. Also, the initial concentration was not the same for the three samples. I would like

change all . to 0 in a data.frame

2007 Sep 06

Hello, I read in a tab delimited text file via mydata = read.delim(myfile). The text file was originally an Excel file where . was used in place of 0. Now all the columns which should be integers are factors. Any ideas how to change all the . to 0 and factors back to integer? Thanks a lot in advance for any suggestions, -- D
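One possible approach, sketched under the assumption that "." never appears for any other reason and that the file has no genuine missing values: tell read.delim() to treat "." as NA so the columns are parsed as integers, then replace the NAs with 0. The inline text and column names stand in for the real file and are hypothetical.

```r
# Hypothetical tab-delimited content where "." stands in for 0.
tsv_text <- "count_a\tcount_b\n1\t.\n.\t3\n"

# na.strings = "." makes read.delim() read "." as NA, so the remaining
# values are parsed as integers instead of factors or strings.
mydata <- read.delim(text = tsv_text, na.strings = ".")

# Replace the NAs with integer zero (0L keeps the columns integer).
mydata[is.na(mydata)] <- 0L
```

With a real file, the text argument would be replaced by the file path. Any true missing values in the file would also become 0 under this approach.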
