similar to: Does SQL group by have a heavy duty equivalent in R

Displaying 20 results from an estimated 300 matches similar to: "Does SQL group by have a heavy duty equivalent in R"

2007 Jan 01
1
Subset by using multiple values
I have a vector containing about 20 unique values. It is called rejectrs$rs. It is a factor. I have a data frame with about 100000 rows. I want to exclude all rows where in variable rs the value is one of the 20 on the exclude list. I thought these would work, but neither did. RawSeqBig<-subset(RawSeqBig,ASSAY_ID!=rejectrs$rs) RawSeqBig<-subset(RawSeqBig,ASSAY_ID!=list(rejectrs$rs)) -- Farrel
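A minimal sketch of the usual approach to this kind of exclusion, assuming the goal is set membership rather than element-by-element comparison. The data below are hypothetical stand-ins for the poster's RawSeqBig and rejectrs objects.

```r
# Hypothetical stand-ins for the poster's objects
RawSeqBig <- data.frame(ASSAY_ID = factor(c("rs1", "rs2", "rs3", "rs4")),
                        value = 1:4)
rejectrs <- data.frame(rs = factor(c("rs2", "rs4")))

# != compares element by element; %in% tests membership in a set.
# %in% matches factors by their labels, so differing level sets are fine.
kept <- subset(RawSeqBig, !(ASSAY_ID %in% rejectrs$rs))

# Optionally drop factor levels that no longer occur
kept <- droplevels(kept)
```

The same filter can be written with plain indexing as RawSeqBig[!(RawSeqBig$ASSAY_ID %in% rejectrs$rs), ].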
2012 May 21
1
help with melt/cast in reshape-package
I'm sorry everyone for the inconvenience of spamming the R-help... Here's the complete post: Hi everyone, Since it has been quite a while since I used the reshape package, I now feel kind of rusty. I have a data.frame like this: id Sample.Name Marker Allele.1 Allele.2 sample_id species
2009 Oct 23
5
Change positions of columns in data frame
Hi all, Probably a simple question, but I just can't find a simple answer in the older threads or anywhere else. I've added some new vectors as columns in a data frame using cbind(). As they're all put as the last columns in the data frame, I would like to move them to specific positions. How do you change the position of a column in a data frame? I know I can use
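One common way to do this is to index the data frame with the columns in the desired order. The sketch below uses a hypothetical data frame and column names; it is not necessarily the approach the poster had in mind.

```r
# Hypothetical data frame; cbind() appends the new column at the end
df <- data.frame(a = 1:3, b = 4:6, c = 7:9)
df <- cbind(df, new1 = 10:12)

# Reorder by listing the column names in the order wanted
df <- df[, c("a", "new1", "b", "c")]
```

Indexing by name rather than by position keeps the code readable if columns are later added or removed.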
2009 Jul 29
3
how to skip a numeric column for averaging other columns?
The data has variable names in the first row and sample names in the first column. I want to take the log of all the data, but how do I compute it without the first column of sample names? That is, column 1: sample ID; columns 2-10: data. I want to find the average of each column (2-10). > apply(raw_data,2,mean) Error in Math.data.frame(list(sample_id = c(1L, 2L, 3L, 4L, 5L, 6L, 7L, :
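A small sketch of one way to leave the ID column out, assuming it is the first column and all remaining columns are numeric. The data frame below is hypothetical and much smaller than the poster's.

```r
# Hypothetical data: column 1 is the sample ID, the rest are measurements
raw_data <- data.frame(sample_id = c("s1", "s2", "s3"),
                       x = c(1, 10, 100),
                       y = c(2, 4, 8))

# Negative indexing drops the first column before computing
logged    <- log(raw_data[, -1])
col_means <- colMeans(raw_data[, -1])
```

If the non-numeric columns are not known in advance, selecting with sapply(raw_data, is.numeric) is an alternative to a fixed position.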
2008 Mar 19
2
problem with worker status
Hi, I have a page where users can start jobs that may take several hours to run. BackgrounDRb is perfect for me and seems to be working well most of the time, but sometimes I have the following problem: My worker is set up like this: class UploadWorker < BackgrounDRb::MetaWorker set_worker_name :upload_worker set_no_auto_load(true) def create(args = nil ) file = args[:file]
2010 Sep 14
3
how to compute when row length is different
hi guys..please help me with this i am working on two data frames one goes like this: DF1 Sample_id RepairHours Denatured Dose ZeroMean FourtyFiveMean NinetyMean 1 SDM071 0 1 B 60.5 19.0 45.0 2 SDM071 1 1 B 46.0 23.0 42.5 3 SDM071 2 1 B 52.5 24.0 40.0 4 SDM071
2009 Jul 19
4
space in column name
I read a table from Microsoft Access using RODBC. Some of the variables had a name with a space in it. R has no problem with it but I do. I cannot find out how to specify the space. names(alltime) [1] "ID" "LVL7" "Ref Pv No" "Ref Pv Name" "DOS" "Pt Last Name" "Pt First Name" "MRN"
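A sketch of the standard ways to refer to a column whose name contains a space: backticks, or a quoted name inside [[ ]]. The data frame is a hypothetical two-column stand-in for the poster's alltime table.

```r
# Hypothetical stand-in; check.names = FALSE keeps the space in the name
alltime <- data.frame(ID = 1:2, `Ref Pv No` = c(101, 102),
                      check.names = FALSE)

by_backtick <- alltime$`Ref Pv No`     # backticks quote a non-syntactic name
by_string   <- alltime[["Ref Pv No"]]  # a character string also works

# Alternatively, convert all names to syntactic ones (spaces become dots)
renamed <- alltime
names(renamed) <- make.names(names(renamed))
```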
2006 Jan 17
2
Installing a package yet it will not work.
I want R to read my Microsoft Access database or maybe even a Sybase database. I installed RODBC or at least thought I did. Then I issued the following command: library(RODBC) And got Error in lazyLoadDBfetch(key, datafile, compressed, envhook) : ReadItem: unknown type 241 In addition: Warning message: package 'RODBC' was built under R version 2.3.0 Error: package/namespace
2008 Dec 10
2
converting multiple columns from POSIX* to Date
Converting a POSIX class variable to a Date class is easy: dates<-as.Date(x) #where x is of class POSIX. How does one do that to all columns in a data frame that are of POSIX class and leave all the other columns (integers, factors) as is? Feel free to reply with just one or two buzzwords that I could then search for to find how to do it. Farrel Buchinsky
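One possible approach is to find the POSIX columns with inherits() and convert only those with lapply(). The data frame and column names below are hypothetical.

```r
# Hypothetical data frame with one POSIXct column among other types
df <- data.frame(id = 1:2,
                 seen = as.POSIXct(c("2008-12-01 10:00:00",
                                     "2008-12-02 11:30:00"), tz = "UTC"),
                 grp = factor(c("a", "b")))

# Logical vector: TRUE for columns of class POSIXct or POSIXlt
is_posix <- sapply(df, inherits, what = "POSIXt")

# Convert only those columns; all other columns are left untouched
df[is_posix] <- lapply(df[is_posix], as.Date)
```

Note that as.Date() on a POSIXct value depends on the time zone used for the conversion, so times near midnight may need an explicit tz argument.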
2006 Nov 29
2
reshape command is (stats) dropping instances
I would really appreciate it if anyone could determine what is going on with the following command. It is only half-working and is losing lots of data. For the life of me I cannot even see the pattern of what it is losing and what it is not. I am attaching the R data set which you can use with the Load Workspace menu function.
2008 Dec 15
3
Reading from Google Docs
I saw a thread from September 24 in which Duncan Temple Lang told us: - The package currently has no Rd files, but there is a brief "user's guide". The package is available from http://www.omegahat.org/RGoogleDocs I could not find it by using Tinn-R or RGui's package install tool. Then when I went to the website I saw that package is only available as
2007 Jan 09
3
dimensions of a all objects
Why will the following command not work? sapply(objects(),dim) What does it say about the objects list? What does it say about the dim command? Likewise, the following also does not work: all<-ls() for (f in all) print(dim(f)) -- Farrel Buchinsky
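The likely explanation is that objects() and ls() return the names of objects as character strings, so dim() is applied to each string rather than to the object it names. A sketch of one fix, using mget() to look the objects up by name; the environment and object names here are hypothetical.

```r
# Hypothetical environment holding a few objects
env <- new.env()
assign("m", matrix(0, 2, 3), envir = env)
assign("d", data.frame(x = 1:4), envir = env)
assign("v", 1:5, envir = env)

# ls() gives names only; mget() fetches the objects themselves
dims <- lapply(mget(ls(env), envir = env), dim)
```

For the global workspace the equivalent would be lapply(mget(ls()), dim); objects without a dim attribute, such as plain vectors, give NULL.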
2009 Dec 10
3
Have you used RGoogleDocs and RGoogleData?
Both of these applications fulfill a great need of mine: to read data directly from google spreadsheets that are private to myself and one or two collaborators. Thanks to the authors. I had been using RGoogleDocs for about 6 months (maybe more) but have had to stop using it in the past month since, for some reason that I do not understand, it no longer reads google spreadsheets. I loved it. Its
2007 Oct 02
2
Calculating proportions from a data frame rather than a table
When one has raw data it is easy to create a table of one variable against another and then calculate proportions. For example: a.nice.table<-table(a,b) prop.table(a.nice.table,1) However, I looked at several papers and created a data frame of the aggregate data. That means I actually created a table except it is a data frame. The first column lists the name of the first author and the year. I
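A sketch of how row proportions might be computed from such an aggregate data frame, assuming the count columns can be pulled out into a numeric matrix. The study labels and the yes/no column names are hypothetical, since the post is cut off before describing the remaining columns.

```r
# Hypothetical aggregate data: one row per paper, counts in the other columns
agg <- data.frame(study = c("Smith 2001", "Jones 2004"),
                  yes = c(10, 30),
                  no  = c(30, 10))

# prop.table() works on a numeric matrix, so drop the label column first
counts <- as.matrix(agg[, c("yes", "no")])
rownames(counts) <- as.character(agg$study)

props <- prop.table(counts, 1)  # margin 1: each row sums to 1
```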
2006 May 03
5
Listing Variables
How does one create a vector whose contents is the list of variables in a dataframe pertaining to a particular pattern? This is so simple but I cannot find a straightforward answer. I want to be able to pass the contents of that list to a "for" loop. So let us assume that one has a dataframe whose name is Data. And let us assume one had the height of a group of people measured at
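One way to build such a vector is grep() on the column names with value = TRUE. The sketch assumes the height variables share a common prefix; the column names below are hypothetical because the post is truncated before giving them.

```r
# Hypothetical data frame with several height measurements
Data <- data.frame(id = 1:3,
                   height_2004 = c(150, 160, 170),
                   height_2005 = c(152, 161, 171),
                   weight = c(50, 60, 70))

# value = TRUE returns the matching names instead of their positions
height_vars <- grep("^height", names(Data), value = TRUE)

# The names can then drive a for loop
height_means <- numeric(0)
for (v in height_vars) {
  height_means[v] <- mean(Data[[v]])
}
```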
2008 Apr 25
2
Differentiate alphanumeric vs numeric strings
I have a bunch of tables in a Microsoft Access database. An updated database is sent to me every week containing a new table. I know that is inefficient and weird but welcome to my life. I want to read the tables whose names are something such as "040207" but not the ones that have alphanumeric names such as "everyone". Using RODBC I am easily able to create a character vector
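A sketch of how the digit-only names could be separated out with a regular expression, assuming the table names are already in a character vector. The vector below is hypothetical apart from the two names quoted in the post.

```r
# Hypothetical vector of table names, e.g. as obtained via RODBC
table_names <- c("040207", "everyone", "041407", "notes2")

# ^[0-9]+$ matches strings made up entirely of digits
numeric_only <- table_names[grepl("^[0-9]+$", table_names)]
```

Keeping the names as character strings, rather than converting them with as.numeric(), preserves leading zeros such as the one in "040207".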
2009 Jan 23
2
forward slash vs double backslash R and Tinn-R
I installed the newest version of R and once again ran into a problem with Tinn-R failing when trying to use the R explorer. I had this problem once before and solved it when I added the following: .trPaths = c( 'C:/Documents and Settings/fbuchins/Application Data/Tinn-R/tmp/', 'C:/Documents and Settings/fbuchins/Application Data/Tinn-R/tmp/search.txt', 'C:/Documents and
2006 Nov 24
1
Sunflower plot error; how to deal with NA
I suspect the problem stems from the fact that there are a couple of NA values. > sunflowerplot(lastoto,maxear) Error in rep.int(i.multi, number[number > 1]) : invalid number of copies in rep.int() So I used the subset command to get rid of the cases with NA: hell<-subset(ChinOtoMayB,is.na(lastoto)==FALSE) Then it worked perfectly: sunflowerplot(hell$lastoto,hell$maxear) Is
2008 Oct 03
1
Tinn-R explorer used to be my friend
I have upgraded everything lately and can no longer get the Tinn-R explorer to work. I think I have had this problem before but cannot recall how I solved it. I run Tinn-R 2.0.0.7 and Rgui version 2.7.2. When I click on the explorer button I get > trObjList(envir='.GlobalEnv', pattern='', group='', path=.trPaths[3]) Error in trObjList(envir = ".GlobalEnv",
2009 Mar 24
2
two different date formats in the same variable
How does one convert to a date format when survey respondents have used two different date formats whilst entering their data? They were clearly told to use mm/dd/yyyy, but humans being humans, some entered mm/dd/yy. There were even validity checks on the forms, but I allowed them to be overridden since the data is more holy than the format. The data was downloaded as a csv and read.csv was used to
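A sketch of one way to handle the two formats, assuming every entry is either mm/dd/yyyy or mm/dd/yy and that the year is the last field. The input strings are hypothetical.

```r
# Hypothetical raw entries mixing four-digit and two-digit years
raw <- c("03/24/2009", "3/5/09", "12/01/2008")

# Entries ending in a slash followed by exactly four digits use %Y
four_digit <- grepl("/[0-9]{4}$", raw)

parsed <- rep(as.Date(NA), length(raw))
parsed[four_digit]  <- as.Date(raw[four_digit],  format = "%m/%d/%Y")
parsed[!four_digit] <- as.Date(raw[!four_digit], format = "%m/%d/%y")
```

With %y, R maps two-digit years 00 to 68 onto 2000 to 2068 and 69 to 99 onto 1969 to 1999, which may or may not suit dates such as birth years.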