Displaying 20 results from an estimated 300 matches similar to: "Does SQL group by have a heavy duty equivalent in R"
2007 Jan 01
1
Subset by using multiple values
I have a vector containing about 20 unique values. It is called rejectrs$rs.
It is a factor.
I have a data frame with about 100000 rows.
I want to exclude all rows where the value of variable rs is one of the 20
on the exclude list. I thought these would work but neither did.
RawSeqBig<-subset(RawSeqBig,ASSAY_ID!=rejectrs$rs)
RawSeqBig<-subset(RawSeqBig,ASSAY_ID!=list(rejectrs$rs))
--
Farrel
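A minimal sketch of one common approach to the question above, not a reply from the thread. `!=` compares element by element (recycling the shorter vector), whereas `%in%` tests membership in a set. The toy data and the name `kept` are made up for illustration; only `RawSeqBig`, `ASSAY_ID` and `rejectrs$rs` come from the post.

```r
# Hypothetical stand-ins for the poster's objects (the real data is not shown)
rejectrs <- data.frame(rs = factor(c("rs1", "rs3")))
RawSeqBig <- data.frame(ASSAY_ID = c("rs1", "rs2", "rs3", "rs4"),
                        value = 1:4)

# Keep only the rows whose ASSAY_ID is NOT on the exclude list.
# %in% matches factors and character vectors by their labels.
kept <- subset(RawSeqBig, !(ASSAY_ID %in% rejectrs$rs))
print(kept)
```

The same filter can be written without subset() as RawSeqBig[!(RawSeqBig$ASSAY_ID %in% rejectrs$rs), ].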
2012 May 21
1
help with melt/cast in reshape-package
I'm sorry everyone for the inconvenience of spamming the R-help...
Here's the complete post:
Hi everyone,
>
> Since it's quite a while that I used the reshape package, I now feel kind
> of rusty.
>
> I have a data.frame like this:
>
>
>
> id Sample.Name Marker Allele.1
> Allele.2 sample_id species
2009 Oct 23
5
Change positions of columns in data frame
Hi all,
Probably a simple question, but I just can't find a simple answer in the older threads or anywhere else.
I've added some new vectors as columns in a data frame using cbind(). As they're all put as the last columns in the data frame, I would like to move them to specific positions. How do you change the position of a column in a data frame?
I know I can use
2009 Jul 29
3
how to skip a numeric column for averaging other columns?
The data has variable names in the first row and sample names in the first
column. I want to take the log of all the data, but how do I compute it
without the first column of sample names?
That is,
column 1: sample ID
column 2-10: data
I want to find an average on each column (2-10)
> apply(raw_data,2,mean)
Error in Math.data.frame(list(sample_id = c(1L, 2L, 3L, 4L, 5L, 6L, 7L, :
2008 Mar 19
2
problem with worker status
Hi,
I have a page where users can start jobs that may take several hours to
run. BackgrounDRb is perfect for me and seems to be working well most of
the time, but sometimes I have the following problem:
My worker is set up like this:
class UploadWorker < BackgrounDRb::MetaWorker
  set_worker_name :upload_worker
  set_no_auto_load(true)
  def create(args = nil)
    file = args[:file]
2010 Sep 14
3
how to compute when row length is different
Hi guys, please help me with this.
I am working on two data frames.
One goes like this:
DF1
  Sample_id RepairHours Denatured Dose ZeroMean FourtyFiveMean NinetyMean
1    SDM071           0         1    B     60.5           19.0       45.0
2    SDM071           1         1    B     46.0           23.0       42.5
3    SDM071           2         1    B     52.5           24.0       40.0
4    SDM071
2009 Jul 19
4
space in column name
I read a table from Microsoft Access using RODBC. Some of the variables had
a name with a space in it.
R has no problem with it but I do.
I cannot find out how to specify the space
names(alltime)
[1] "ID" "LVL7" "Ref Pv No" "Ref Pv Name" "DOS"
"Pt Last Name" "Pt First Name" "MRN"
2006 Jan 17
2
Installing a package yet it will not work.
I want R to read my Microsoft Access database or maybe even a Sybase
database. I installed RODBC or at least thought I did. Then I issued the
following command:
library(RODBC)
And got
Error in lazyLoadDBfetch(key, datafile, compressed, envhook) :
ReadItem: unknown type 241
In addition: Warning message:
package 'RODBC' was built under R version 2.3.0
Error: package/namespace
2008 Dec 10
2
converting multiple columns from POSIX* to Date
Converting a POSIX class variable to a Date class is easy.
dates<-as.Date(x) #where x is of class POSIX
How does one do that to all columns in a data frame that are of POSIX
class and leave all the other columns (integers, factors) as is?
Feel free to reply with just one or two buzzwords that I could then
search for to find how to do it.
Farrel Buchinsky
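A minimal sketch of one way to approach the question above, not a reply from the thread. It assumes the POSIX columns inherit from the class "POSIXt" (true of both POSIXct and POSIXlt); the data frame `df` and the helper name `is_posix` are made up for illustration.

```r
# Hypothetical data frame with one POSIXct column among other column types
df <- data.frame(
  id    = 1:2,
  start = as.POSIXct(c("2008-12-01 10:00:00", "2008-12-02 11:30:00"), tz = "UTC"),
  grp   = factor(c("a", "b"))
)

# Flag the columns whose class inherits from "POSIXt"
is_posix <- vapply(df, inherits, logical(1), what = "POSIXt")

# Convert only the flagged columns; every other column is left untouched
df[is_posix] <- lapply(df[is_posix], as.Date)
str(df)
```

Buzzwords to search for: inherits, vapply/sapply, lapply. Note that as.Date() on a POSIXct value depends on the time zone, so times near midnight may need an explicit tz argument.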
2006 Nov 29
2
reshape command (stats) is dropping instances
I would really appreciate it if anyone could determine what is going
on with the following command. It is only half-working and is losing
lots of data. For the life of me I cannot even see the pattern of what
it is losing and what it is not.
I am attaching the R data set which you can use with the Load
Workspace menu function.
2008 Dec 15
3
Reading from Google Docs
I saw a thread from September 24 in which Duncan Temple Lang told us:
- The package currently has no Rd files, but there is a brief "user's
guide". The package is available from
http://www.omegahat.org/RGoogleDocs
I could not find it by using Tinn-R or RGui's package install tool.
Then when I went to the website I saw that package is only available
as
2007 Jan 09
3
dimensions of all objects
Why will the following command not work?
sapply(objects(),dim)
What does it say about the objects list? What does it say about the dim
command?
Likewise, the following also does not work
all<-ls()
for (f in all) print(dim(f))
--
Farrel Buchinsky
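A minimal sketch of a likely explanation for the question above, not a reply from the thread. objects() and ls() return the names of objects as character strings, so dim() is applied to each string rather than to the object it names, and dim() of a plain string is NULL. Fetching the objects first with mget() (or get() inside the loop) avoids this. The environment `env` and the objects in it are made up for illustration.

```r
# Hypothetical objects placed in their own environment to keep the example tidy
env <- new.env()
assign("m", matrix(0, nrow = 2, ncol = 3), envir = env)
assign("d", data.frame(a = 1:4, b = 5:8), envir = env)
assign("v", 1:10, envir = env)

# ls() yields names only; mget() turns those names into the objects themselves
dims <- lapply(mget(ls(env), envir = env), dim)
print(dims)

# Loop form: get() looks up each object by its name before dim() is called
for (f in ls(env)) print(dim(get(f, envir = env)))
```

Objects without a dim attribute, such as the plain vector v, still give NULL.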
2009 Dec 10
3
Have you used RGoogleDocs and RGoogleData?
Both of these applications fulfill a great need of mine: to read data
directly from google spreadsheets that are private to myself and one or two
collaborators. Thanks to the authors. I had been using RGoogleDocs for
about 6 months (maybe more) but have had to stop using it in the past month
since for some reason that I do not understand it no longer reads google
spreadsheets. I loved it. Its
2007 Oct 02
2
Calculating proportions from a data frame rather than a table
When one has raw data it is easy to create a table of one variable against
another and then calculate proportions
For example
a.nice.table<-table(a,b)
prop.table(a.nice.table,1)
However, I looked at several papers and created a data frame of the
aggregate data. That means I actually created a table except it is a data
frame. The first column lists the name of the first author and the year.
I
2006 May 03
5
Listing Variables
How does one create a vector whose contents are the names of the variables in a
dataframe that match a particular pattern?
This is so simple but I cannot find a straightforward answer.
I want to be able to pass the contents of that list to a "for" loop.
So let us assume that one has a dataframe whose name is Data. And let us
assume one had the height of a group of people measured at
2008 Apr 25
2
Differentiate alphanumeric vs numeric strings
I have a bunch of tables in a Microsoft Access database. An updated database
is sent to me every week containing a new table. I know that is inefficient
and weird but welcome to my life. I want to read the tables whose names are
something such as "040207" but not the ones that have alphanumeric names
such as "everyone". Using RODBC I am easily able to create a character
vector
2009 Jan 23
2
forward slash vs double backslash R and Tinn-R
I installed the newest version of R and once again ran into a problem
with Tinn-R failing when trying to use the R explorer. I had this
problem once before and solved it when I added the following
.trPaths = c(
'C:/Documents and Settings/fbuchins/Application Data/Tinn-R/tmp/',
'C:/Documents and Settings/fbuchins/Application Data/Tinn-R/tmp/search.txt',
'C:/Documents and
2006 Nov 24
1
Sunflower plot error; how to deal with NA
I suspect the problem stems from the fact that there are a couple of NA
values.
> sunflowerplot(lastoto,maxear)
Error in rep.int(i.multi, number[number > 1]) :
invalid number of copies in rep.int()
So I used the subset command to get rid of the cases with NA
hell<-subset(ChinOtoMayB,is.na(lastoto)==FALSE)
Then it worked perfectly
sunflowerplot(hell$lastoto,hell$maxear)
Is
2008 Oct 03
1
Tinn-R explorer used to be my friend
I have upgraded everything lately and can no longer get the Tinn-R explorer
to work. I think I have had this problem before but cannot recall how I
solved it. I run Tinn-R 2.0.0.7 and Rgui version 2.7.2
When I click on the explorer button I get
> trObjList(envir='.GlobalEnv', pattern='', group='', path=.trPaths[3])
Error in trObjList(envir = ".GlobalEnv",
2009 Mar 24
2
two different date formats in the same variable
How does one convert to a date format when survey respondents have
used two different date formats whilst entering their data? They were
clearly told to use mm/dd/yyyy but humans being humans some entered
mm/dd/yy. There were even validity checks on the forms but I allowed
them to be overridden since the data is more holy than the format.
The data was downloaded as a csv and read.csv was used to