Displaying 20 results from an estimated 1000 matches similar to: "reshape command is (stats) dropping instances"
2006 Jan 17
2
Installing a package yet it will not work.
I want R to read my Microsoft Access database or maybe even a Sybase
database. I installed RODBC or at least thought I did. Then I issued the
following command:
library(RODBC)
And got
Error in lazyLoadDBfetch(key, datafile, compressed, envhook) :
ReadItem: unknown type 241
In addition: Warning message:
package 'RODBC' was built under R version 2.3.0
Error: package/namespace
2006 May 03
5
Listing Variables
How does one create a vector whose contents are the names of the variables in
a data frame that match a particular pattern?
This is so simple but I cannot find a straightforward answer.
I want to be able to pass the contents of that list to a "for" loop.
So let us assume that one has a dataframe whose name is Data. And let us
assume one had the height of a group of people measured at
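One way to approach this, as a minimal sketch: match the column names against a pattern with grep() and loop over the result. The column names and the "^height" pattern below are made up for illustration, since the post is cut off before it describes the real ones.

```r
# Hypothetical data frame; the real column names are not shown in the post.
Data <- data.frame(
  id          = 1:3,
  height.2004 = c(150, 160, 170),
  height.2005 = c(152, 161, 171),
  weight.2004 = c(50, 60, 70)
)

# Names of all columns that start with "height"
height_vars <- grep("^height", names(Data), value = TRUE)

# Loop over the matching names and use each one to pull out its column
for (v in height_vars) {
  cat(v, "mean:", mean(Data[[v]]), "\n")
}
```

Data[[v]] is used inside the loop because v holds a name as a string, and Data$v would look for a column literally called "v".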
2006 Apr 29
3
Writing responses to the R-Help list
A while back Gabor Grothendieck suggested that I try
http://news.gmane.org/gmane.comp.lang.r.general. This was after I asked how
to easily reply to posts on the listserve. Ideally I would like the
functionality that I find in Microsoft Outlook Express newsreader for usenet
groups or what I find in Google Groups.
I started using gmane about 3 weeks ago. I find it fantastic for searching
and for
2006 Apr 27
2
Incomplete Trio in TDT analysis
I am involved in a study where, as in most of life, men demonstrate
themselves to be recalcitrant. So while we have many probands and most of
their mothers we only have about 50% of the trios being complete.
I have been running tdt and trio.types. It appears as if it is ignoring the
duos. Sometimes a duo can be informative. For instance
Father ..missing
Mother 1/2
Proband 1/1
This duo shows that
2006 Mar 04
1
Npmc for doing post-hoc after Kruskal
I followed the threads that enquired about doing post-hoc tests after doing
Kruskal testing. It took me to npmc. But npmc is giving an output I do not
understand.
I noticed a thread entitled "npmc function: 'x' must be atomic" but there
never appeared to be a resolution.
> npmc(npmcinput)
Error in sort(unique.default(x), na.last = TRUE) :
'x' must be atomic
2006 May 02
4
Repeating tdt function on thousands of variables
I am using dgc.genetics to perform TDT analysis on SNP data from a cohort of
trios.
I now have a file with about 6008 variables. The first few variables related
to the pedigree data such as the pedigree ID the person ID etc. Thereafter
each variable is a specific locus or marker. The variables are named by a
pattern such as "Genotype.nnnnn" with nnnnn corresponding to a number which
2006 May 05
1
How to a handle an error in a loop
I am about one step away from heaven on earth. I think only one step!
I am using dgc.genetics to run a TDT test on thousands of genetic loci. I
have learnt (through the help of others on this mailing list) to send the
complex output to useful data frames which in turn allow me to look at the
big picture and screen the thousands of loci.
Resultdt<-lapply(PGWide[,240:290], tdt)
the above
2006 Apr 06
4
Reshaping genetic data from long to wide
Bottom Line Up Front: How does one reshape genetic data from long to wide?
I currently have a lot of data. About 180 individuals (some
probands/patients, some parents, rare siblings) and SNP data from 6000 loci
on each. The standard formats seem to be something along the lines of Famid,
pid, fatid, motid, affected, sex, locus1Allele1, locus1Allele2,
locus2Allele1, locus2Allele2, etc
In other
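As a rough sketch of the long-to-wide step, base R's reshape() can spread one row per person per locus into one row per person. The long layout and the column names (pid, locus, allele1, allele2) are assumptions made for illustration; the post is truncated before it shows the actual long format.

```r
# Hypothetical long-format data: one row per person per locus
long <- data.frame(
  pid     = rep(c("p1", "p2"), each = 2),
  locus   = rep(c("locus1", "locus2"), times = 2),
  allele1 = c("A", "C", "A", "T"),
  allele2 = c("G", "C", "A", "T")
)

# One row per person, one pair of allele columns per locus
wide <- reshape(
  long,
  idvar     = "pid",
  timevar   = "locus",
  v.names   = c("allele1", "allele2"),
  direction = "wide"
)

names(wide)
```

If a person has no row for a given locus in the long data, the corresponding wide columns come out as NA, so it is worth comparing the number of rows in wide against the number of distinct IDs.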
2006 Jan 25
2
Unequal sample sizes when calculating power
The power calculation for the two-sample test for proportions is very useful.
Is there a way, however, to get away from the two samples being of the same
size? What would happen if one had n=15 in one sample and n=45 in the other?
Farrel Buchinsky, MD
Pediatric Otolaryngologist
Allegheny General Hospital
Pittsburgh, PA
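For what it is worth, power.prop.test() in base R takes a single n per group. One possible workaround, sketched below, is the arcsine (Cohen's h) normal approximation, which accepts two different sample sizes. This is a substitute approximation and not what power.prop.test() computes; the function name power_2p_unequal and the proportions 0.2 and 0.6 are assumptions chosen for illustration.

```r
# Approximate power of a two-sided test comparing two proportions
# with unequal group sizes, using Cohen's effect size h.
power_2p_unequal <- function(p1, p2, n1, n2, sig.level = 0.05) {
  h      <- 2 * asin(sqrt(p1)) - 2 * asin(sqrt(p2))  # arcsine-transformed difference
  z_crit <- qnorm(1 - sig.level / 2)                 # two-sided critical value
  ncp    <- abs(h) * sqrt(n1 * n2 / (n1 + n2))       # shift of the test statistic
  pnorm(ncp - z_crit) + pnorm(-ncp - z_crit)
}

# Hypothetical proportions; n = 15 and n = 45 come from the question
power_2p_unequal(p1 = 0.2, p2 = 0.6, n1 = 15, n2 = 45)
```

When p1 equals p2 the function returns sig.level, which is a quick sanity check on the formula.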
2006 May 14
2
Bioconductor AND Genetics Library
I am aware of the R Genetics Project that developed the R library and
software called Bioconductor (http://www.bioconductor.org/). How do the
two relate to each other? What is the one that the other is not and vice
versa? Can anybody link me to something that answers the question?
--
Farrel Buchinsky, MD
Pediatric Otolaryngologist
Allegheny General Hospital
Pittsburgh, PA
2006 May 30
1
sib TDT transmission/disequilibrium test
Does anyone know if the sib TDT has been implemented in R?
1. Spielman, R.S., and Ewens, W.J. (1998) A sibship test for linkage in the
presence of association: the sib transmission/disequilibrium test. Am J Hum
Genet 62, 450-458
--
Farrel Buchinsky, MD
Pediatric Otolaryngologist
Allegheny General Hospital
Pittsburgh, PA
2009 Jul 19
4
space in column name
I read a table from Microsoft Access using RODBC. Some of the variables had
a name with a space in it.
R has no problem with it but I do.
I cannot find out how to specify the space
names(alltime)
[1] "ID" "LVL7" "Ref Pv No" "Ref Pv Name" "DOS"
"Pt Last Name" "Pt First Name" "MRN"
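A small sketch of the usual ways to refer to a column whose name contains a space: backticks with $, a quoted string with [[ ]], or renaming the columns with make.names(). The data frame below is a made-up stand-in that only borrows two of the column names shown above.

```r
# Hypothetical stand-in for the table read via RODBC
alltime <- data.frame(
  ID          = 1:2,
  `Ref Pv No` = c(10, 20),
  check.names = FALSE   # keep the space in the column name
)

alltime$`Ref Pv No`      # backticks around the name
alltime[["Ref Pv No"]]   # or a quoted string inside [[ ]]

# Alternatively, convert every name to a syntactically valid one
names(alltime) <- make.names(names(alltime))
names(alltime)
```

After make.names() the spaces become dots, so the column can be reached as alltime$Ref.Pv.No.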
2008 Dec 10
2
converting multiple columns from POSIX* to Date
Converting a POSIX class variable to a Date class is easy.
dates <- as.Date(x)  # where x is of class POSIX
How does one do that to all columns in a data frame that are of POSIX
class and leave all the other columns (integers, factors) as is.
Feel free to reply with just one or two buzzwords that I could then
search for to find how to do it.
Farrel Buchinsky
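A minimal sketch of one approach: flag the POSIX columns with inherits() and convert only those, leaving everything else alone. The data frame and its column names are invented for illustration.

```r
# Hypothetical data frame with one POSIXct column among other types
df <- data.frame(
  id   = 1:3,
  when = as.POSIXct(c("2008-12-01 10:00:00",
                      "2008-12-02 11:30:00",
                      "2008-12-03 09:15:00"), tz = "UTC"),
  grp  = factor(c("a", "b", "a"))
)

# TRUE for columns of class POSIXct or POSIXlt (both inherit from "POSIXt")
is_posix <- vapply(df, inherits, logical(1), what = "POSIXt")

# Convert only the flagged columns; the others are untouched
df[is_posix] <- lapply(df[is_posix], as.Date)

str(df)
```

Buzzwords to search for: inherits, lapply, and logical indexing of data frame columns.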
2007 Dec 14
6
Analyzing Publications from Pubmed via XML
I would like to track in which journals articles about a particular disease
are being published. Creating a pubmed search is trivial. The search
provides data but obviously not as an R dataframe. I can get the search to
export the data as an xml feed and the xml package seems to be able to read
it.
xmlTreeParse("
2008 Dec 15
3
Reading from Google Docs
I saw a thread from September 24 in which Duncan Temple Lang told us:
- The package currently has no Rd files, but there is a brief "user's
guide". The package is available from
http://www.omegahat.org/RGoogleDocs
I could not find it by using Tinn-R or RGui's package install tool.
Then when I went to the website I saw that package is only available
as
2009 Dec 10
3
Have you used RGoogleDocs and RGoogleData?
Both of these applications fulfill a great need of mine: to read data
directly from google spreadsheets that are private to myself and one or two
collaborators. Thanks to the authors. I had been using RGoogleDocs for
about 6 months (maybe more) but have had to stop using it in the past month
since for some reason that I do not understand it no longer reads google
spreadsheets. I loved it. Its
2007 Jan 09
3
dimensions of a all objects
Why will the following command not work?
sapply(objects(),dim)
What does it say about the objects list? What does it say about the dim
command?
Likewise, the following also does not work
all<-ls()
for (f in all) print(dim(f))
--
Farrel Buchinsky
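A likely explanation, sketched below: objects() and ls() return a character vector of names, so dim() is applied to each name as a string and not to the object it refers to. Looking the objects up with get() or mget() first gives the dimensions. The separate environment and the three objects are only there to keep the example self-contained.

```r
# Put a few hypothetical objects in their own environment
env <- new.env()
assign("m", matrix(1:6, nrow = 2), envir = env)
assign("d", data.frame(a = 1:3, b = 4:6), envir = env)
assign("v", 1:10, envir = env)

obj_names <- ls(env)   # a character vector of names
dim(obj_names[1])      # NULL: this is dim() of a string, not of the object

# Fetch the objects by name, then ask each one for its dimensions
dims <- lapply(mget(obj_names, envir = env), dim)
dims
```

In the global workspace the equivalent would be lapply(mget(ls()), dim). Plain vectors have no dim attribute, so their entries are NULL.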
2006 May 07
0
How to a handle an error in a loop [Broadcast]
This ought to work:
resultdt <- lapply(PGWide[, 240:389], function(x, ...) try(tdt(x, ...)))
You can then check the class of each component to see which one failed.
Andy
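To illustrate the class check mentioned above, here is a small self-contained sketch. The function risky() is a hypothetical stand-in for tdt(), and the input list is made up; only the try() pattern is taken from the reply.

```r
# Hypothetical stand-in for tdt(): fails when the input has missing values
risky <- function(x) {
  if (anyNA(x)) stop("missing values")
  mean(x)
}

inputs <- list(a = c(1, 2, 3), b = c(1, NA), c = c(4, 6))

# try() lets the loop carry on past the failing element
results <- lapply(inputs, function(x) try(risky(x), silent = TRUE))

# Failed components have class "try-error"
failed <- vapply(results, inherits, logical(1), what = "try-error")
names(results)[failed]   # which inputs failed
results[!failed]         # the successful results
```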
From: Farrel Buchinsky
>
> "Berton Gunter" <gunter.berton at gene.com> wrote in message
> news:008601c67097$de1b46e0$5bc4fea9 at gne.windows.gene.com...
> > ?try
> >
> > as in
2007 Oct 02
2
Calculating proportions from a data frame rather than a table
When one has raw data it is easy to create a table of one variable against
another and then calculate proportions
For example
a.nice.table<-table(a,b)
prop.table(a.nice.table,1)
However, I looked at several papers and created a data frame of the
aggregate data. That means I actually created a table except it is a data
frame. The first column lists the name of the first author and the year.
I
2008 Apr 25
2
Differentiate alphanumeric vs numeric strings
I have a bunch of tables in a Microsoft Access database. An updated database
is sent to me every week containing a new table. I know that is inefficient
and weird but welcome to my life. I want to read the tables whose names are
something such as "040207" but not the ones that have alphanumeric names
such as "everyone". Using RODBC I am easily able to create a character
vector
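One way to separate the two kinds of names, as a sketch: test each string against a regular expression that allows digits only. The vector of table names below is invented for illustration and does not come from a real database call.

```r
# Hypothetical table names, as they might come back from the database
table_names <- c("040207", "041407", "everyone", "notes2007")

# TRUE only when the whole name consists of digits
numeric_only <- grepl("^[0-9]+$", table_names)

table_names[numeric_only]    # names made up of digits only
table_names[!numeric_only]   # everything else
```

The anchors ^ and $ matter here: without them a name such as "notes2007" would also match because it contains digits.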