similar to: reshape command is (stats) dropping instances

Displaying 20 results from an estimated 1000 matches similar to: "reshape command is (stats) dropping instances"

2006 Jan 17
2
Installing a package yet it will not work.
I want R to read my Microsoft Access database or maybe even a Sybase database. I installed RODBC or at least thought I did. Then I issued the following command: library(RODBC) And got Error in lazyLoadDBfetch(key, datafile, compressed, envhook) : ReadItem: unknown type 241 In addition: Warning message: package 'RODBC' was built under R version 2.3.0 Error: package/namespace
2006 May 03
5
Listing Variables
How does one create a vector whose contents is the list of variables in a dataframe pertaining to a particular pattern? This is so simple but I cannot find a straightforward answer. I want to be able to pass the contents of that list to a "for" loop. So let us assume that one has a dataframe whose name is Data. And let us assume one had the height of a group of people measured at
2006 Apr 29
3
Writing responses to the R-Help list
A while back Gabor Grothendieck suggested that I try http://news.gmane.org/gmane.comp.lang.r.general. This was after I asked how to easily reply to posts on the listserve. Ideally I would like the functionality that I find in Microsoft Outlook Express newsreader for usenet groups or what I find in Google Groups. I started using gmane about 3 weeks ago. I find it fantastic for searching and for
2006 Apr 27
2
Incomplete Trio in TDT analysis
I am involved in a study where, as in most of life, men demonstrate themselves to be recalcitrant. So while we have many probands and most of their mothers we only have about 50% of the trios being complete. I have been running tdt and trio.types. It appears as if it is ignoring the duos. Sometimes a duo can be informative. For instance Father ..missing Mother 1/2 Proband 1/1 This duo shows that
2006 Mar 04
1
Npmc for doing post-hoc after Kruskal
I followed the threads that enquired about doing post-hoc tests after doing Kruskal testing. It took me to npmc. But npmc is giving an output I do not understand. I noticed a thread entitled "npmc function: 'x' must be atomic" but there never appeared to be a resolution. > npmc(npmcinput) Error in sort(unique.default(x), na.last = TRUE) : 'x' must be atomic
2006 May 02
4
Repeating tdt function on thousands of variables
I am using dgc.genetics to perform TDT analysis on SNP data from a cohort of trios. I now have a file with about 6008 variables. The first few variables related to the pedigree data such as the pedigree ID the person ID etc. Thereafter each variable is a specific locus or marker. The variables are named by a pattern such as "Genotype.nnnnn" with nnnnn corresponding to a number which
2006 May 05
1
How to a handle an error in a loop
I am about one step away from heaven on earth. I think only one step! I am using dgc.genetics to run a TDT test on thousands of genetic loci. I have learnt (through the help of others on this mailing list) to send the complex output to useful data frames which in turn allow me to look at the big picture and screen the thousands of loci. Resultdt<-lapply(PGWide[,240:290], tdt) the above
2006 Apr 06
4
Reshaping genetic data from long to wide
Bottom Line Up Front: How does one reshape genetic data from long to wide? I currently have a lot of data. About 180 individuals (some probands/patients, some parents, rare siblings) and SNP data from 6000 loci on each. The standard formats seem to be something along the lines of Famid, pid, fatid, motid, affected, sex, locus1Allele1, locus1Allele2, locus2Allele1, locus2Allele2, etc In other
2006 Jan 25
2
Unequal sample sizes when calculating power
Power calculations two sample test for proportions is very useful. Is there a way however, to get away from the two samples being of the same size. What would happen if one had n=15 in the one sample and n=45 in the other sample. Farrel Buchinsky, MD Pediatric Otolaryngologist Allegheny General Hospital Pittsburgh, PA **********************************************************************
2006 May 14
2
Bioconductor AND Genetics Library
I am aware of the R Genetics Project that developed the R library and software called Bioconductor (http://www.bioconductor.org/ ) . How do the two relate to each other? What is the one that the other is not and vice versa? Can anybody link me to something that answers the question? -- Farrel Buchinsky, MD Pediatric Otolaryngologist Allegheny General Hospital Pittsburgh, PA
2006 May 30
1
sib TDT transmission/disequilibrium test
Does anyone know if the sib TDT has been implemented in R 1. Spielman, R.S., and Ewens, W.J. (1998) A sibship test for linkage in the presence of association: the sib transmission/disequilibrium test. Am J Hum Genet 62, 450-458 -- Farrel Buchinsky, MD Pediatric Otolaryngologist Allegheny General Hospital Pittsburgh, PA
2009 Jul 19
4
space in column name
I read a table from Microsoft Access using RODBC. Some of the variables had a name with a space in it. R has no problem with it but I do. I cannot find out how to specify the space names(alltime) [1] "ID" "LVL7" "Ref Pv No" "Ref Pv Name" "DOS" "Pt Last Name" "Pt First Name" "MRN"
2008 Dec 10
2
converting multiple columns from POSIX* to Date
converting a POSIX class variable to a date class is easy. dates<-as.Date(x) #where X is of class POSIX How does one do that to all columns in a data frame that are of POSIX class and leave all the other columns (integers, factors) as is. Feel free to reply with just one or two buzzwords that I could then search for to find how to do it. Farrel Buchinsky
2007 Dec 14
6
Analyzing Publications from Pubmed via XML
I would like to track in which journals articles about a particular disease are being published. Creating a pubmed search is trivial. The search provides data but obviously not as an R dataframe. I can get the search to export the data as an xml feed and the xml package seems to be able to read it. xmlTreeParse("
2008 Dec 15
3
Reading from Google Docs
I saw a thread from September 24 in which Duncan Temple Lang told us: - The package currently has no Rd files, but there is a brief "user's guide". The package is available from http://www.omegahat.org/RGoogleDocs I could not find it by using Tinn-R or RGui's package install tool. Then when I went to the website I saw that package is only available as
2009 Dec 10
3
Have you used RGoogleDocs and RGoogleData?
Both of these applications fulfill a great need of mine: to read data directly from google spreadsheets that are private to myself and one or two collaborators. Thanks to the authors. I had been using RGoogleDocs for the about 6 months (maybe more) but have had to stop using it in the past month since for some reason that I do not understand it no longer reads google spreadsheets. I loved it. Its
2007 Jan 09
3
dimensions of a all objects
Why will the following command not work sapply(objects(),dim) What does it say about the objects list? What does it say about the dim command? Likewise, the following also does not work all<-ls() for (f in all) print(dim(f)) -- Farrel Buchinsky [[alternative HTML version deleted]]
2006 May 07
0
How to a handle an error in a loop [Broadcast]
This ought to work: resultdt <- lapply(PGWide[, 240:389], function(x, ...) try(tdt(x, ...))) You can then check the class of each component to see which one failed. Andy From: Farrel Buchinsky > > "Berton Gunter" <gunter.berton at gene.com> wrote in message > news:008601c67097$de1b46e0$5bc4fea9 at gne.windows.gene.com... > > ?try > > > > as in
2007 Oct 02
2
Calculating proportions from a data frame rather than a table
When one has raw data it is easy to create a table of one variable against another and then calculate proportions For example a.nice.table<-table(a,b) prop.table(a.nice.table,1) However, I looked at several papers and created a data frame of the aggregate data. That means I acually created a table except it is a data frame. The first column lists the name of the first author and the year. I
2008 Apr 25
2
Differentiate alphanumeric vs numeric strings
I have a bunch of tables in a Microsoft Access database. An updated database is sent to me every week containing a new table. I know that is inefficient and weird but welcome to my life. I want to read the tables whose names are something such as "040207" but not the ones that have alphanumeric names such as "everyone". Using RODBC I am easily able to create a character vector