similar to: Merging data frames, or one column/vector with a data frame filling out empty rows with NA's

Displaying 20 results from an estimated 100 matches similar to: "Merging data frames, or one column/vector with a data frame filling out empty rows with NA's"

2007 Jan 23
1
"tapply" and "data.frame"?
I want to transform the data by "tapply" to one dataframe. But I can not get it. For example: > tst=tapply(point,pp,length) > tst[1:10] p1 p10 p100 p1000 p1001 p1002 p1003 p1004 p1005 p1006 1 5 1 8 6 5 8 7 4 4 > res=as.data.frame(tst) # I try to transform it > res[1:10,] p1 p10 p100 p1000 p1001 p1002 p1003 p1004 p1005 p1006 1 5 1 8 6 5
2011 Mar 10
1
snp-chip table
Dear R helpers I have a table and i need to make new table table1: sire snp1 snp2 snp3 snp4 snp5 snp6 snp7 snp8 snp9 snp10 snp11 snp12 snp13 snp14 snp15 8877 -1 -1 -1 -1 0 0 -1 -1 -1 0 1 1 1 -1 -1 7765 1 1 1 0 0 0 -1 1 1 1 0 0 0 1 0 8766 1 1 -1 0 -1 -1 0 -1 0 -1 -1 -1 0 1 0 6756 0 1 0 -1 1 -1 -1 0 0 0 0 -1 0 1 1 5644 -1 0 1 -1 0 0 0 0 -1 -1 0 0 0 0 1 I have table2 sire
2009 Feb 27
5
Filtering a dataset's columns by another dataset's column names
Hello all, I hope some of you can come to my rescue, yet again. I have two genetic datasets, and I want one of the datasets to have only the columns that are in common with the other dataset. Here is a toy example (my real datasets have hundreds of columns): Dataset 1: Individual SNP1 SNP2 SNP3 SNP4 SNP5 1 A G T C A 2 T C A G T 3 A C T
2011 Jan 22
1
R TABELS
Hi ihave one table that look like SNP1 SNP2 SNP3 SNP4 SNP5 SIRE1 1 -1 -1 1 -1 SIRE2 1 -1 1 1 1 SIRE3 -1 -1 1 1 0 SIRE4 -1 1 1 0 1 SIRE5 -1 1 -1 -1 1 SIRE6 0 0 0 1 -1 SIRE7 -1 0 -1 1 1 SIRE8 1 -1 NA 0 NA SIRE9 -1 1 1 -1 -1 SIRE10 1 1 1 1 1 table 2 only one line SNP1 SNP2 SNP3 SNP4 SNP5 SIRE100 -1 -1 1
2011 Jan 23
1
SNP IMPUTATION
Hi ihave one table that look like SNP1 SNP2 SNP3 SNP4 SNP5 SIRE1 1 -1 -1 1 -1 SIRE2 1 -1 1 1 1 SIRE3 -1 -1 1 1 0 SIRE4 -1 1 1 0 1 SIRE5 -1 1 -1 -1 1 SIRE6 0 0 0 1 -1 SIRE7 -1 0 -1 1 1 SIRE8 1 -1 NA 0 NA SIRE9 -1 1 1 -1 -1 SIRE10 1 1 1 1 1 table 2 only one line SNP1 SNP2 SNP3 SNP4 SNP5 SIRE100 -1 -1 1 1 -1 I need to male
2009 Sep 01
1
permutation and reshuffling
Hi, I'm looking for an efficient code that will enable me to reshuffle data (phenotype) for certain number of individuals and creating a loop that will randomly simulate it for 10000 times *(permutation)*. I also need to find how I keep the information (p value for each SNP) gathered for all the 10000 iterations. My data set looks like this (n=500): Individual # Phenotype SNP1 SNP2
2013 Jul 02
2
Recoding variables based on reference values in data frame
I'm new to R (previously used SAS primarily) and I have a genetics data frame consisting of genotypes for each of 300+ subjects (ID1, ID2, ID3, ...) at 3000+ genetic locations (SNP1, SNP2, SNP3...). A small subset of the data is shown below: SNP_ID SNP1 SNP2 SNP3 SNP4 Maj_Allele C G C A Min_Allele T A T G ID1 CC GG CT AA ID2 CC GG CC AA ID3 CC GG nc AA
2011 Jan 03
0
Using PCA to correct p-values from snpMatrix
Hi R-help folks, I have been doing some single SNP association work using snpMatrix. This works well, but produces a lot of false positives, because of population structure in my data. I would like to correct the p-values (which snpMatrix gives me) for population structure, possibly using principle component analysis (PCA). My data is complicated, so here's a simple example of what
2008 May 13
2
array dimension changes with assignment
Why does the assignment of a 3178x93 object to another 3178x93 object remove the dimension attribute? > GT <- array(dim = c(6,nrow(InData),ncol(InSNPs))) > dim(GT) [1] 6 3178 93 > SNP1 <- InSNPs[InData[,"C1"],] > dim(SNP1) [1] 3178 93 > SNP2 <- InSNPs[InData[,"C2"],] > dim(SNP2) [1] 3178 93 > dim(pmin(SNP1,SNP2)) [1] 3178 93
2009 Mar 26
4
same value in column-->delete
Hi Readers, I have a question. I have a large dataset and want to throw away columns that have the same value in the column itself and I want to know which column this was. For example > x<-data.frame(id=c(1,2,3), snp1=c("A","G", "G"),snp2=c("G","G","G"),snp3=c("G","G","A"))
2009 Mar 20
1
reshape dataframe
Hi, I have a large dataset on which I would like to do the following: x<-data.frame(id=c(1,2,3), snp1=c("AA","GG", "AG"),snp2=c("GG","AG","GG"),snp3=c("GG","AG","AA")) > x id snp1 snp2 snp3 1 1 AA GG GG 2 2 GG AG AG 3 3 AG GG AA And then
2014 Jun 03
2
LIbvirt Python Snapshot -Domain Crashing
Hi, I'm using libvirt(1.0.0) with python, for managing virtual machines.. but while taking multiple snapshot domain is crashing... Snapshot XML ------------------------- <domainsnapshot> <name>snp1</name> <creationTime></creationTime> <description>Description</description> <state></state> <domain>
2010 Aug 05
2
questions about string handling
Hi, I have a question about the data handling. I have a dataset as following: ID snp1 snp2 snp3 1001 0/0 1/1 1/1 1002 2/2 3/3 1/1 1003 4/4 3/3 2/2 I want to convert the dataset to the following format: ID snp1 snp2 snp3 1001 00 AA AA 1002 GG
2010 Aug 05
2
a question about 'read.table' with or without 'read.table'.(urgent)
Hi, I've got a quite tricky question. I have a txt file, named 'temp.txt', as the following: snp1 snp2 snp3 AA 00 00 GG GG 00 00 AA 00 I want to read the file into R. 1) when I use 'read.table' without 'header=T' option, > temp <- read.table('temp.txt') # I got > temp V1
2011 Jul 27
1
SNP Tables
Hello, I have indicators for the present of absent of a snps in columns and the categorey (case control column). I would like to extract ONLY the tables and the indices (SNPS) that give me 2 x 3 tables. Some gives 2x 2 tables when one of the allelle is missing. The data look like the matrix snpmat below: so the first snp should give me the following table: (aa=0, Aa=1 and AA=2) aa
2007 May 07
1
Predicted Cox survival curves - factor coding problems..
The combination of survfit, coxph, and factors is getting confused. It is not smart enough to match a new data frame that contains a numeric for sitenew to a fit that contained that variable as a factor. (Perhaps it should be smart enough to at least die gracefully -- but it's not). The simple solution is to not use factors. site1 <- 1*(coxsnps$sitenew==1) site2 <-
2008 Jan 21
2
reordering huge data file
Dear R-experts, My problem is how to handle a 10GB data file containing genotype data. The file is in a particular format (Illumina final report) and needs to be altered and merged with phenotype data for further analysis. PERL seems to be an frequently used solution for this type of work, however I am inclined to think it should be doable with R. How do I open a text-file, line by line,
2003 Feb 12
0
Issues regarding multiple NAT'd ssh servers
Greetings- I recall mention of this bug at some tim elast year, but do not know if anything yet has been done to address it or if it is in any plans... I have multiple servers behind a f/w. I have ports forwarded on the f/w which map to port 22 on the various servers. i.e. ssh -p1001 FIREWALL_IP ---> NON_ROUTEABLE:22 This is fine, but the hostkey negotiation obviously fails... Is
2008 May 08
1
scrime Package simulatedSNP function
Hello, I need some help with the simulatedSNPs function from scrime package. I am trying to simulate some genotype of a case/control disease locus. The allele frequence are cases/controls Sample cases controls 2000 .5 .10 1500 .6 .40 In each of the row, i need to simulate 100 snp and calculate the pvalue ##############Download Scrime
2014 Jun 03
1
Re: LIbvirt Python Snapshot -Domain Crashing
Martin, Actually domain doesn't crash, but it state got changed to pause, till the snapshot action got complete. Its a time consuming process. so here is my requirement. I want to take snapshot of a Running Domain on every 3 hours,( at a later point of time It should be possible to revert to the actual domain state when I took the snapshot), I'm using libvirt APIs python bindings, there