similar to: resampling from distributions

Displaying 20 results from an estimated 900 matches similar to: "resampling from distributions"

2009 Jan 19
1
Deleting columns where the frequency of values are too disparate
Hello R-help community, I have another question about filtering datasets. Please consider the following "toy" data matrix example, called "x" for simplicity. There are 20 different individuals ("ID"), with information about the alleles (A,T, G, C) at six different loci ("Locus1" - "Locus6") for each of these 20 individuals. At any single locus
2006 Jun 05
3
Fastest way to do HWE.exact test on 100K SNP data?
Hi everyone, I'm using the function 'HWE.exact' of 'genetics' package to compute p-values of the HWE test. My data set consists of ~600 subjects (cases and controls) typed at ~ 10K SNP markers; the test is applied separately to cases and controls. The genotypes are stored in a list of 'genotype' objects, all.geno, and p-values are calculated inside the loop over all
2010 May 31
0
miss.loc function in MCMC Geneland: can't make it work
I am trying to use the function 'filter.NA=TRUE' in Geneland. The function appears to be set on TRUE by default, as it appears as TRUE in the 'parameter.txt' file output and hence I do not need to enter the function per se (as it is an 'Unused argument otherwise') . Hence all my missing data (individuals that I have not yet scored at that specific loci) are scored as
2009 Jan 13
3
problem whit Geneland
I do the these passages: library(Geneland) set.seed(1) data <- simdata(nindiv=200, coord.lim=c(0,1,0,1) , number.nuclei=5 , allele.numbers=rep(10,20), IBD=FALSE, npop=2, give.tess.grid=FALSE) geno <- data$genotypes coord <- t(data$coord.indiv) path.mcmc <-
2007 Mar 16
1
ideas to speed up code: converting a matrix of integers to a matrix of normally distributed values
Hi all, [this is a bit hard to describe, so if my initial description is confusing, please try running my code below] #WHAT I'M TRYING TO DO I'd appreciate any help in trying to speed up some code. I've written a script that converts a matrix of integers (usually between 1-10,000 - these represent allele names) into two new matrices of normally distributed values (representing
2010 Sep 01
2
Rd-file error: non-ASCII input and no declared encoding
Dear list, I came across the following error for three of my newly written Rd-files: non-ASCII input and no declared encoding I can't make sense of this. Below I copied in one of the three files. Can anybody please tell me what's wrong with it? Thank you, Christian \name{tetragonula} \alias{tetragonula} \alias{tetragonula.coord} \docType{data} % \non_function{} \title{Microsatellite
2008 Aug 10
1
Scripting - query
I have a vector: alleles.present<-c("D3", "D16", ... ) The alleles present changes given the case I'm dealing with - i.e. either all of the alleles I use for my calculations are present, or some of them. Depending on what alleles are present, I need to make matrices and do calculations on those alleles present and completely disregard any formula or other use of the
2019 Oct 11
2
Failed when join to an existing Active Directory Domain
Hi, I've tried to update my samba AD/DC environment. Then, I've removed a existing offline DC with "samba-tool domain demote --remove-other-dead-server=genos". I've re-created "genos" (yes, I try to keep the same name and IP address) and install a 4.10.2 samba version (I know the new version is 4.11.0). When I've tried to join it on my domain, I've received
2003 Sep 04
1
Allelic Differentiation, sampling, unique(), duplicated()
Hi people, I have made some progress trying to work out how to solve this problem but I have got a bit stuck - sorry if this turns out to be a simple exercise . . Allelic Differentiation (AD) in genetics measures the number of different alleles between (say) two populations eg: Organisms in Pop 1 have alleles: a, b, c, d, e Organisms in Pop 2 have alleles: b, b, c, d, e Different
2005 Jul 07
3
What method I should to use for these data?
Dear R user: I am studying the allele data of two populations. the following is the data: a1 a2 a3 a4 a5 a6 a7 a8 a9 a10 a11 a12 a13 a14 a15 a16 a17 pop1 0.0217 0.0000 0.0109 0.0435 0.0435 0.0000 0.0109 0.0543 0.1739 0.0761 0.1413 0.1522 0.1087 0.0870 0.0435 0.0217 0.0109 pop2 0.0213 0.0213 0.0000 0.0000 0.0000 0.0426 0.1702 0.2128 0.1596 0.1809 0.0957 0.0745 0.0106
2007 Sep 21
1
Help create a loopto conduct multiple pairwise operations
#Hello, #I have three data frames, X,Y and Z with two columns each and different numbers of rows. # creation of data frame X X.alleles <- c(1,5,6,7,8) X.Freq <- c(0.35, 0.15, 0.05 , 0.10, 0.35) Loc1 <- cbind( X.alleles,X.Freq) X <- data.frame(Loc1) #creation of data frame Y Y.alleles <- c(1,4,6,8) Y.Freq <- c(0.35, 0.35, 0.10, 0.20 )
2012 May 21
1
help with melt/cast in reshape-package
I'm sorry everyone for the inconvenience of spamming the R-help... Here's the complete post: Hi everyone, > > Since it's quite a while that I used the reshape package, I now feel kind > of rusty. > > I have a data.frame like this: > > > > id Sample.Name Marker Allele.1 > Allele.2 sample_id species
2018 Mar 15
3
stats 'dist' euclidean distance calculation
Hello, I am working with a matrix of multilocus genotypes for ~180 individual snail samples, with substantial missing data. I am trying to calculate the pairwise genetic distance between individuals using the stats package 'dist' function, using euclidean distance. I took a subset of this dataset (3 samples x 3 loci) to test how euclidean distance is calculated: 3x3 subset used
2007 Aug 30
2
How to multiply all dataframe rows by another dataframe's columns
Hello, I have two data frames, X and Y, with two columns each and different numbers of rows. # creation of data frame X Loc1.alleles <- c(1,5,6,7,8) Loc1.Freq <- c(0.35, 0.15, 0.05, 0.10, 0.35) Loc1 <- cbind( Loc1.alleles,Loc1.Freq) X <- data.frame(Loc1) #creation of data frame Y Loc2.alleles <- c(1,4,6,8) Loc2.Freq <- c(0.35, 0.35,
2013 Jul 02
2
Recoding variables based on reference values in data frame
I'm new to R (previously used SAS primarily) and I have a genetics data frame consisting of genotypes for each of 300+ subjects (ID1, ID2, ID3, ...) at 3000+ genetic locations (SNP1, SNP2, SNP3...). A small subset of the data is shown below: SNP_ID SNP1 SNP2 SNP3 SNP4 Maj_Allele C G C A Min_Allele T A T G ID1 CC GG CT AA ID2 CC GG CC AA ID3 CC GG nc AA
2006 Dec 29
1
Genotypes are not all the same
I have been merrily using the genetics package and more specifically have been using the makeGenotypes and genotypes function. I check my accomplishments by going > class(g2) [1] "genotype" "factor" and likewise > class(g1) [1] "genotype" "factor" Yet when I execute a command such as allele count I get this > allele.count(g1) D I [1,]
2006 May 03
1
Vector searching and counting speed optimization
R-users, I'm seeking any suggestions on optimizing some code for speed. Here's the setup: the code below is part of a larger chunk that is calculating Fst values across loci and alleles. This chunk is designed to calculate the proportion ('p.a') of an allele ('a') at a locus in each population ('p') and the proportion of individuals heterozygous for that
2005 Apr 05
2
cat bailing out in a for loop
Dear All, I am trying to calculate the Hardy-Weinberg Equilibrium p-value for 42 SNPs. I am using the function HWE.exact from the package "genetics". In order not to do a lot of coding "by hand", I have a for loop that goes through each column (each column is one SNP) and gives me the p.value for HWE.exact. Unfortunately some SNP have reached fixation and HWE.exact requires a
2006 Apr 27
2
Incomplete Trio in TDT analysis
I am involved in a study where, as in most of life, men demonstrate themselves to be recalcitrant. So while we have many probands and most of their mothers we only have about 50% of the trios being complete. I have been running tdt and trio.types. It appears as if it is ignoring the duos. Sometimes a duo can be informative. For instance Father ..missing Mother 1/2 Proband 1/1 This duo shows that
2006 May 05
1
How to a handle an error in a loop
I am about one step away from heaven on earth. I think only one step! I am using dgc.genetics to run a TDT test on thousands of genetic loci. I have learnt (through the help of others on this mailing list) to send the complex output to useful data frames which in turn allow me to look at the big picture and screen the thousands of loci. Resultdt<-lapply(PGWide[,240:290], tdt) the above