thr3ads.net - similar to: "resampling from distributions"

Displaying 20 results from an estimated 900 matches similar to: "resampling from distributions"

Deleting columns where the frequency of values are too disparate

2009 Jan 19

Deleting columns where the frequency of values are too disparate

Hello R-help community, I have another question about filtering datasets. Please consider the following "toy" data matrix example, called "x" for simplicity. There are 20 different individuals ("ID"), with information about the alleles (A,T, G, C) at six different loci ("Locus1" - "Locus6") for each of these 20 individuals. At any single locus

Fastest way to do HWE.exact test on 100K SNP data?

2006 Jun 05

Fastest way to do HWE.exact test on 100K SNP data?

Hi everyone, I'm using the function 'HWE.exact' of 'genetics' package to compute p-values of the HWE test. My data set consists of ~600 subjects (cases and controls) typed at ~ 10K SNP markers; the test is applied separately to cases and controls. The genotypes are stored in a list of 'genotype' objects, all.geno, and p-values are calculated inside the loop over all

miss.loc function in MCMC Geneland: can't make it work

2010 May 31

miss.loc function in MCMC Geneland: can't make it work

I am trying to use the function 'filter.NA=TRUE' in Geneland. The function appears to be set on TRUE by default, as it appears as TRUE in the 'parameter.txt' file output and hence I do not need to enter the function per se (as it is an 'Unused argument otherwise') . Hence all my missing data (individuals that I have not yet scored at that specific loci) are scored as

problem whit Geneland

2009 Jan 13

problem whit Geneland

I do the these passages: library(Geneland) set.seed(1) data <- simdata(nindiv=200, coord.lim=c(0,1,0,1) , number.nuclei=5 , allele.numbers=rep(10,20), IBD=FALSE, npop=2, give.tess.grid=FALSE) geno <- data$genotypes coord <- t(data$coord.indiv) path.mcmc <-

ideas to speed up code: converting a matrix of integers to a matrix of normally distributed values

2007 Mar 16

ideas to speed up code: converting a matrix of integers to a matrix of normally distributed values

Hi all, [this is a bit hard to describe, so if my initial description is confusing, please try running my code below] #WHAT I'M TRYING TO DO I'd appreciate any help in trying to speed up some code. I've written a script that converts a matrix of integers (usually between 1-10,000 - these represent allele names) into two new matrices of normally distributed values (representing

Rd-file error: non-ASCII input and no declared encoding

2010 Sep 01

Rd-file error: non-ASCII input and no declared encoding

Dear list, I came across the following error for three of my newly written Rd-files: non-ASCII input and no declared encoding I can't make sense of this. Below I copied in one of the three files. Can anybody please tell me what's wrong with it? Thank you, Christian \name{tetragonula} \alias{tetragonula} \alias{tetragonula.coord} \docType{data} % \non_function{} \title{Microsatellite

Scripting - query

2008 Aug 10

Scripting - query

I have a vector: alleles.present<-c("D3", "D16", ... ) The alleles present changes given the case I'm dealing with - i.e. either all of the alleles I use for my calculations are present, or some of them. Depending on what alleles are present, I need to make matrices and do calculations on those alleles present and completely disregard any formula or other use of the

R coding to extract allele frequencies from NCBI for ALL alleles of one SNP?

2024 Nov 15

R coding to extract allele frequencies from NCBI for ALL alleles of one SNP?

Dear All, The following code extracts from NCBI very nice output for ONE allele of a SNP (often the allele with the second largest frequency - usually termed the minor allele). It gives an average minor allele frequency from all NCBI sources (which is what I want, except I'd like the addition of data for all the other alleles of one SNP) plus a table of minor allele frequencies from each

Failed when join to an existing Active Directory Domain

2019 Oct 11

Failed when join to an existing Active Directory Domain

Hi, I've tried to update my samba AD/DC environment. Then, I've removed a existing offline DC with "samba-tool domain demote --remove-other-dead-server=genos". I've re-created "genos" (yes, I try to keep the same name and IP address) and install a 4.10.2 samba version (I know the new version is 4.11.0). When I've tried to join it on my domain, I've received

Allelic Differentiation, sampling, unique(), duplicated()

2003 Sep 04

Allelic Differentiation, sampling, unique(), duplicated()

Hi people, I have made some progress trying to work out how to solve this problem but I have got a bit stuck - sorry if this turns out to be a simple exercise . . Allelic Differentiation (AD) in genetics measures the number of different alleles between (say) two populations eg: Organisms in Pop 1 have alleles: a, b, c, d, e Organisms in Pop 2 have alleles: b, b, c, d, e Different

What method I should to use for these data?

2005 Jul 07

What method I should to use for these data?

Dear R user: I am studying the allele data of two populations. the following is the data: a1 a2 a3 a4 a5 a6 a7 a8 a9 a10 a11 a12 a13 a14 a15 a16 a17 pop1 0.0217 0.0000 0.0109 0.0435 0.0435 0.0000 0.0109 0.0543 0.1739 0.0761 0.1413 0.1522 0.1087 0.0870 0.0435 0.0217 0.0109 pop2 0.0213 0.0213 0.0000 0.0000 0.0000 0.0426 0.1702 0.2128 0.1596 0.1809 0.0957 0.0745 0.0106

help with melt/cast in reshape-package

2012 May 21

help with melt/cast in reshape-package

I'm sorry everyone for the inconvenience of spamming the R-help... Here's the complete post: Hi everyone, > > Since it's quite a while that I used the reshape package, I now feel kind > of rusty. > > I have a data.frame like this: > > > > id Sample.Name Marker Allele.1 > Allele.2 sample_id species

Help create a loopto conduct multiple pairwise operations

2007 Sep 21

Help create a loopto conduct multiple pairwise operations

#Hello, #I have three data frames, X,Y and Z with two columns each and different numbers of rows. # creation of data frame X X.alleles <- c(1,5,6,7,8) X.Freq <- c(0.35, 0.15, 0.05 , 0.10, 0.35) Loc1 <- cbind( X.alleles,X.Freq) X <- data.frame(Loc1) #creation of data frame Y Y.alleles <- c(1,4,6,8) Y.Freq <- c(0.35, 0.35, 0.10, 0.20 )

stats 'dist' euclidean distance calculation

2018 Mar 15

stats 'dist' euclidean distance calculation

Hello, I am working with a matrix of multilocus genotypes for ~180 individual snail samples, with substantial missing data. I am trying to calculate the pairwise genetic distance between individuals using the stats package 'dist' function, using euclidean distance. I took a subset of this dataset (3 samples x 3 loci) to test how euclidean distance is calculated: 3x3 subset used

How to multiply all dataframe rows by another dataframe's columns

2007 Aug 30

How to multiply all dataframe rows by another dataframe's columns

Hello, I have two data frames, X and Y, with two columns each and different numbers of rows. # creation of data frame X Loc1.alleles <- c(1,5,6,7,8) Loc1.Freq <- c(0.35, 0.15, 0.05, 0.10, 0.35) Loc1 <- cbind( Loc1.alleles,Loc1.Freq) X <- data.frame(Loc1) #creation of data frame Y Loc2.alleles <- c(1,4,6,8) Loc2.Freq <- c(0.35, 0.35,

Recoding variables based on reference values in data frame

2013 Jul 02

Recoding variables based on reference values in data frame

I'm new to R (previously used SAS primarily) and I have a genetics data frame consisting of genotypes for each of 300+ subjects (ID1, ID2, ID3, ...) at 3000+ genetic locations (SNP1, SNP2, SNP3...). A small subset of the data is shown below: SNP_ID SNP1 SNP2 SNP3 SNP4 Maj_Allele C G C A Min_Allele T A T G ID1 CC GG CT AA ID2 CC GG CC AA ID3 CC GG nc AA

Genotypes are not all the same

2006 Dec 29

Genotypes are not all the same

I have been merrily using the genetics package and more specifically have been using the makeGenotypes and genotypes function. I check my accomplishments by going > class(g2) [1] "genotype" "factor" and likewise > class(g1) [1] "genotype" "factor" Yet when I execute a command such as allele count I get this > allele.count(g1) D I [1,]

Vector searching and counting speed optimization

2006 May 03

Vector searching and counting speed optimization

R-users, I'm seeking any suggestions on optimizing some code for speed. Here's the setup: the code below is part of a larger chunk that is calculating Fst values across loci and alleles. This chunk is designed to calculate the proportion ('p.a') of an allele ('a') at a locus in each population ('p') and the proportion of individuals heterozygous for that

cat bailing out in a for loop

2005 Apr 05

cat bailing out in a for loop

Dear All, I am trying to calculate the Hardy-Weinberg Equilibrium p-value for 42 SNPs. I am using the function HWE.exact from the package "genetics". In order not to do a lot of coding "by hand", I have a for loop that goes through each column (each column is one SNP) and gives me the p.value for HWE.exact. Unfortunately some SNP have reached fixation and HWE.exact requires a

Incomplete Trio in TDT analysis

2006 Apr 27

Incomplete Trio in TDT analysis

I am involved in a study where, as in most of life, men demonstrate themselves to be recalcitrant. So while we have many probands and most of their mothers we only have about 50% of the trios being complete. I have been running tdt and trio.types. It appears as if it is ignoring the duos. Sometimes a duo can be informative. For instance Father ..missing Mother 1/2 Proband 1/1 This duo shows that

similar to: resampling from distributions