thr3ads.net - similar to: "generic genotype calling algorithm?"

Displaying 20 results from an estimated 3000 matches similar to: "generic genotype calling algorithm?"

2008 Jan 21

reordering huge data file

Dear R-experts, My problem is how to handle a 10GB data file containing genotype data. The file is in a particular format (Illumina final report) and needs to be altered and merged with phenotype data for further analysis. PERL seems to be an frequently used solution for this type of work, however I am inclined to think it should be doable with R. How do I open a text-file, line by line,

Genotype importing from Sequenom

2006 Dec 31

Genotype importing from Sequenom

Sequenom has an odd format of calling a SNP genotype gg [1] "C" "GA" "A" "C" "C" "AG" "C" "C" "T" "G" homozygous A is called A and heterozygous is called AT The genetics package cannot handle the fact that some genotypes are declared with 2 letter while other are declared with only 1.

HWEBayes, swapping the homozygotes genotype frequencies

2011 Oct 08

HWEBayes, swapping the homozygotes genotype frequencies

I evaluated the Bayes factor in the k=2 allele case with a "triangular" prior under the null as in the example in the help file: HWETriangBF2(nvec=c(88,10,2)) [1] 0.4580336 When I swap the n11 entry and n22 entry of nvec, I received totally different Bayes factor: > > HWETriangBF2(nvec=c(2,10,88)) [1] 5.710153 > In my understanding, defining the genotype frequency as

Cannot clean infinite values

2009 Apr 27

Cannot clean infinite values

Hello all, I have to import numeric data from file but found it contains Infinite values which need to be eliminated. I tried to replace them in this way: data[which(data=="-Inf")] <- -0.3 data[which(data=="+Inf")] <- 0.3 But, somehow, the Infinite values stayed there. Any suggestions? regards, N. -- View this message in context:

THE EQUIVALENT OF SQL INNER TABLE JOIN IN R

2009 Apr 26

THE EQUIVALENT OF SQL INNER TABLE JOIN IN R

Hello all, Apologize for the newbie question. What's the easiest way to do a SQL inner table join in R? Say I have a table containing column names A, B, C and another which has columns named C, D, E. I would like to do an inner table join on C and produce a table A, B, C, D, E. thanks a lot, N. -- View this message in context:

Error in sort(abs(diff(genomdat)))[1:n.keep] : only 0's may be mixed with negative subscripts

2012 Jun 12

Error in sort(abs(diff(genomdat)))[1:n.keep] : only 0's may be mixed with negative subscripts

Hello everyone, I`m trying to normalize and analize an illumina SNP array. But when i`m trying to segmentate i`m getting an error: Error in sort(abs(diff(genomdat)))[1:n.keep] : only 0's may be mixed with negative subscripts. I`ve tried everything to fix this but the error still occours. Can anybody give me a tip? Thanks in advance! -- View this message in context:

Dominance in qtl model

2007 Apr 23

Dominance in qtl model

Hi, I'm using R for a QTL analysis of SNP data. I was wondering if anyone had any advice on fitting a dominance effect into the following function; > myfun4 function (x) { x <- scan(con, nmax=169) y <- unique(x[which(!is.na(x))]) if(length(y)>1) { summary(lme(Ad ~ x, random= ~1|sire, na.action="na.omit")) } else {print("no.infomation")} } Con is the

Using something like the "by" command, but on rows instead of columns

2009 Nov 09

Using something like the "by" command, but on rows instead of columns

Hello R Forum users, I was hoping someone could help me with the following problem. Consider the following "toy" dataset: Accession SNP_CRY2 SNP_FLC Phenotype 1 NA A 0.783143079 2 BQ A 0.881714811 3 BQ A 0.886619488 4 AQ B 0.416893034 5 AQ B 0.621392903 6 AS B 0.031719125 7 AS NA 0.652375037 "Accession"

Genotypes are not all the same

2006 Dec 29

Genotypes are not all the same

I have been merrily using the genetics package and more specifically have been using the makeGenotypes and genotypes function. I check my accomplishments by going > class(g2) [1] "genotype" "factor" and likewise > class(g1) [1] "genotype" "factor" Yet when I execute a command such as allele count I get this > allele.count(g1) D I [1,]

Repeating tdt function on thousands of variables

2006 May 02

Repeating tdt function on thousands of variables

I am using dgc.genetics to perform TDT analysis on SNP data from a cohort of trios. I now have a file with about 6008 variables. The first few variables related to the pedigree data such as the pedigree ID the person ID etc. Thereafter each variable is a specific locus or marker. The variables are named by a pattern such as "Genotype.nnnnn" with nnnnn corresponding to a number which

Reshaping genetic data from long to wide

2006 Apr 06

Reshaping genetic data from long to wide

Bottom Line Up Front: How does one reshape genetic data from long to wide? I currently have a lot of data. About 180 individuals (some probands/patients, some parents, rare siblings) and SNP data from 6000 loci on each. The standard formats seem to be something along the lines of Famid, pid, fatid, motid, affected, sex, locus1Allele1, locus1Allele2, locus2Allele1, locus2Allele2, etc In other

Can someone recommend a package for SNP cluster analysis of Fluidigm microarrays?

2012 Jun 14

Can someone recommend a package for SNP cluster analysis of Fluidigm microarrays?

I know that there are quite a few packages out that there for cluster analysis. The problem that I am facing is finding a package that will not incorporate all my samples into clusters but just the samples that fit a threshold (that I have not set yet and may need help finding the right level) for genotyping. It should be able to "no call" samples outside the clusters. It also needs to

Fastest way to do HWE.exact test on 100K SNP data?

2006 Jun 05

Fastest way to do HWE.exact test on 100K SNP data?

Hi everyone, I'm using the function 'HWE.exact' of 'genetics' package to compute p-values of the HWE test. My data set consists of ~600 subjects (cases and controls) typed at ~ 10K SNP markers; the test is applied separately to cases and controls. The genotypes are stored in a list of 'genotype' objects, all.geno, and p-values are calculated inside the loop over all

permutation test - query

2009 Aug 31

permutation test - query

Hi, My query is regarding permutation test and reshuffling of genotype/phenotype data I have been using the haplo.stats package of R. for haplotype analysis and I would like to perform an analysis which I'm requesting your advice. I have a data set of individuals genotyped for 12 SNP and a dichotomous phenotype. At first, I have tested each of those SNP independently in order to bypass

Trouble obtaining results from a loop

2007 Oct 02

Trouble obtaining results from a loop

#Hello, #I have a question about obtaining results from a loop I have written. #Below is a sample of individual genotypes from a genetic question I am working on called "P.genotype.sample ". P.genotype.sample<-matrix(10,10,10) P.genotype.sample[,1]<-c(2,2,1,5,1,1,5,6,1,3) P.genotype.sample[,2]<-c(6,3,3,6,8,1,6,7,2,3) P.genotype.sample[,3]<-c(2,2,2,3,3,2,2,2,3,3)

"drop if missing" command?

2010 Feb 12

"drop if missing" command?

This will probably seem very simple to experienced R programmers: I am doing a snp association analysis and am at the model-fitting stage. I am using the Stats package's "drop1" with the following code: ##geno is the dataset ## the dependent variable (casectrln) is dichotomous and coded 0,1 ## rs743572_2 is one of the snps (which is coded 0,1,2 for the 3 genotypes)

Paste a matrix column in pairwise fashion with other columns?

2007 Sep 26

Paste a matrix column in pairwise fashion with other columns?

#Hello, #I have would like to paste a single column of a matrix # in pair wise fashion with other columns based upon # even and odd column numbers. # I can do it in a very clunky fashion and I know there # must be a better way. below is a sample matrix and my extremely # clunky code that gets the job done for a small matrix, but i plan to # do this on a much grander scale. any help would be very

no convergence using lme

2006 Feb 15

no convergence using lme

Hi. I was wondering if anyone might have some suggestions about how I can overcome a problem of "iteration limit reached without convergence" when fitting a mixed effects model. In this study: Outcome is a measure of heart action Age is continuous (in weeks) Gender is Male or Female (0 or 1) Genotype is Wild type or knockout (0 or 1) Animal is the Animal ID as a factor

Merging two files together in R

2007 Aug 24

Merging two files together in R

Hi, Thanks in advance for reading this post. I received some affymetrix genotyping data back recently (250K, Nsp array)…However, in order for me to do any analysis on this data set, I need to add append the annotation file to it. Basically I want to do something that looks like this: Snpfile(tab delimited): SNPID Genotype X Y 123 AA 13.4 1.2

cat bailing out in a for loop

2005 Apr 05

cat bailing out in a for loop

Dear All, I am trying to calculate the Hardy-Weinberg Equilibrium p-value for 42 SNPs. I am using the function HWE.exact from the package "genetics". In order not to do a lot of coding "by hand", I have a for loop that goes through each column (each column is one SNP) and gives me the p.value for HWE.exact. Unfortunately some SNP have reached fixation and HWE.exact requires a

similar to: generic genotype calling algorithm?