Displaying 20 results from an estimated 900 matches similar to: "SNPRelate: Plink conversion"
2011 Jun 21
4
Re; Getting SNPS from PLINK to R
I a using plink on a large SNP dataset with a .map and .ped file.
I want to get some sort of file say a list of all the SNPs that plink is
saying that I have. ANyideas on how to do this?
--
Thanks,
Jim.
[[alternative HTML version deleted]]
2011 Jun 21
1
Getting SNPS from PLINK to R
snpMatrix package is quite nice (read.plink())
2013 Jan 22
2
SNPRelate package error
Dear,
I am using the R package SNPRelate but I found an error when I run the
following command. Do you know what might be the problem? Thanks in
advance.
> vcf.fn <- system.file("extdata","str.vcf",package="SNPRelate")
> snpgdsVCF2GDS(vcf.fn,"test.gds")
Start snpgdsVCF2GDS ...
Extracting bi-allelic and polymorhpic SNPs.
2007 Feb 05
3
RSNPper SNPinfo and making it handle a vector
If I run an analysis which generates statistical tests on many SNPs I would
naturally want to get more details on the most significant SNPs. Directly
from within R one can get the information by loading RSNPer (from
Bioconductor) and simply issuing a command SNPinfo(2073285). Unfortunately,
the command cannot handle a vector and therefore only wants to do one at a
time.
I tried the lapply and
2006 Apr 06
4
Reshaping genetic data from long to wide
Bottom Line Up Front: How does one reshape genetic data from long to wide?
I currently have a lot of data. About 180 individuals (some
probands/patients, some parents, rare siblings) and SNP data from 6000 loci
on each. The standard formats seem to be something along the lines of Famid,
pid, fatid, motid, affected, sex, locus1Allele1, locus1Allele2,
locus2Allele1, locus2Allele2, etc
In other
2013 Oct 03
1
prcomp - surprising structure
Hello,
I did a pca with over 200000 snps for 340 observations (ids). If I plot the
eigenvectors (called rotation in prcomp) 2,3 and 4 (e.g. plot
(rotation[,2]) I see a strange "column" in my data (see attachment). I
suggest it is an artefact (but of what?).
Suggestion:
I used prcomp this way: prcomp (mat), where mat is a matrix with the column
means already substracted followed by a
2012 Mar 14
3
Needing a better solution to a lookup problem.
I have a solution (actually a few) to this problem, but none are computationally efficient enough to be useful. I'm hoping someone can enlighten me to a better solution.
I have data frame of chromosome/position pairs (along with other data for the location). For each pair I need to determine if it is with in a given data frame of ranges. I need to keep only the pairs that are within any of
2008 Jul 25
1
Plink bed files
Hi All,
does anyone know how to import binary .bed files generated by Plink (http://pngu.mgh.harvard.edu/~purcell/plink/
) into R? the Plink FAQ explains how to conver other types of files,
not the .bed.
Cheers,
Federico
--
Federico C. F. Calboli
Department of Epidemiology and Public Health
Imperial College, St. Mary's Campus
Norfolk Place, London W2 1PG
Tel +44 (0)20 75941602 Fax
2007 Feb 14
1
Rsync with Plink?
Hello,
I've already tried asking at the cwrRsync forums with no luck yet, so I
thought I would try my luck here.
According to the rsync man pages, with the -e option you can use other
remote shells, and I had thought plink was one of them, but I could be
wrong. I can't seem to find any information anywhere about the proper
syntax. What I've been trying, are all kinds of combination
2012 Oct 23
1
factor or character
Hi,
The program below work very well.
(snps = c('rs621782_G', 'rs8087639_G', 'rs8094221_T', 'rs7227515_A',
'rs537202_C'))
Selec = todos[ , colnames(todos) %in% snps]
head(Selec)
But, I have a data set with 1.000 columns and I need extract 70 to use
(like snps in command above).
This 70 snps are in a file. So I create a file to extract them with
2012 Sep 12
1
SNPRelate package error
Dear all,
I am using the R package SNPRelate but I found an error when I run the following command. Do you know what might be the problem? Thanks in advance.
> vcf.fn <- system.file("extdata", "sequence.vcf", package="SNPRelate")
> snpgdsVCF2GDS(vcf.fn, "test.gds")
Start snpgdsVCF2GDS ...
Open
2007 Jan 21
2
efficient code. how to reduce running time?
Hi,
I am new to R.
and even though I've made my code to run and do what it needs to .
It is taking forever and I can't use it like this.
I was wondering if you could help me find ways to fix the code to run
faster.
Here are my codes..
the data set is a bunch of 0s and 1s in a data.frame.
What I am doing is this.
I pick a column and make up a new column Y with values associated with that
2010 Feb 12
1
"drop if missing" command?
This will probably seem very simple to experienced R programmers:
I am doing a snp association analysis and am at the model-fitting stage. I
am using the Stats package's "drop1" with the following code:
##geno is the dataset
## the dependent variable (casectrln) is dichotomous and coded 0,1
## rs743572_2 is one of the snps (which is coded 0,1,2 for the 3 genotypes)
2005 Apr 05
2
cat bailing out in a for loop
Dear All,
I am trying to calculate the Hardy-Weinberg Equilibrium p-value for 42
SNPs. I am using the function HWE.exact from the package "genetics".
In order not to do a lot of coding "by hand", I have a for loop that
goes through each column (each column is one SNP) and gives me the
p.value for HWE.exact. Unfortunately some SNP have reached fixation and
HWE.exact requires a
2011 Feb 03
1
bug in codetools/R CMD check?
Hi Mr Tierney,
I have noticed an error message from R 1.12.x's CMD check for a while (apparently prof Ripley completely rewrote CMD check in R 1.12+)
e.g.:
http://bioconductor.org/checkResults/2.7/bioc-LATEST/snpMatrix/lamb2-checksrc.html
----------------
* checking R code for possible problems ... NOTE
Warning: non-unique value when setting 'row.names': ?new?
Error in
2012 Mar 12
1
Speeding up lots of calls to GLM
Dear useRs,
First off, sorry about the long post. Figured it's better to give context
to get good answers (I hope!). Some time ago I wrote an R function that
will get all pairwise interactions of variables in a data frame. This
worked fine at the time, but now a colleague would like me to do this with
a much larger dataset. They don't know how many variables they are going to
have in the
2008 Jan 21
2
reordering huge data file
Dear R-experts,
My problem is how to handle a 10GB data file containing genotype data. The file is in a particular format (Illumina final report) and needs to be altered and merged with phenotype data for further analysis.
PERL seems to be an frequently used solution for this type of work, however I am inclined to think it should be doable with R.
How do I open a text-file, line by line,
2014 Jul 21
1
Multiple versions of data in a package
Dear R-devel,
I am writing for help on how I should include parallel sets of data in
my package.
Brief summary: I am new to using data within packages. I want a user to
be able to specify one of two alternative versions of within-package
datasets to use, and I want to load just that one. I have a solution
that works, but it doesn't seem as simple as it should be from a user's
2011 Dec 09
1
minor allele frequency comparison
Hi all,
We are using two methods to identify SNPs. One is based on resequencing
the genome and aligning the reads to the sequenced genome to identify SNPs
(data available for 44 individuals). Another is based on SNP array with
selected loci (30000 loci, 870 individuals). I want to compare the results
from the resequencing based minor allele frequency and Array based minor
allele frequency.
2011 Jul 27
1
SNP Tables
Hello,
I have indicators for the present of absent of a snps in columns and the
categorey (case control column). I would like to extract ONLY the tables and
the indices (SNPS) that give me 2 x 3 tables. Some gives 2x 2 tables when
one of the allelle is missing. The data look like the matrix snpmat below:
so the first snp should give me the following table: (aa=0, Aa=1 and AA=2)
aa