2008 Mar 15
Appending new values to an existing factor vector
...I'm trying to read in [genotype
data] files that have around 80,000,000 lines and 4 fields, with a high proportion
of repeated strings. Here's a sample:
rsXXXXXXX SAMPLE0001 CG 0.05302
rsXXXXXX SAMPLE0001 CC 0.06817
rsXXXXXXXX SAMPLE0001 CC 0.01369
rsXXXXXXY SAMPLE0001 GG 0.01816
rsXXXXXXZ SAMPLE0001 GG 0.006711
rsXXXXXXX SAMPLE0002 GG 0.05813
[For the purpose of the work I'm doing at the moment, I don't care about the
last column]
What's the best way to read in these data?
My understanding...