Displaying 1 result from an estimated 1 matches for "7tcggcc".
Did you mean:
10tcggcc
2009 Jan 19
1
Deleting columns where the frequency of values are too disparate
...rom the set of A,T,C, G). For example, at Locus1 individuals have have either the A or T allele only; at Locus2 the individuals can have either C or G only; at Locus3 the individuals can have either T or G only.
IDLocus1Locus2Locus3Locus4Locus5Locus6
1AGTAAC
2AGGACC
3ACGGCC
4ACGGCC
5AGGGAC
6TGGGCC
7TCGGCC
8TCGGAC
9TGGGCC
10TCGGCC
11AGGGAC
12ACGGCC
13AGGGCC
14AGGGAC
15ACGGCC
16TCGGCC
17TGGGAC
18TGGGCC
19TGGGCC
20TCGGAC
I want to delete any columns from the dataset where the rarer of the two alleles has a frequency of ten percent or less. In other words, I would like to delete Locus3, Locus4, and Loc...