Displaying 1 result from an estimated 1 matches for "cccccaaaa".
2009 Sep 15
3
how to load only lines that start with a particular symbol
Dear all,
I have DNA sequence data which are fasta-formatted as
>gene A;.....
AAAAACCCC
TTTTTGGGG
CCCTTTTTT
>gene B;....
CCCCCAAAA
GGGGGTTTT
I want to load only the lines that start with ">" where the annotation
information for the gene is contained. In principle, I can remove the
sequences before loading or after loading all the lines. I just wonder if
there's a way to load only lines with a particular patte...