search for: alphabetfrequency

Displaying 4 results from an estimated 4 matches for "alphabetfrequency".

2008 Jul 15
5
counting number of "G" in "TCGGGGGACAATCGGTAACCCGTCT"
Is there any better solution than this? sum(strsplit("TCGGGGGACAATCGGTAACCCGTCT", "")[[1]] == "G")
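For comparison, here is a minimal base R sketch of two other ways to get the same count. The helper name `count_letter` is hypothetical and only for illustration. If the Biostrings package is available, its `letterFrequency()` and `alphabetFrequency()` functions are likely the more natural fit for nucleotide data, but they are not used here so that the example stays self-contained.

```r
seq_str <- "TCGGGGGACAATCGGTAACCCGTCT"

# Approach from the question: split into single characters and count matches.
n_split <- sum(strsplit(seq_str, "")[[1]] == "G")

# Alternative 1: delete every character that is not "G", then measure the rest.
n_gsub <- nchar(gsub("[^G]", "", seq_str))

# Alternative 2 (hypothetical helper): count literal matches per element,
# so it also works on a whole character vector of sequences.
count_letter <- function(x, letter) {
  lengths(regmatches(x, gregexpr(letter, x, fixed = TRUE)))
}
n_regex <- count_letter(seq_str, "G")
```

All three approaches should agree on a single string; the second alternative is the one that extends to many sequences without an explicit loop.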
2013 Apr 19
1
Sequence analysis
Hiya, I am trying to look at the similarities between a number of sequences. For example, I am trying to see how similar "ababbbassdaa" is to "addffggssbbsbbs". I was wondering whether there is some way for me to see how similar they are in terms of, for example, the number of a's, the number of b's, how often a and ab are consecutive, how often abab occurs together, etc. Any advice
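One way to read this question is as a comparison of overlapping substring (k-mer) counts. The sketch below is an assumption about what the poster wants, not an answer taken from the thread: `kmer_counts` and `kmer_similarity` are hypothetical helpers written in base R, and cosine similarity is just one of several reasonable ways to compare two count vectors.

```r
# Count every overlapping substring of length k in a single string.
kmer_counts <- function(x, k = 1) {
  n <- nchar(x)
  if (n < k) return(integer(0))
  starts <- seq_len(n - k + 1)
  tab <- table(substring(x, starts, starts + k - 1))
  setNames(as.integer(tab), names(tab))
}

# Cosine similarity between the k-mer count vectors of two strings.
# Returns NaN if either string is shorter than k.
kmer_similarity <- function(x, y, k = 1) {
  cx <- kmer_counts(x, k)
  cy <- kmer_counts(y, k)
  keys <- union(names(cx), names(cy))
  vx <- cx[keys]; vx[is.na(vx)] <- 0
  vy <- cy[keys]; vy[is.na(vy)] <- 0
  sum(vx * vy) / sqrt(sum(vx^2) * sum(vy^2))
}

s1 <- "ababbbassdaa"
s2 <- "addffggssbbsbbs"

letters_s1 <- kmer_counts(s1, 1)   # single-letter counts
pairs_s1   <- kmer_counts(s1, 2)   # includes how often "ab" occurs
sim_letters <- kmer_similarity(s1, s2, k = 1)
sim_pairs   <- kmer_similarity(s1, s2, k = 2)
```

Raising `k` makes the comparison stricter, since longer substrings such as "abab" have to match exactly.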
2012 Apr 26
2
Memoize and vectorize a custom function
My goal is simple: calculate GC content of each sequence in a list of nucleotide sequences. I have figured out how to vectorize, but all my attempts at memoization failed. Can you show me how to properly memoize my function? There is a StackOverflow post on the subject of memoization, but it does not help me: http://stackoverflow.com/questions/7262485/options-for-caching-memoization-hashing-in-r
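As a rough illustration of the idea, the following base R sketch memoizes a per-sequence GC function with a closure over an environment and then vectorizes it with `vapply`. The names `gc_content`, `memoize`, `gc_memo` and `gc_all` are hypothetical and do not come from the original post. The sketch assumes non-empty sequences short enough to serve as environment keys (R limits variable names to 10,000 bytes); for longer sequences a hashed key, or the `memoise` package, may be a better choice.

```r
# GC fraction of one sequence (hypothetical helper).
gc_content <- function(s) {
  chars <- strsplit(toupper(s), "")[[1]]
  sum(chars %in% c("G", "C")) / length(chars)
}

# Wrap a one-argument function so repeated inputs are looked up, not recomputed.
memoize <- function(f) {
  cache <- new.env(hash = TRUE, parent = emptyenv())
  function(key) {
    if (!exists(key, envir = cache, inherits = FALSE)) {
      assign(key, f(key), envir = cache)
    }
    get(key, envir = cache, inherits = FALSE)
  }
}

gc_memo <- memoize(gc_content)

# Vectorized front end: one numeric result per input sequence.
gc_all <- function(seqs) {
  vapply(seqs, gc_memo, numeric(1), USE.NAMES = FALSE)
}

gc_values <- gc_all(c("GGCC", "ATAT", "GGCC", "ATGC"))
```

The cache lives as long as `gc_memo` does, so duplicate sequences across separate calls to `gc_all` are also served from the cache.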
2009 Jun 01
1
installing sn package
...m> Subject: Re: [R] Loop avoidance and logical subscripts
Thank you! The script is now adapted to Biostrings and it is really fast! For example, it does:
  alph_sequence <- alphabetFrequency(data$sequence, baseOnly=TRUE)
  data$GCsequence <- rowSums(alph_sequence[,c("G", "C")]) / rowSums(alph_sequence)
in the G+C computation. It also works amazingly fast in substring extraction (substring), reverse complement (reverseComplement sequences), palindromes search (fi...
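To make the shape of that computation concrete without requiring Biostrings, the sketch below builds a comparable per-sequence count matrix in base R and applies the same `rowSums` ratio. The helper `base_frequency` is hypothetical and only imitates the layout of a count matrix with A, C, G, T and "other" columns; it is not the Biostrings implementation and will be much slower on large inputs.

```r
# Hypothetical stand-in: one row per sequence, one column per base plus "other".
base_frequency <- function(seqs, bases = c("A", "C", "G", "T")) {
  counts <- t(vapply(seqs, function(s) {
    chars <- strsplit(toupper(s), "")[[1]]
    per_base <- vapply(bases, function(b) sum(chars == b), numeric(1))
    c(per_base, other = length(chars) - sum(per_base))
  }, numeric(length(bases) + 1), USE.NAMES = FALSE))
  colnames(counts) <- c(bases, "other")
  counts
}

data <- data.frame(sequence = c("GGCC", "ATGC", "AATT"),
                   stringsAsFactors = FALSE)

alph_sequence <- base_frequency(data$sequence)

# Same ratio as in the post: G + C counts over the total count per row.
data$GCsequence <- rowSums(alph_sequence[, c("G", "C"), drop = FALSE]) /
  rowSums(alph_sequence)
```

Because the denominator sums every column, characters outside A, C, G and T still count toward the sequence length in this sketch.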