similar to: about the analysis of strings, thanks

Displaying 20 results from an estimated 2000 matches similar to: "about the analysis of strings, thanks"

2006 Jun 18
2
analyze amino acid sequence (composition)of proteins
Dear R-helpers: thank your for your attention. i am a newer to R and i am doing some protein category classification based on the amino acid sequence.while i have some questions urgently. 1. any packages for analysis amino acid sequence 2. given two sequences "AAA" and "BBB",how can i combine them into "AAABBB" 3. based on "AAABBB",how can i get some
2010 Jan 17
6
More than on loop??
hello every one, How to function more than one loop in R? I have the following problem to be solved with the a method of three loops, can you help me please? The data is attached with this message. The data is composed of two parts, cleaved (denoted by ?cleaved?) and non cleaved (denoted by ?noncleaved?). ? to access to the ith peptide, you can use X$Peptide[i] ? to access to the ith label,
2005 Jan 06
1
Calculating a table of symbol frequencies
Hello all: I have a protein sequence alignment in a data frame (align1, 72 x 236), where each row is a protein and each column a site in the alignment. AA is vector of amino acid symbols plus "-" (gap). I can calculate amino acid frequencies at each site by: >align1.F <- matrix(0,nrow=22,ncol=236,dimnames=list(AA,seq(1:236))) >for(i in 1:236) >
2012 Jul 23
3
How to do the same thing for all levels of a column?
Dear all, I am a R beginner, and I am looking for a way to do the same thing for all levels of a column in a table. Basically, I have a bunch of protein sequences composed of different amino acid residues, and each residue is represented by an uppercase letter. I want to calculate the ratio of different amino acid residues at each position of the proteins. Here is an example table: Proteins
2005 Jan 03
1
Calculating symbol (letter) frequencies
Hello: I am attempting to use R to analyze amino acid frequencies in aligned protein sequences and need some help. So far, I have imported my sequence alignment into a data frame (lets call it "alignment") with each site in one column, so that I have a data frame consisting of columns of letters (the 21 amino acid symbols plus "-") with row names being the corresponding
2018 May 03
0
Package for Molecular Properties
library(sos) (mp <- findFn('{molecular properties}')) ????? ** found 7 matches in 4 packages and opened two web pages in my default browser with (a) the 7 matches and (b) the 4 packages. The first function was something for amino acids, like you suggested.? Two others returned compound and substance information from PubChem. ????? Does this help? ????? Spencer On
2009 Sep 21
1
Pattern Matching within Vector?
Dear mailing list, I'm stuck with a tricky problem here - at least it seems tricky to me, being not really talented in pattern matching and regex matters. I'm analysing amino acid mutations by position and type of mutation. E.g. (fictitious example) in position 92, I can find L92V, L92MV, L92I... L is in this example the wild-type amino-acid, and everything behind the position number is
2010 Jan 02
4
caculate the frequencies of the Amino Acids
may some one please help me to sort this out, i am trying to writ a R code for calculating the frequencies of the amino acids in 9 different sequences, i want the code to read the sequence from external text file, i used the following code to do so: x<-read.table("sequence.txt",header=FALSE) then i defined an array for 20 amino acids as following:
2018 May 03
3
Package for Molecular Properties
All Is there a package or library that will, given a nucleotide sequence 1. calculate the extinction coefficient at 260 nm for (Beer-Lambert's law) 2. calculate molecular weight 3. return it's complementary sequence I was able to find several packages that can do similar calculations for an amino acid sequence for proteins but none for nucleic acids. Any pointers, etc. would be
2017 Aug 04
1
legend and values do not match in ggplot
I have following codes for ggplots. The legends are given in the plot do not match with the values specified in the codes given below. Your helps highly appreciated. Greg library(ggplot2) p <- ggplot(a,aes(x=NO_BMI_FI_beta ,y=FI_beta ,color= Super.Pathway))+ theme_bw() +theme(panel.border=element_blank()) + geom_point(size=3) p2<-p+scale_color_manual(name="Super.Pathway",
2013 Nov 01
1
Package(s) for making waffle plot-like figures?
Dear all, I am trying to make a series of waffle plot-like figures for my data to visualize the ratios of amino acid residues at each position. For each one of 37 positions, there may be one to four different amino acid residues. So the data consist of the positions, what residues are there, and the ratios of residues. The ratios of residues at a position add up to 100, or close to 100 (more on
2008 Jan 17
4
aaMI
hi i am new to R language. I want to use aaMI package which calculates the amino acid mutual interaction for a given protein sequence. I had installed the package but when i run the program it gives me the error could not find function "aaMI". can anyone tell me what might be the problem.. -- View this message in context: http://www.nabble.com/aaMI-tp14915744p14915744.html Sent from
2018 May 03
1
Package for Molecular Properties
... In addition, you may wish to also post on the Bioconductor list for this sort of thing. -- Bert Bert Gunter "The trouble with having an open mind is that people keep coming along and sticking things into it." -- Opus (aka Berkeley Breathed in his "Bloom County" comic strip ) On Thu, May 3, 2018 at 12:58 AM, Spencer Graves <spencer.graves at effectivedefense.org>
2007 Jun 12
3
Appropriate regression model for categorical variables
Dear users, In my psychometric test i have applied logistic regression on my data. My data consists of 50 predictors (22 continuous and 28 categorical) plus a binary response. Using glm(), stepAIC() i didn't get satisfactory result as misclassification rate is too high. I think categorical variables are responsible for this debacle. Some of them have more than 6 level (one has 10 level).
2011 Jul 15
6
how to order each element according to alphabet
Hi there, I have a large amino acid csv file like this: input.txt: P,LV,Q,Z P,VL,Q,Z P,ML,QL,Z There is a problem with this file, since LV and VL are in fact the same thing. How do I order each element according to alphabetical order so that the desired output would look like: output.txt: P,LV,Q,Z P,LV,Q,Z P,LM,LQ,Z -- View this message in context:
2011 Mar 22
1
Best HMM package to generate random (protein) sequences?
Dear All, I would like to generate random protein sequences using a HMM model. Has anybody done that before, or would you have any idea which package is likely to be best for that? The important facts are that the HMM will be fitted on ~3 million sequential observations, with 20 different states (one for each amino acid). I guess that 2-5 hidden states should be enough, and an order of 3 would
2010 Feb 17
1
Unwanted case sensitivity
I have also posted this on IRC. I have a linux host running stock RHEL 5.4 Samba 3.0.33-3.15. The host acts both as a Samba server and does a CIFS mount of that same share. The reason for doing this is so that programs running on the Linux host have the same case insensitive view as the Windows clients. I have nocase set in the relevant line in /etc/fstab I have case sensitive = No set in the
2017 Jun 14
8
[WISH / PATCH] possibility to split string literals across multiple lines
Hi, I would really like to have a way to split long string literals across multiple lines in R. Currently, if a string literal spans multiple lines, there is no way to inhibit the introduction of newline characters: > "aaa + bbb" [1] "aaa\nbbb" If a line ends with a backslash, it is just ignored: > "aaa\ + bbb" [1] "aaa\nbbb" We could use
2006 Feb 01
2
memory limit in aov
I want to do an unbalanced anova on 272,992 observations with 405 factors including 2-way interactions between 1 of these factors and the other 404. After fitting only 11 factors and their interactions I get error messages like: Error: cannot allocate vector of size 1433066 Kb R(365,0xa000ed68) malloc: *** vm_allocate(size=1467461632) failed (error code=3) R(365,0xa000ed68) malloc: ***
2017 Jun 14
4
[WISH / PATCH] possibility to split string literals across multiple lines
On Wed, 14 Jun 2017 06:12:09 -0500, Duncan Murdoch <murdoch.duncan at gmail.com> wrote: > On 14/06/2017 5:58 AM, Andreas Kersting wrote: > > Hi, > > > > I would really like to have a way to split long string literals across > > multiple lines in R. > > I don't understand why you require the string to be a literal. Why not > construct the long