thr3ads.net - similar to: "about the analysis of strings, thanks"

Displaying 20 results from an estimated 2000 matches similar to: "about the analysis of strings, thanks"

analyze amino acid sequence (composition)of proteins

2006 Jun 18

analyze amino acid sequence (composition)of proteins

Dear R-helpers: thank your for your attention. i am a newer to R and i am doing some protein category classification based on the amino acid sequence.while i have some questions urgently. 1. any packages for analysis amino acid sequence 2. given two sequences "AAA" and "BBB",how can i combine them into "AAABBB" 3. based on "AAABBB",how can i get some

More than on loop??

2010 Jan 17

More than on loop??

hello every one, How to function more than one loop in R? I have the following problem to be solved with the a method of three loops, can you help me please? The data is attached with this message. The data is composed of two parts, cleaved (denoted by ?cleaved?) and non cleaved (denoted by ?noncleaved?). ? to access to the ith peptide, you can use X$Peptide[i] ? to access to the ith label,

Calculating a table of symbol frequencies

2005 Jan 06

Calculating a table of symbol frequencies

Hello all: I have a protein sequence alignment in a data frame (align1, 72 x 236), where each row is a protein and each column a site in the alignment. AA is vector of amino acid symbols plus "-" (gap). I can calculate amino acid frequencies at each site by: >align1.F <- matrix(0,nrow=22,ncol=236,dimnames=list(AA,seq(1:236))) >for(i in 1:236) >

How to do the same thing for all levels of a column?

2012 Jul 23

How to do the same thing for all levels of a column?

Dear all, I am a R beginner, and I am looking for a way to do the same thing for all levels of a column in a table. Basically, I have a bunch of protein sequences composed of different amino acid residues, and each residue is represented by an uppercase letter. I want to calculate the ratio of different amino acid residues at each position of the proteins. Here is an example table: Proteins

Calculating symbol (letter) frequencies

2005 Jan 03

Calculating symbol (letter) frequencies

Hello: I am attempting to use R to analyze amino acid frequencies in aligned protein sequences and need some help. So far, I have imported my sequence alignment into a data frame (lets call it "alignment") with each site in one column, so that I have a data frame consisting of columns of letters (the 21 amino acid symbols plus "-") with row names being the corresponding

Package for Molecular Properties

2018 May 03

Package for Molecular Properties

library(sos) (mp <- findFn('{molecular properties}')) ????? ** found 7 matches in 4 packages and opened two web pages in my default browser with (a) the 7 matches and (b) the 4 packages. The first function was something for amino acids, like you suggested.? Two others returned compound and substance information from PubChem. ????? Does this help? ????? Spencer On

Pattern Matching within Vector?

2009 Sep 21

Pattern Matching within Vector?

Dear mailing list, I'm stuck with a tricky problem here - at least it seems tricky to me, being not really talented in pattern matching and regex matters. I'm analysing amino acid mutations by position and type of mutation. E.g. (fictitious example) in position 92, I can find L92V, L92MV, L92I... L is in this example the wild-type amino-acid, and everything behind the position number is

caculate the frequencies of the Amino Acids

2010 Jan 02

caculate the frequencies of the Amino Acids

may some one please help me to sort this out, i am trying to writ a R code for calculating the frequencies of the amino acids in 9 different sequences, i want the code to read the sequence from external text file, i used the following code to do so: x<-read.table("sequence.txt",header=FALSE) then i defined an array for 20 amino acids as following:

Package for Molecular Properties

2018 May 03

Package for Molecular Properties

All Is there a package or library that will, given a nucleotide sequence 1. calculate the extinction coefficient at 260 nm for (Beer-Lambert's law) 2. calculate molecular weight 3. return it's complementary sequence I was able to find several packages that can do similar calculations for an amino acid sequence for proteins but none for nucleic acids. Any pointers, etc. would be

legend and values do not match in ggplot

2017 Aug 04

legend and values do not match in ggplot

I have following codes for ggplots. The legends are given in the plot do not match with the values specified in the codes given below. Your helps highly appreciated. Greg library(ggplot2) p <- ggplot(a,aes(x=NO_BMI_FI_beta ,y=FI_beta ,color= Super.Pathway))+ theme_bw() +theme(panel.border=element_blank()) + geom_point(size=3) p2<-p+scale_color_manual(name="Super.Pathway",

Package(s) for making waffle plot-like figures?

2013 Nov 01

Package(s) for making waffle plot-like figures?

Dear all, I am trying to make a series of waffle plot-like figures for my data to visualize the ratios of amino acid residues at each position. For each one of 37 positions, there may be one to four different amino acid residues. So the data consist of the positions, what residues are there, and the ratios of residues. The ratios of residues at a position add up to 100, or close to 100 (more on

aaMI

2008 Jan 17

aaMI

hi i am new to R language. I want to use aaMI package which calculates the amino acid mutual interaction for a given protein sequence. I had installed the package but when i run the program it gives me the error could not find function "aaMI". can anyone tell me what might be the problem.. -- View this message in context: http://www.nabble.com/aaMI-tp14915744p14915744.html Sent from

Package for Molecular Properties

2018 May 03

Package for Molecular Properties

... In addition, you may wish to also post on the Bioconductor list for this sort of thing. -- Bert Bert Gunter "The trouble with having an open mind is that people keep coming along and sticking things into it." -- Opus (aka Berkeley Breathed in his "Bloom County" comic strip ) On Thu, May 3, 2018 at 12:58 AM, Spencer Graves <spencer.graves at effectivedefense.org>

Appropriate regression model for categorical variables

2007 Jun 12

Appropriate regression model for categorical variables

Dear users, In my psychometric test i have applied logistic regression on my data. My data consists of 50 predictors (22 continuous and 28 categorical) plus a binary response. Using glm(), stepAIC() i didn't get satisfactory result as misclassification rate is too high. I think categorical variables are responsible for this debacle. Some of them have more than 6 level (one has 10 level).

how to order each element according to alphabet

2011 Jul 15

how to order each element according to alphabet

Hi there, I have a large amino acid csv file like this: input.txt: P,LV,Q,Z P,VL,Q,Z P,ML,QL,Z There is a problem with this file, since LV and VL are in fact the same thing. How do I order each element according to alphabetical order so that the desired output would look like: output.txt: P,LV,Q,Z P,LV,Q,Z P,LM,LQ,Z -- View this message in context:

Best HMM package to generate random (protein) sequences?

2011 Mar 22

Best HMM package to generate random (protein) sequences?

Dear All, I would like to generate random protein sequences using a HMM model. Has anybody done that before, or would you have any idea which package is likely to be best for that? The important facts are that the HMM will be fitted on ~3 million sequential observations, with 20 different states (one for each amino acid). I guess that 2-5 hidden states should be enough, and an order of 3 would

Unwanted case sensitivity

2010 Feb 17

Unwanted case sensitivity

I have also posted this on IRC. I have a linux host running stock RHEL 5.4 Samba 3.0.33-3.15. The host acts both as a Samba server and does a CIFS mount of that same share. The reason for doing this is so that programs running on the Linux host have the same case insensitive view as the Windows clients. I have nocase set in the relevant line in /etc/fstab I have case sensitive = No set in the

memory limit in aov

2006 Feb 01

memory limit in aov

I want to do an unbalanced anova on 272,992 observations with 405 factors including 2-way interactions between 1 of these factors and the other 404. After fitting only 11 factors and their interactions I get error messages like: Error: cannot allocate vector of size 1433066 Kb R(365,0xa000ed68) malloc: *** vm_allocate(size=1467461632) failed (error code=3) R(365,0xa000ed68) malloc: ***

[WISH / PATCH] possibility to split string literals across multiple lines

2017 Jun 14

[WISH / PATCH] possibility to split string literals across multiple lines

Hi, I would really like to have a way to split long string literals across multiple lines in R. Currently, if a string literal spans multiple lines, there is no way to inhibit the introduction of newline characters: > "aaa + bbb" [1] "aaa\nbbb" If a line ends with a backslash, it is just ignored: > "aaa\ + bbb" [1] "aaa\nbbb" We could use

Problem using callNextMethod() in S4

2007 Mar 04

Problem using callNextMethod() in S4

Dear all, Maybe, I am doing something wrong, but using R-2.5.0 on my Intel-Mac, I have problems using function callNextMethod() in method initialize. I am loading the following code as file "testS4.R": setClass("baseClass", representation(myname = "character", mydir = "character", "VIRTUAL"),

similar to: about the analysis of strings, thanks