similar to: Compressing String in R

Displaying 20 results from an estimated 4000 matches similar to: "Compressing String in R"

2009 Jan 05
1
How to extract range of colums in a data frame
Dear all, I have the following data frame: > dat V1 V2 V3 V4 V5 V6 V7 V8 V9 1 1 AAAACACCCACCCCCCCCCCCCCCCCCCCCCCCC 9.0 18 12.00 18.0 15.0 12.0 6.0 2 1 ACGATACGGCGACCACCGAGATCTACACTCTTCC 18.0 8 12.00 18.0 15.0 12.0 18.0 3 1 ACTACTGCTCCCCCCCCACTCCCCCCCCCCCCCC 15.0 8 12.00 12.0 18.0 12.0 12.0 4 1 ACTTATACGGCGACCACCGAGATCTACACTCTTT 15.0
2008 Jun 24
5
Measuring Goodness of a Matrix
Hi all, Suppose I have 2 matrices A and B. And I want to measure how good each of this matrix is. So I intend to compare A and B with another "gold standard" matrix X. Meaning the more similar a matrix to X the better it is. What is the common way in R to measure matrix similarity (ie. A vs X, and B vs X) ? - Gundala Viswanath Jakarta - Indonesia
2008 Jul 07
4
Plot Mixtures of Synthetically Generated Gamma Distributions
Hi, I have the following vector which is created from 3 distinct distribution (three components) of gamma: x=c(rgamma(30,shape=.2,scale=14),rgamma(30,shape=12,scale=10),rgamma(30,shape=5,scale=6)) I want to plot the density curve of X, in a way that it shows a distinct 3 curves that represent each component. How can I do that? I tried this but doesn't work: lines(density(x)) Please
2008 Jun 23
3
Getting only label column of a data frame
Hi, How can I extract the label only from a given data frame. Fore example from this data frame. > print(dataf) V1 V2 V3 V4 V5 V6 V7 V8 V9 11145 14.3 17.1 31.2 41.7 45.8 49.8 68.6 70.6 72.9 3545 10.2 15.6 20.9 23.2 31.4 31.7 36.2 48.4 51.9 8951 15.2 17.5 20.0 21.4 32.4
2009 Jan 06
5
Changing Matrix Header
Dear all, I have the following matrix. > dat A A A A A A A A A A [1,] 0 0 0 0 0 0 0 0 0 0 [2,] 0 0 0 0 0 0 0 0 0 1 [3,] 0 0 0 0 0 0 0 0 0 2 How can I change it into: [,1] [,2] [,3] [,4] [,5] [,6] [,7] [,8] [,9] [,10] [1,] 0 0 0 0 0 0 0 0 0 0 [2,] 0 0 0 0 0 0 0 0 0 1
2008 Aug 01
3
Grouping Index of Matrix Based on Certain Condition
Hi, I have the following (M x N) matrix, where M = 10 and N =2 What I intend to do is to group index of (M) based on this condition of "x_mn" , namely For each M, If x_m1 > x_m2, assign index of M to Group1 otherwise assign index of M into Group 2 > x [,1] [,2] [1,] 4.482909e-01 0.55170907 [2,] 9.479594e-01 0.05204063 [3,] 8.923553e-01 0.10764474
2008 Sep 09
3
Splitting Data Frame into Two Based on Source Array
Dear all, Suppose I have this data frame: > data_main V1 V2 foo 13.1 bar 12.0 qux 10.4 cho 20.33 pox 8.21 And I want to split the data into two parts first part are the one contain in the source array: > src [1] "bar" "pox" and the other one the complement. In the end we hope to get this two dataframes: > data_child1 V1 V2 bar 13.1 pox
2009 Jan 11
3
Converting Numerical Matrix to List of Strings
Hi all, Given a matrix: > mat [,1] [,2] [,3] [1,] 0 0 0 [2,] 3 3 3 [3,] 1 1 1 [4,] 2 1 1 How can I convert it to a list of strings: > desired_output [1] "aaa" "ttt" "ccc" "gcc" In principle: 1. Number of Column in matrix = length of string (= 3) 2. Number of Row in matrix = length of vector ( = 4). 3.
2009 Jan 13
3
Returning Non-Unique Index with Which (alternatives?)
Dear all, I tried to find index in repo given a query with this: > repo <- c("AAA", "AAT", "AAC", "AAG", "ATA", "ATT") > qr <- c("AAC", "ATT", "ATT") > which(repo%in%qr) [1] 3 6 Note that the query contain repeating elements, yet the output of which only returns unique. How can I make it
2008 Jul 06
4
Method for checking automatically which distribtions fits a data
Hi, Suppose I have a vector of data. Is there a method in R to help us automatically suggest which distributions fits to that data (e.g. normal, gamma, multinomial etc) ? - Gundala Viswanath Jakarta - Indonesia
2008 Aug 05
2
Iterating Named List
Hi all, I have the following named list: > print(y) $`200052_s_at` [1] -1066.975 -1063.893 -1062.815 -1062.121 -1059.004 $`200071_at` [1] -959.823 -953.980 -953.886 -948.781 -974.890 $`200084_at` [1] -1135.804 -1132.863 -1128.197 -1128.633 -1125.890 What I want to do is to iterate this name list and process its members. To do that I attempt the following code (but failed): __BEGIN__ ny
2009 Jan 09
4
Extracting File Basename without Extension
Dear all, The basename() function returns the extension also: > myfile <- "path1/path2/myoutput.txt" > basename(myfile) [1] "myoutput.txt" Is there any other function where it just returns plain base: "myoutput" i.e. without 'txt' - Gundala Viswanath Jakarta - Indonesia
2009 Jan 09
3
Pack and Unpack Strings in R
Dear all, Does R has any function/package that can pack and unpack string into bit size? The reason I want to do this in R is that R has much more native statistical function than Perl. Yet the data I need to process is so large that it required me to compress it into smaller unit -> process it -> finally recover them back again into string with new information. In Perl the
2008 Jun 23
2
Pairwise Partitioning of a Vector
Hi, How can I partitioned an example vector like this > print(myvector) [1] 30.9 60.1 70.0 73.0 75.0 83.9 93.1 97.6 98.8 113.9 into the following pairwise partition: PAIR1 part1 = 30.9 part2 = 60.1 70.0 73.0 75.0 83.9 93.1 97.6 98.8 113.9 PAIR2 part1 = 30.9 60.1 part2 = 70.0 73.0 75.0 83.9 93.1 97.6 98.8 113.9 .... PAIR9 part1 = 30.9
2009 Jan 13
3
Extracting Hash via Vector
Dear all, Suppose I have a hash created with this x <- list() for (i in c('test', 'some', 'more')){ x[[i]] <- runif(1) } then I want to extract the elem of that hash with a vector > q <- c("some", "more", "not_there") But why this failed? > extracted <- x[[q]] Error in x[[q]] : subscript out of bounds we expect the
2008 Dec 22
3
Convert ASCII string to Decimal in R (vice versa) was: Hex
Hi Dieter, Sorry my mistake. I wanted to convert them into Decimal (not Hexadecimal). Given this string, the desired answer follows: > ascii_str <- "ORQ>IK" 79 82 81 62 73 75 > ascii_str2 <- "FDC" 70 68 67 - Gundala Viswanath Jakarta - Indonesia On Mon, Dec 22, 2008 at 5:49 PM, Dieter Menne <dieter.menne at menne-biomed.de> wrote: > Gundala
2009 Jan 08
2
Faster Printing Alternatives to 'cat'
Dear all, I found that printing with 'cat' is very slow. For example in my machine this snippet __BEGIN__ # I need to resolve to use this type of loop. # because using write(), I need to create a matrix which # consumes so much memory. Note that "foo, bar, qux" object # is already very large (>2Gb) for ( s in 1:length(x) ) {
2008 Jun 16
2
Creating a Hash from Data.Frame
Hi, I have the following data frame: > print(mydatframe) __DATAFRAME__ V1 V2 V3 1 1007_s_at DDR1 discoidin domain receptor tyrosine kinase 1 2 1053_at RFC2 replication factor C (activator 1) 2, 40kDa 3 117_at HSPA6 heat shock 70kDa protein 6 (HSP70B') __END__ Is there a way to create a hash with V2 as Key and V3 as its value? - Gundala Viswanath Jakarta - Indonesia
2008 Oct 06
2
Problem with Grep Under Loop
Dear all, I have no problem with this individual grep command: > datk <- grep("XM_528056", source$V1) > dat2 <- source[datk,] > print(dat2) V1 V2 V3 V4 V5 V6 V7 35995 XM_528056 panTro2 chr8 + 1775569 1896107 Chimpanzee BUT, when I run them under the loop it gives this error: > hm_acc <-
2009 Jan 16
5
Value Lookup from File without Slurping
Dear all, I have a repository file (let's call it repo.txt) that contain two columns like this: # tag value AAA 0.2 AAT 0.3 AAC 0.02 AAG 0.02 ATA 0.3 ATT 0.7 Given another query vector > qr <- c("AAC", "ATT") I would like to find the corresponding value for each query above, yielding: 0.02 0.7 However, I want to avoid slurping whole repo.txt