Displaying 20 results from an estimated 4000 matches similar to: "Compressing String in R"
2009 Jan 05
1
How to extract range of colums in a data frame
Dear all,
I have the following data frame:
> dat
V1 V2 V3 V4 V5 V6 V7 V8 V9
1 1 AAAACACCCACCCCCCCCCCCCCCCCCCCCCCCC 9.0 18 12.00 18.0 15.0 12.0 6.0
2 1 ACGATACGGCGACCACCGAGATCTACACTCTTCC 18.0 8 12.00 18.0 15.0 12.0 18.0
3 1 ACTACTGCTCCCCCCCCACTCCCCCCCCCCCCCC 15.0 8 12.00 12.0 18.0 12.0 12.0
4 1 ACTTATACGGCGACCACCGAGATCTACACTCTTT 15.0
2008 Jun 24
5
Measuring Goodness of a Matrix
Hi all,
Suppose I have 2 matrices A and B.
And I want to measure how good each of this matrix is.
So I intend to compare A and B with another "gold standard"
matrix X. Meaning the more similar a matrix to X the better it is.
What is the common way in R to
measure matrix similarity (ie. A vs X, and B vs X) ?
- Gundala Viswanath
Jakarta - Indonesia
2008 Jul 07
4
Plot Mixtures of Synthetically Generated Gamma Distributions
Hi,
I have the following vector
which is created from 3 distinct distribution (three components) of gamma:
x=c(rgamma(30,shape=.2,scale=14),rgamma(30,shape=12,scale=10),rgamma(30,shape=5,scale=6))
I want to plot the density curve of X, in a way that it shows
a distinct 3 curves that represent each component.
How can I do that?
I tried this but doesn't work:
lines(density(x))
Please
2008 Jun 23
3
Getting only label column of a data frame
Hi,
How can I extract the label only from a given data frame.
Fore example from this data frame.
> print(dataf)
V1 V2 V3 V4 V5 V6 V7 V8 V9
11145 14.3 17.1 31.2 41.7 45.8 49.8 68.6 70.6 72.9
3545 10.2 15.6 20.9 23.2 31.4 31.7 36.2 48.4 51.9
8951 15.2 17.5 20.0 21.4 32.4
2009 Jan 06
5
Changing Matrix Header
Dear all,
I have the following matrix.
> dat
A A A A A A A A A A
[1,] 0 0 0 0 0 0 0 0 0 0
[2,] 0 0 0 0 0 0 0 0 0 1
[3,] 0 0 0 0 0 0 0 0 0 2
How can I change it into:
[,1] [,2] [,3] [,4] [,5] [,6] [,7] [,8] [,9] [,10]
[1,] 0 0 0 0 0 0 0 0 0 0
[2,] 0 0 0 0 0 0 0 0 0 1
2008 Aug 01
3
Grouping Index of Matrix Based on Certain Condition
Hi,
I have the following (M x N) matrix, where M = 10 and N =2
What I intend to do is to group index of (M) based on this condition
of "x_mn" , namely
For each M,
If x_m1 > x_m2, assign index of M to Group1
otherwise assign index of M into Group 2
> x
[,1] [,2]
[1,] 4.482909e-01 0.55170907
[2,] 9.479594e-01 0.05204063
[3,] 8.923553e-01 0.10764474
2008 Sep 09
3
Splitting Data Frame into Two Based on Source Array
Dear all,
Suppose I have this data frame:
> data_main
V1 V2
foo 13.1
bar 12.0
qux 10.4
cho 20.33
pox 8.21
And I want to split the data into two parts
first part are the one contain in the source array:
> src
[1] "bar" "pox"
and the other one the complement.
In the end we hope to get this two dataframes:
> data_child1
V1 V2
bar 13.1
pox
2009 Jan 11
3
Converting Numerical Matrix to List of Strings
Hi all,
Given a matrix:
> mat
[,1] [,2] [,3]
[1,] 0 0 0
[2,] 3 3 3
[3,] 1 1 1
[4,] 2 1 1
How can I convert it to a list of strings:
> desired_output
[1] "aaa" "ttt" "ccc" "gcc"
In principle:
1. Number of Column in matrix = length of string (= 3)
2. Number of Row in matrix = length of vector ( = 4).
3.
2009 Jan 13
3
Returning Non-Unique Index with Which (alternatives?)
Dear all,
I tried to find index in repo given a query with this:
> repo <- c("AAA", "AAT", "AAC", "AAG", "ATA", "ATT")
> qr <- c("AAC", "ATT", "ATT")
> which(repo%in%qr)
[1] 3 6
Note that the query contain repeating elements, yet
the output of which only returns unique.
How can I make it
2008 Jul 06
4
Method for checking automatically which distribtions fits a data
Hi,
Suppose I have a vector of data.
Is there a method in R to help us automatically
suggest which distributions fits to that data
(e.g. normal, gamma, multinomial etc) ?
- Gundala Viswanath
Jakarta - Indonesia
2008 Aug 05
2
Iterating Named List
Hi all,
I have the following named list:
> print(y)
$`200052_s_at`
[1] -1066.975 -1063.893 -1062.815 -1062.121 -1059.004
$`200071_at`
[1] -959.823 -953.980 -953.886 -948.781 -974.890
$`200084_at`
[1] -1135.804 -1132.863 -1128.197 -1128.633 -1125.890
What I want to do is to iterate this name list and process its members.
To do that I attempt the following code (but failed):
__BEGIN__
ny
2009 Jan 09
4
Extracting File Basename without Extension
Dear all,
The basename() function returns the extension also:
> myfile <- "path1/path2/myoutput.txt"
> basename(myfile)
[1] "myoutput.txt"
Is there any other function where it just returns
plain base:
"myoutput"
i.e. without 'txt'
- Gundala Viswanath
Jakarta - Indonesia
2009 Jan 09
3
Pack and Unpack Strings in R
Dear all,
Does R has any function/package that can pack
and unpack string into bit size?
The reason I want to do this in R is that R
has much more native statistical function than Perl.
Yet the data I need to process is so large that it
required me to compress it into smaller unit -> process it -> finally
recover them back again into string with new information.
In Perl the
2008 Jun 23
2
Pairwise Partitioning of a Vector
Hi,
How can I partitioned an example vector like this
> print(myvector)
[1] 30.9 60.1 70.0 73.0 75.0 83.9 93.1 97.6 98.8 113.9
into the following pairwise partition:
PAIR1
part1 = 30.9
part2 = 60.1 70.0 73.0 75.0 83.9 93.1 97.6 98.8 113.9
PAIR2
part1 = 30.9 60.1
part2 = 70.0 73.0 75.0 83.9 93.1 97.6 98.8 113.9
....
PAIR9
part1 = 30.9
2009 Jan 13
3
Extracting Hash via Vector
Dear all,
Suppose I have a hash created with this
x <- list()
for (i in c('test', 'some', 'more')){
x[[i]] <- runif(1)
}
then I want to extract the elem of that hash with
a vector
> q <- c("some", "more", "not_there")
But why this failed?
> extracted <- x[[q]]
Error in x[[q]] : subscript out of bounds
we expect the
2008 Dec 22
3
Convert ASCII string to Decimal in R (vice versa) was: Hex
Hi Dieter,
Sorry my mistake. I wanted to convert them
into Decimal (not Hexadecimal).
Given this string, the desired answer follows:
> ascii_str <- "ORQ>IK"
79 82 81 62 73 75
> ascii_str2 <- "FDC"
70 68 67
- Gundala Viswanath
Jakarta - Indonesia
On Mon, Dec 22, 2008 at 5:49 PM, Dieter Menne
<dieter.menne at menne-biomed.de> wrote:
> Gundala
2009 Jan 08
2
Faster Printing Alternatives to 'cat'
Dear all,
I found that printing with 'cat' is very slow.
For example in my machine this snippet
__BEGIN__
# I need to resolve to use this type of loop.
# because using write(), I need to create a matrix which
# consumes so much memory. Note that "foo, bar, qux" object
# is already very large (>2Gb)
for ( s in 1:length(x) ) {
2008 Jun 16
2
Creating a Hash from Data.Frame
Hi,
I have the following data frame:
> print(mydatframe)
__DATAFRAME__
V1 V2 V3
1 1007_s_at DDR1 discoidin domain receptor tyrosine kinase 1
2 1053_at RFC2 replication factor C (activator 1) 2, 40kDa
3 117_at HSPA6 heat shock 70kDa protein 6 (HSP70B')
__END__
Is there a way to create a hash with
V2 as Key and V3 as its value?
- Gundala Viswanath
Jakarta - Indonesia
2008 Oct 06
2
Problem with Grep Under Loop
Dear all,
I have no problem with this individual grep command:
> datk <- grep("XM_528056", source$V1)
> dat2 <- source[datk,]
> print(dat2)
V1 V2 V3 V4 V5 V6 V7
35995 XM_528056 panTro2 chr8 + 1775569 1896107 Chimpanzee
BUT, when I run them under the loop it gives this error:
> hm_acc <-
2009 Jan 16
5
Value Lookup from File without Slurping
Dear all,
I have a repository file (let's call it repo.txt)
that contain two columns like this:
# tag value
AAA 0.2
AAT 0.3
AAC 0.02
AAG 0.02
ATA 0.3
ATT 0.7
Given another query vector
> qr <- c("AAC", "ATT")
I would like to find the corresponding value for each query above,
yielding:
0.02
0.7
However, I want to avoid slurping whole repo.txt