Displaying 20 results from an estimated 20000 matches similar to: "time series clustering"
2005 Jul 25
1
cluster
Dear listers:
I have a question on the clustering methods available in R. I am
trying to down-sample the majority class in a classification problem
on an imbalanced dataset. Since I don't want to lose information in
the original dataset, I don't want to use naive down-sampling: I think
using clustering on the majority class' side to select
"representative" samples might
2008 Aug 24
2
similarity between two gene lists with varied length
Dear listers,
a little off-topic:
I am looking for, and want to compare, algorithms which can calculate "distance" or
"similarity" between two gene lists with different lengths.
Any paper, any implementation in R and any suggestion is welcome!
Thanks,
--
Weiwei Shi, Ph.D
Research Scientist
GeneGO, Inc.
"Did you always know?"
"No, I did not. But I believed..."
2007 Jun 18
3
source a specific function
Dear Listers:
For example, if I have a .R source file which has more than one
function, and I want to just load only one of the functions, how could
I do that? (removing the rest after sourcing is not what I intend b/c
in my workspace, I might have some of the rest and I don't want to
change my workspace: i.e., I only change my workspace by adding one
function from an R source file).
Thanks,
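One possible approach (a sketch, not taken from the thread) is to source the file into a scratch environment and copy out only the function that is needed. The file contents and the names f1, f2 and scratch below are made up for illustration.

```r
# Hypothetical source file with two functions, written to a temp path for the demo.
src <- tempfile(fileext = ".R")
writeLines(c("f1 <- function(x) x + 1",
             "f2 <- function(x) x * 2"), src)

# Source into a scratch environment so the workspace itself is not touched.
scratch <- new.env()
sys.source(src, envir = scratch)

# Copy only the wanted function into the global environment.
assign("f1", get("f1", envir = scratch), envir = globalenv())
```

Note that f1 keeps the scratch environment as its enclosure, so it can still see f2 there if it calls it, while f2 never appears in the workspace.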
2006 Oct 17
4
cluster in R
hi,
Is there a good summary of clustering methods in R? It seems there
are many packages involving it.
And I have two questions on clustering here:
1. Is there a way to evaluate the effectiveness (or separation) of a
clustering (rather than by visualization)?
2. Is there a search method (like genetic search) which can help find
the subset of attributes which gives the best separation?
Thanks,
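For question 1, one commonly used numeric measure is the average silhouette width, available through silhouette() in the cluster package. The sketch below uses made-up toy data and k-means purely as an example; it is not an answer from the thread and does not address question 2.

```r
library(cluster)  # recommended package, shipped with most R installations

set.seed(1)
# Toy data: two well-separated 2-D groups (made up for illustration).
x <- rbind(matrix(rnorm(100, mean = 0), ncol = 2),
           matrix(rnorm(100, mean = 5), ncol = 2))

km  <- kmeans(x, centers = 2, nstart = 5)
sil <- silhouette(km$cluster, dist(x))

# Average silhouette width lies in [-1, 1]; values near 1 suggest
# compact, well-separated clusters.
avg_width <- mean(sil[, "sil_width"])
```

Comparing avg_width across different numbers of clusters or different attribute subsets gives a rough, visualization-free way to rank them.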
2006 Oct 18
1
selectively load some objects from old workspace
Dear Listers:
I have a question on loading objects from workspace:
suppose I have two workspaces for two approaches. My old workspace has
some objects I need for the new workspace but I don't want to load the
whole old workspace and remove most of the old objects and get what I
want. Is there an easier way to do something like this:
load "some needed obj" from old workspace, which has been
2007 Aug 28
1
how to calculate mean into a list
Dear Listers:
I have this task: suppose a0 is a list of 10 data.frames, and I want to
calculate something like this:
> (a0[[1]]+a0[[2]]+..+a0[[10]])/10
Thanks.
--
Weiwei Shi, Ph.D
Research Scientist
GeneGO, Inc.
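A minimal sketch of one way to do this, assuming all data frames in a0 have the same shape and only numeric columns (the toy a0 below is made up for illustration):

```r
# Toy list of 10 data frames with identical shape.
a0 <- lapply(1:10, function(i) data.frame(x = rep(i, 3), y = rep(2 * i, 3)))

# Element-wise sum of all data frames, divided by how many there are.
a_mean <- Reduce(`+`, a0) / length(a0)
```

Reduce() applies `+` pairwise along the list, so the same line works for any list length.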
2007 Jun 12
4
pretty report
Dear Listers:
I have a couple of data frames to report and each corresponds to a
different condition, e.g. conditions=c(10, 15, 20, 25). In this
example, four data frames need to be exported in a "pretty" report.
I know Perl has a module for exporting data to Excel and, after
googling, I found that R does not.
So I am wondering if there is a package in R for generating good
reports. I
2005 Aug 04
1
some thoughts on outlier detection, need help!
Dear listers:
I have an idea for outlier detection and I need to implement it in R
first. I hope I can get some input from all the
gurus here.
I selected a distance-based approach:
step 1:
calculate the distance between every pair of rows in a data frame. To account
for the different scales of the variables, I chose the Mahalanobis distance, using
variance as the scaler.
step 2:
Let k be the number of
2005 Oct 27
3
memory problem in handling large dataset
Dear Listers:
I have a question on handling a large dataset. I searched R-Search and I
hope I can get more information on my specific case.
First, my dataset has 1.7 billion observations and 350 variables,
of which 300 are floats and 50 are integers.
My system is a Linux box with 8 GB of memory and a 64-bit CPU (currently, we don't
plan to buy more memory).
> R.version
_
platform
2007 Jan 24
3
Cronbach's alpha
Dear Listers:
I used cronbach{psy} to evaluate internal consistency, and some sets
of variables gave me alpha=-1.1003, while others gave alpha=-0.2,
alpha=0.89, and so on. I am interested in knowing how to interpret
1. a negative value
2. a negative value less than -1.
I also want to re-mention my previous question about how to evaluate
the consistency of a set of variables and about the total
2007 May 01
1
dlda{supclust} 's output
Hi,
I am using the dlda algorithm from the supclust package and I am wondering if
its output can be a continuous probability instead of a discrete class
label (zero or one). dlda puts some restrictions on the covariance
matrix compared with lda, while lda can give such output.
thanks,
--
Weiwei Shi, Ph.D
Research Scientist
GeneGO, Inc.
2011 May 27
4
network package in R
Hi there,
I need a network builder that can change the node size and color; I am not
sure whether the network package in R can do this. The other functions I
wanted have been found in that package.
BTW, if there is another package in R related to this, please suggest it too.
Thanks,
Weiwei
--
Weiwei Shi, Ph.D
Research Scientist
2007 Apr 11
5
how to reverse a list
Hi, there:
I am wondering if there is a quick way to "reverse" a list like this:
t0 <- list(a=1, b=1, c=2, d=1)
reverse t0 to get t1:
> t1
$`1`
[1] "a" "b" "d"
$`2`
[1] "c"
thanks.
--
Weiwei Shi, Ph.D
Research Scientist
GeneGO, Inc.
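One way to get this kind of "reversed" list (a sketch, assuming every element of t0 holds a single value) is to split the names by the values:

```r
t0 <- list(a = 1, b = 1, c = 2, d = 1)

# Group the element names by the value each element holds.
t1 <- split(names(t0), unlist(t0))
```

Here t1 has one element per distinct value, named after that value and holding the names that mapped to it.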
2007 Apr 24
5
intersect more than two sets
Hi,
I searched the archives and did not find a good solution to this.
Assume I have 10 sets and I want the character elements common to all of them.
How could I do that?
--
Weiwei Shi, Ph.D
Research Scientist
GeneGO, Inc.
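A minimal sketch, assuming the sets are stored together in a list (the three toy sets below are made up; the same call works for 10):

```r
sets <- list(c("a", "b", "c", "d"),
             c("b", "c", "d", "e"),
             c("c", "d", "f"))

# Fold intersect() over the list: ((set1 & set2) & set3) & ...
common <- Reduce(intersect, sets)
```

Reduce() simply applies the two-argument intersect() repeatedly, so no loop is needed.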
2007 Oct 29
3
how to split data.frame by row?
hi,
If I have a 20 x 3 data.frame, how can I split it into
10 x 6 (moving the lower 10 x 3 part into columns)
or
5 x 12?
Thanks
--
Weiwei Shi, Ph.D
Research Scientist
GeneGO, Inc.
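One possible sketch: cut the rows into equal consecutive blocks and bind the blocks side by side. The helper name widen and the toy data frame are hypothetical, and the approach assumes the row count divides evenly by the number of blocks.

```r
df <- data.frame(a = 1:20, b = 21:40, c = 41:60)

# Hypothetical helper: split rows into n_blocks consecutive blocks,
# then place the blocks next to each other as columns.
widen <- function(d, n_blocks) {
  stopifnot(nrow(d) %% n_blocks == 0)
  block <- rep(seq_len(n_blocks), each = nrow(d) / n_blocks)
  parts <- split(d, block)
  # Reset row names so the blocks line up cleanly when bound.
  parts <- lapply(parts, function(p) { rownames(p) <- NULL; p })
  do.call(cbind, parts)
}

w2 <- widen(df, 2)  # 10 x 6
w4 <- widen(df, 4)  # 5 x 12
```

The resulting column names are prefixed with the block number, which keeps them unique.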
2009 Jul 13
2
how to keep row name if there is only one row selected from a data frame
Hi, there:
Assume I have a data frame A with row names a to e,
> A
[,1] [,2]
a 1 6
b 2 7
c 3 8
d 4 9
e 5 10
When I use A[1,], I lose its row name, as shown below. How can I keep
it? Is there an easy way, rather than restoring it by hand after
using A[1,]?
> A[1,]
[1] 1 6
Thanks,
W.
--
Weiwei Shi, Ph.D
Research
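The printed column headers ([,1] [,2]) suggest that A is a matrix rather than a data frame. Under that assumption, a minimal sketch is to pass drop = FALSE when indexing, which keeps the result a one-row matrix with its row name:

```r
# Rebuild a matrix like the one printed in the post.
A <- matrix(1:10, ncol = 2, dimnames = list(letters[1:5], NULL))

# drop = FALSE stops R from collapsing the single row to a plain vector.
row1 <- A[1, , drop = FALSE]
```

row1 is then a 1 x 2 matrix whose row name is still "a".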
2007 Jun 25
3
a string to environment or function
Hi,
I am wondering how to write a function Fun that makes the following work:
t0 <- (paste("hgu133a", "ENTREZID", sep=""))
xx <- as.list(Fun(t0)) # make it work like xx<-as.list(hgu133aENTREZID)
thanks,
--
Weiwei Shi, Ph.D
Research Scientist
GeneGO, Inc.
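One possible sketch uses get() to look an object up by its name. The object demoENTREZID below is a made-up stand-in; in the post it would be hgu133aENTREZID, assumed to be available already from its annotation package.

```r
# Stand-in object for the demo (hypothetical; replaces hgu133aENTREZID).
demoENTREZID <- list(probe1 = "100", probe2 = "200")

# Look up an object by the string holding its name.
Fun <- function(name) get(name)

t0 <- paste("demo", "ENTREZID", sep = "")
xx <- as.list(Fun(t0))
```

get() searches the enclosing environments and the search path, so objects from attached packages are found as well.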
2005 Jan 27
3
clustering
Hi,
I have a question (sorry if it is a dumb one), which I phrase in the
following R code:
group1<-rnorm(n=50, mean=0, sd=1)
group2<-rnorm(n=20, mean=1, sd=1.5)
group3<-c(group1,group2)
Now, if I am given a dataset from group3, what method (discriminant
analysis, clustering, maybe) is best for clustering it in R?
The known info includes: 2 clusters,
2005 Jul 13
1
read.table
Hi,
I have a question on read.table.
I have a dataset with 273,000 lines and 195 columns. I used
read.table to load the data into R:
trn<-read.table('train1.dat', header=F, sep='|', na.strings='.')
I found it takes forever.
Then I ran read.table again on 1/10 of the data (test), and this
time it finished quickly. So, there might be something wrong in my
data
2005 Oct 11
1
a problem in random forest
Hi, there:
I spent some time on this but really cannot figure it out; maybe I
missed something here.
My data looks like this:
> dim(trn3)
[1] 7361 209
> dim(val3)
[1] 7427 209
> mg.rf2<-randomForest(x=trn3[,1:208], y=trn3[,209], data=trn3, xtest=val3[,
1:208], ytest=val3[,209], importance=T)
my test data has 7427 observations but after prediction,
> dim(mg.rf2$votes)