Mao Jianfeng
2011-Jul-13 12:48 UTC
[R] How to predict significant dominant regions of two sequence of numeric values by Hidden Markov Model
Dear all,
I have few training on Hidden Markov Model. But, I intend to solve my
problem by HMM. I would like to have your helps/directions to me.
Here, I have two variables to define the 8 one-dimension space (coordinate.1,
coordinate.2). In this one-dimension space, there are two sequences of
values (shared and specific). This means I would like to
detect/guess/predict the regions (defined by two coordinates variables)
which are significantly dominant (with higher values) by shared/specific in
some consecutive cells (units of the coordinate.2 in the dummy). I would
like to get the coordinates (the 1st two variables) of such dominant regions
for shared or specific.
Thanks in advance.
Best regards,
Jian-Feng,
########################################################
I use R to construct a dummy (the real data are more complex than this
Here is my data set:
mydata <- data.frame(coordinate.1=rep(1:8, each=25),
coordinate.2=rep(seq(100, 2500, 100), 8),
shared=rep(c(100,30,100), c(5,15,5)),
specific=rep(c(25,90,20,30), c(5,7,8,5)))
Here is how I happen to make a plot:
library(ggplot2)
pdf("shared_specific.pdf", width = 14, height = 8)
p.test<-ggplot(mydata, aes(coordinate.2)) +
geom_line(aes(y = shared, colour = "shared")) +
geom_line(aes(y = specific, colour = "specific")) +
facet_grid(coordinate.1 ~., scales = "free_x") +
scale_x_continuous("coordinate.2") +
scale_y_continuous("shared and specific")
p.test
dev.off()
[[alternative HTML version deleted]]
