Mike Williamson
2010-Dec-20 17:26 UTC
[R] ideas, modeling highly discrete time-series data
Hello all,
First of all, thanks so those of you who helped me a week or so ago
managing a time series with varying gaps between the data series in 'R'.
(My final preferred solution was to use "its" function & then
forecast(Arima( ) ). )
My next question is a general statistical question where I'd like some
advice, for those willing / able to proffer any wisdom:
- I need to predict using this same time series, where the *data* are
highly discrete. E.g., I will have values like 1e5, 2.2e5, and 3.6e5, but I
will never have 1.3e5 or 1.8e5, etc.
- I could simply leave these values as discrete, similar to a binomial
distribution, but then I am not sure how to use time series tricks like
arima above. For time-series analyses that I know of, an
assumption of an
approximately normal distribution is expected. No simple normalization
(e.g., log(values) ) works, since the non-normality arises from
the highly
discrete distribution more than any drastic asymmetry in the population
spread.
- I could leave the values as they are an work with a model where the
assumption is violated... I am not sure how sensitive a model
such as arima
is on the population distribution
- Or I could... (here's where I am hoping for some collective genius).
Thanks in advance for any help! If this isn't the best forum, since I
know this is not specifically an 'R' question, please let me know of a
better forum to post such a question.
Thanks!
Mike
"Telescopes and bathyscaphes and sonar probes of Scottish lakes,
Tacoma Narrows bridge collapse explained with abstract phase-space maps,
Some x-ray slides, a music score, Minard's Napoleanic war:
The most exciting frontier is charting what's already here."
-- xkcd
--
Help protect Wikipedia. Donate now:
http://wikimediafoundation.org/wiki/Support_Wikipedia/en
[[alternative HTML version deleted]]
Kjetil Halvorsen
2010-Dec-21 15:47 UTC
[R] ideas, modeling highly discrete time-series data
You could try the timeseries list at https://www.jiscmail.ac.uk/cgi-bin/webadmin?A0=TIMESERIES kjetil On Mon, Dec 20, 2010 at 6:26 PM, Mike Williamson <this.is.mvw at gmail.com> wrote:> Hello all, > > ? ?First of all, thanks so those of you who helped me a week or so ago > managing a time series with varying gaps between the data series in 'R'. > (My final preferred solution was to use "its" function & then > forecast(Arima( ) ). ?) > > ? ?My next question is a general statistical question where I'd like some > advice, for those willing / able to proffer any wisdom: > > ? - I need to predict using this same time series, where the *data* are > ? highly discrete. ?E.g., I will have values like 1e5, 2.2e5, and 3.6e5, but I > ? will never have 1.3e5 or 1.8e5, etc. > ? ? ?- I could simply leave these values as discrete, similar to a binomial > ? ? ?distribution, but then I am not sure how to use time series tricks like > ? ? ?arima above. ?For time-series analyses that I know of, an > assumption of an > ? ? ?approximately normal distribution is expected. ?No simple normalization > ? ? ?(e.g., log(values) ) works, since the non-normality arises from > the highly > ? ? ?discrete distribution more than any drastic asymmetry in the population > ? ? ?spread. > ? ? ?- I could leave the values as they are an work with a model where the > ? ? ?assumption is violated... I am not sure how sensitive a model > such as arima > ? ? ?is on the population distribution > ? ? ?- Or I could... (here's where I am hoping for some collective genius). > > ? ?Thanks in advance for any help! ?If this isn't the best forum, since I > know this is not specifically an 'R' question, please let me know of a > better forum to post such a question. > > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?Thanks! > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? Mike > > > > "Telescopes and bathyscaphes and sonar probes of Scottish lakes, > Tacoma Narrows bridge collapse explained with abstract phase-space maps, > Some x-ray slides, a music score, Minard's Napoleanic war: > The most exciting frontier is charting what's already here." > ?-- xkcd > > -- > Help protect Wikipedia. Donate now: > http://wikimediafoundation.org/wiki/Support_Wikipedia/en > > ? ? ? ?[[alternative HTML version deleted]] > > ______________________________________________ > R-help at r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. >
Seemingly Similar Threads
- anyone know why package "RandomForest" na.roughfix is so slow??
- manipulating the Date & Time classes
- manipulating the Date & Time classes
- question regarding "varImpPlot" results vs. model$importance data on package "RandomForest"
- How to get 'R' to talk BACK to other languages / scripts??