I was using the hist() function to create a frequency table of some network traffic data. The range in values is rather large, from 0 till just under 10e12. Calling hist(x, breaks=c(0,1000,1e6,1e9,1e12),plot=F,freq=T) causes hist() to return : $breaks [1] -1.0000e+05 1.0100e+05 1.1000e+06 1.0001e+09 1.0000e+12 Is this recalculation of the breaks by hist() intended? Maarten van Gelder. -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
Sorry, I forgot to mention in my earlier post that I run version 1.2.3 on NT4.0 I was using the hist() function to create a frequency table of some network traffic data. The range in values is rather large, from 0 till just under 10e12. Calling hist(x, breaks=c(0,1000,1e6,1e9,1e12),plot=F,freq=T) causes hist() to return : $breaks [1] -1.0000e+05 1.0100e+05 1.1000e+06 1.0001e+09 1.0000e+12 Is this recalculation of the breaks by hist() intended? Maarten van Gelder. -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
On Tue, 26 Jun 2001, Maarten van Gelder wrote:> Sorry, I forgot to mention in my earlier post that I run version 1.2.3 on > NT4.0Thanks: hist has been changed several times recently.> I was using the hist() function to create a frequency table of some network > traffic data. The range in values is rather large, from 0 till just under > 10e12. Calling hist(x, breaks=c(0,1000,1e6,1e9,1e12),plot=F,freq=T) causes > hist() to return : > > $breaks > [1] -1.0000e+05 1.0100e+05 1.1000e+06 1.0001e+09 1.0000e+12 > > Is this recalculation of the breaks by hist() intended?Yes. You have got caught by the following: fuzz <- if (right) c(if (include.lowest) -diddle else diddle, rep(diddle, length(breaks) - 1)) else c(rep(-diddle, length(breaks) - 1), if (include.lowest) diddle else -diddle) breaks <- breaks + fuzz Don''t you want to use log10 scale? -- Brian D. Ripley, ripley at stats.ox.ac.uk Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/ University of Oxford, Tel: +44 1865 272861 (self) 1 South Parks Road, +44 1865 272860 (secr) Oxford OX1 3TG, UK Fax: +44 1865 272595 -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
On Tue, 26 Jun 2001, Maarten van Gelder wrote:> I was using the hist() function to create a frequency table of some network > traffic data. The range in values is rather large, from 0 till just under > 10e12. Calling hist(x, breaks=c(0,1000,1e6,1e9,1e12),plot=F,freq=T) causes > hist() to return : > > $breaks > [1] -1.0000e+05 1.0100e+05 1.1000e+06 1.0001e+09 1.0000e+12 > > Is this recalculation of the breaks by hist() intended? > > Maarten van Gelder.How many zeroes are there? If you got rid of them, a log transformation might help you. Also, have you considered looking at the distribution of the times between packets (or bytes)?? There are some interesting theoretical models for the distributions of times between packets. I''ve got a few books on the subject, but they''re at home. E-mail me off list if you''re interested. -- znmeb at aracnet.com (M. Edward Borasky) http://www.aracnet.com/~znmeb How to Stop A Folksinger Cold # 4 "Are you going to Scarborough Fair?" No. -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
Prof Brian Ripley <ripley at stats.ox.ac.uk> writes:> On Tue, 26 Jun 2001, Maarten van Gelder wrote: > > > Sorry, I forgot to mention in my earlier post that I run version 1.2.3 on > > NT4.0 > > Thanks: hist has been changed several times recently. > > > I was using the hist() function to create a frequency table of some network > > traffic data. The range in values is rather large, from 0 till just under > > 10e12. Calling hist(x, breaks=c(0,1000,1e6,1e9,1e12),plot=F,freq=T) causes > > hist() to return : > > > > $breaks > > [1] -1.0000e+05 1.0100e+05 1.1000e+06 1.0001e+09 1.0000e+12 > > > > Is this recalculation of the breaks by hist() intended? > > Yes. You have got caught by the following: > > fuzz <- if (right) > c(if (include.lowest) -diddle else diddle, rep(diddle, length(breaks) - 1)) > else c(rep(-diddle, length(breaks) - 1), if (include.lowest) diddle else -diddle) > breaks <- breaks + fuzz > > Don''t you want to use log10 scale? >or just table(cut(x,breaks=c(0,1000,1e6,1e9,1e12))) Basically, just don''t expect hist() to do sensible things with things that cannot sensibly be plotted. (Your first column would have a width of one billionth of the total scale, whereas the last one takes up 99.9% of the total!) -- O__ ---- Peter Dalgaard Blegdamsvej 3 c/ /''_ --- Dept. of Biostatistics 2200 Cph. N (*) \(*) -- University of Copenhagen Denmark Ph: (+45) 35327918 ~~~~~~~~~~ - (p.dalgaard at biostat.ku.dk) FAX: (+45) 35327907 -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
It turns out I have one of the books here. It''s _A Practical Guide To Heavy Tails_ (Adler, Feldman, Taqqu editors, Birkhauser is the publisher) and there are a few chapters on analysis of network traffic. My own applications are far less sophisticated than this book; we don''t have that kind of data. -- znmeb at aracnet.com (M. Edward Borasky) http://www.aracnet.com/~znmeb -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._