Hi, I am new to R and I would like to get your help in finding 'outliers'. I have mvoutlier package installed in my system and added the package . But I not able find a function from 'mvoutlier' package which will identify 'outliers'. This is the sample list of data I have got which has one out-lier. 11489 11008 11873 80000000 9558 8645 8024 8371 It will be of great help if somebody have got an example script for the same. Thanks & Regards, Thomas [[alternative HTML version deleted]]
Hi I had not see any answer yet but maybe there is nobody who wants to touch the elusive object of "outlier". Neither me, but here are some ideas how one can proceed. First of all its always up to you what is considered an outlier and how will you deal with them. I usually call an outlier any item which does not fit to the pattern and the pattern is usually best observed by some plotting function. You can identify outlier points, inspect the data source, correct typing mistakes and only if the value is really measured and you can not find any reason why it has such value it is real outlier. Then ***you*** need to decide what to do with it - discard, can come from some long tailed distribution, ... So here are my 0.02$ regarding an outlier theme. Regards Petr> > Hi, > I am new to R and I would like to get your help in finding > 'outliers'. > I have mvoutlier package installed in my system and added the package . > But I not able find a function from 'mvoutlier' package which willidentify> 'outliers'. > This is the sample list of data I have got which has one out-lier. > 11489 11008 11873 80000000 9558 8645 8024 8371 It will be of > great help if somebody have got an example script for the same. > > Thanks & Regards, > Thomas > > [[alternative HTML version deleted]] > > ______________________________________________ > R-help at r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guidehttp://www.R-project.org/posting-guide.html> and provide commented, minimal, self-contained, reproducible code.
Petr and Bert offer sound advice. At the risk of getting completely ostracized, here's how you could find outliers using the definition of 'outlier' used by R's boxplot function and at the same time see your data.> dat = c(11489, 11008, 11873, 80000000, 9558, 8645, 8024, 8371) > a = boxplot(dat) > a$out[1] 8e+07 -----Original Message----- From: Prakash Thomas Sent: Tuesday, May 15, 2012 7:00 AM To: r-help at r-project.org Subject: [R] how to find outliers from the list of values Hi, I am new to R and I would like to get your help in finding 'outliers'. I have mvoutlier package installed in my system and added the package . But I not able find a function from 'mvoutlier' package which will identify 'outliers'. This is the sample list of data I have got which has one out-lier. 11489 11008 11873 80000000 9558 8645 8024 8371 It will be of great help if somebody have got an example script for the same. Thanks & Regards, Thomas [[alternative HTML version deleted]] ______________________________________________ R-help at r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. ------------------------------------------ Robert W. Baer, Ph.D. Professor of Physiology Kirksville College of Osteopathic Medicine A. T. Still University of Health Sciences 800 W. Jefferson St. Kirksville, MO 63501 660-626-2322 FAX 660-626-2965