Displaying 20 results from an estimated 10000 matches similar to: "outlier detection"
2000 Apr 21
1
outlier detection methods in r?
hi -
if I sample from a normal distribution with something like
n100<-rnorm(100,0,1)
and add an outlier with
n100[10]<-4
then
qqnorm(n100)
visually shows the point 4 as an outlier
and calculating the probablity of a value of 4 or bigger in 100 samples of norm(0,1)
gives
> 1-exp(log(pnorm(4,0,1))*100)
[1] 0.003162164
If I have more than 1 sample above outlier threshold the math is a
2009 Sep 12
1
medcouple-based outlier detection in R
I need to detect outliers in a large data set which is highly right-skewed. I plan to use medcouple-based outlier detection. Is there any support for medcouple-based outlier detection in R? Are there any other routines in R to perform outlier detection in highly right-skewed data?
Manuj Sharma
See the Web's breaking stories, chosen by people like you. Check out Yahoo! Buzz.
2010 Jul 26
1
Outlier detection in bimodal distribution
Hi,
I was looking for a package that would help with outlier detection for bimodal
distributions. I have tried 'outliers' and 'extremevalues' packages, but am not
sure if they are ok for bimodal distribution.
Any help would be highly appreciated!
thanks,
[[alternative HTML version deleted]]
2010 Nov 30
3
Outlier statistics question
I have a statistical question.
The data sets I am working with are right-skewed so I have been
plotting the log transformations of my data. I am using a Grubbs Test
to detect outliers in the data, but I get different outcomes depending
on whether I run the test on the original data or the log(data). Here
is one of the problematic sets:
fgf2p50=c(1.563,2.161,2.529,2.726,2.442,5.047)
2009 Feb 14
6
Outlier Detection for timeseries
Hello R users,
Can someone tell if there is a package in R that can do outlier detection
that give outputs simiilar to what I got from SAS below.
Many thanks in advance for any help!
Outlier Details
Approx
Chi-
2012 Feb 09
1
Outlier removal techniques
Hello,
I need to analyse a data matrix with dimensions of 30x100.
Before analysing the data there is, however, a need to remove outliers from
the data.
I read quite a lot about outlier removal already and I think the most common
technique for that seems to be Principal Component Analysis (PCA). However,
I think that these technqiue is quite subjective. When is an outlier an
outlier?
I uploaded
2005 Apr 22
2
Hoaglin Outlier Method
I am a new user of R so please bear with me. I have reviewed some R books,
FAQs and such but the volume of material is great. I am in the process of
porting my current SAS and SVS Script code to Lotus Approach, R and
WordPerfect.
My question is, can you help me determine the best R method to implement
the Hoaglin Outlier Method? It is used in the Appendix A and B of the fo
llowing link.
2011 May 04
1
Outlier removal by Principal Component Analysis : error message
Hi,
I am currently analysis Raman spectroscopic data with the hyperSpec package.
I consulted the documentation on this package and I found an example
work-flow dedicated to Raman spectroscopy (see the address :
http://hyperspec.r-forge.r-project.org/chondro.pdf)
I am currently trying to remove outliers thanks to PCA just as they did in
the documentation, but I get a message error I can't
2006 Mar 14
2
bwplot and outlier symbols
Hi,
I was just trying to figure out how to beautify the output of my
bwplot-output. Altogether I figured most of the things out on my own. The
one thing which puzzles me though are the symbols for the outliers.
I can easily change the form of the median symbol by using "pch" but I
don't know how to do this for outliers. Obviously the "outpch" of the
2008 Jun 18
2
randomForest outlier
I try to use ?randomForest to find variables that are the most important to
divide my dataset (continuous, categorical variables) in two given groups.
But when I plot the outliers:
plot(outlier(FemMalSex_NAavoid88.rf33, cls=FemMalSex_NAavoid88$Sex),
type="h",col=c("red","green")[as.numeric(FemMalSex_NAavoid88$Sex)])
it seems to me that all my values appear as
2010 Jan 19
5
How to detect and exclude outliers in R?
Suppose I am reading data from a file and the data contains some outliers. I
want to know if it is possible in R to automatically detect outliers in a
dataset and remove them
--
View this message in context: http://n4.nabble.com/How-to-detect-and-exclude-outliers-in-R-tp1017285p1017285.html
Sent from the R help mailing list archive at Nabble.com.
2004 Jan 21
1
outlier identification: is there a redundancy-invariant substitution for mahalanobis distances?
Dear R-experts,
Searching the help archives I found a recommendation to do multivariate
outlier identification by mahalanobis distances based on a robustly estimated
covariance matrix and compare the resulting distances to a chi^2-distribution
with p (number of your variables) degrees of freedom. I understand that
compared to euclidean distances this has the advantage of being scale-invariant.
2004 Sep 23
6
detection of outliers
Hi,
this is both a statistical and a R question...
what would the best way / test to detect an outlier value among a series of 10 to 30 values ? for instance if we have the following dataset: 10,11,12,15,20,22,25,30,500 I d like to have a way to identify the last data as an outlier (only one direction). One way would be to calculate abs(mean - median) and if elevated (to what extent ?) delete the
2012 Sep 28
2
changing outlier shapes of boxplots using lattice
Hello
This is Elaine.
I am using package lattice to generate boxplots.
Using Richard's code, the display was almost perfect except the outlier
shape.
Based on the following code, the outliers are vertical lines.
However, I want the outliers to be empty circles.
Please kindly help how to modify the code to change the outlier shapes.
Thank you.
code
package (lattice)
dataN <-
2009 Aug 11
0
outlier detection test for large data set
Hello,
Which outlier detection test is robust for large dataset ?
I think the detection test in the package 'outliers' are suitable for small data set (between 3 and 100).
Do you know one for 20000 or more values ?
Thanks,
- Martial
_________________________________________________________________
Inédit ! Des Emoticônes Déjantées! Installez les dans votre Messenger !
[[alternative
2004 Jun 30
1
outlier tests
I have been learning about some outlier tests -- Dixon
and Grubb, specifically -- for small data sets. When
I try help.start() and search for outlier tests, the
only response I manage to find is the Bonferroni test
avaiable from the CAR package... are there any other
packages the offer outlier tests? Are the Dixon and
Grubb tests "good" for small samples or are others
more
2011 Oct 20
2
How to remove multiple outliers
Hi All,
I am working on the dataset in which some of the variables have more than
one observations with outliers .
I am using below mentioned sample script
library(outliers)
x1 <- c(10, 10, 11, 12, 13, 14, 14, 10, 11, 13, 12, 13, 10, 19, 18, 17,
10099, 10099, 10098)
outlier_tf1 = outlier(x1,logical=TRUE)
find_outlier1 = which(outlier_tf1==TRUE, arr.ind=TRUE)
beh_input_ro1 =
2009 Feb 14
2
implementing Grubbs outlier test on a large dataframe
Hi!
I'm trying to implement an outlier test once/row in a large dataframe.
Ideally, I'd do this then add the Pvalue results and the number flagged as
an outlier as two new separate columns to the dataframe. Grubbs outlier
test requires a vector and I'm confused how to make each row of my dataframe
a vector, followed by doing a Grubbs test for each row containing the vector
of numbers
2005 Feb 25
2
outlier threshold
For the analysis of financial data wih a large variance, what is the best way to select an outlier threshold?
Listed below, is there a best method to select an outlier threshold and how does R calculate it?
In R, how do you find the outlier threshold through an interquartile range?
In R, how do you find the outlier threshold using the hist command?
In R, how do you find the outlier threshold
2012 Oct 31
1
gauss fit with outlier removal
I have distribution that are gaussian to a good approximation. I fit a
gaussian to these distributons. Once in a while there is an outlier. Could
someone suggest a robust method (R package already?) that removes those
outliers and redoes the gaussian fit to get a better fit? Thanks.
[[alternative HTML version deleted]]