similar to: Using grubbs test for residuals to find outliers

Displaying 20 results from an estimated 3000 matches similar to: "Using grubbs test for residuals to find outliers"

2009 Feb 14
2
implementing Grubbs outlier test on a large dataframe
Hi! I'm trying to implement an outlier test once/row in a large dataframe. Ideally, I'd do this then add the Pvalue results and the number flagged as an outlier as two new separate columns to the dataframe. Grubbs outlier test requires a vector and I'm confused how to make each row of my dataframe a vector, followed by doing a Grubbs test for each row containing the vector of numbers
2010 Sep 15
1
cochran-grubbs tests results
Hello, I'm new in this R world and I don't know much about statistics, but now I have to analize some data and I've got some first queries yet: I have 5 sets of area mesures and each set has 5 repetitions. My first step is to check data looking for outliers. I've used the outliers package. I have to use the cochran test and the grubbs test in case I find any outlier. The problem
2005 Apr 14
2
grubbs.test
Dear All, I have small samples of data (between 6 and 15) for numerious time series points. I am assuming the data for each time point is normally distributed. The problem is that the data arrvies sporadically and I would like to detect the number of outliers after I have six data points for any time period. Essentially, I would like to detect the number of outliers when I have 6 data points then
2010 Nov 30
3
Outlier statistics question
I have a statistical question. The data sets I am working with are right-skewed so I have been plotting the log transformations of my data. I am using a Grubbs Test to detect outliers in the data, but I get different outcomes depending on whether I run the test on the original data or the log(data). Here is one of the problematic sets: fgf2p50=c(1.563,2.161,2.529,2.726,2.442,5.047)
2004 Jun 30
1
outlier tests
I have been learning about some outlier tests -- Dixon and Grubb, specifically -- for small data sets. When I try help.start() and search for outlier tests, the only response I manage to find is the Bonferroni test avaiable from the CAR package... are there any other packages the offer outlier tests? Are the Dixon and Grubb tests "good" for small samples or are others more
2011 Dec 30
3
good method of removing outliers?
Happy holidays all! I know it's very subjective to determine whether some data is outlier or not... But are there reasonally good and realistic methods of identifying outliers in R? Thanks a lot! [[alternative HTML version deleted]]
2007 Apr 25
1
How to identify and exclude the outliers with R?
Hello, everyone, I want to ask a simple question. If I have a set of data,and I want to identify how many outliers there are in the data.Which packages and functions can I use? Thanks. Shao chunxuan. [[alternative HTML version deleted]]
2017 Nov 19
2
Changeing logarithms
Hi! I'm using a large panel data, and now I have faced some difficulties with my analysis. The predictors are not normally distributed and there are quite many outliers (some of them are influential though). I have tried to change the logarythm, but i'm not sure, how to do that. I want also draw a plot picture in which logarythms of predictors x and y are changed. How could I do that?
2005 Oct 06
0
a question about LMS and what constitutes outliers
Hi, I have been using the lqs function with method='lms'. However the results I get are a little different from the results noted by Rousseeuw & Leroy (Robust Regression and Outlier Detection) and I was wondering how to use these results for outlier detection. I'm using the stackloss dataset, for which the original Rousseeuw et al. program points out that observations 1,2,3,4
2010 Jan 26
0
Trouble Highlighting outliers on Time Series Plot
I am having trouble plotting outliers on time series. Give then following code: ############################################################ # find STL Outliers by weight and append sh2, use Robust # this should allow the initial outliers to be filtered # this section may be commented out. ############################################################ tsSourceDiag <-
2011 Oct 20
2
How to remove multiple outliers
Hi All, I am working on the dataset in which some of the variables have more than one observations with outliers . I am using below mentioned sample script library(outliers) x1 <- c(10, 10, 11, 12, 13, 14, 14, 10, 11, 13, 12, 13, 10, 19, 18, 17, 10099, 10099, 10098) outlier_tf1 = outlier(x1,logical=TRUE) find_outlier1 = which(outlier_tf1==TRUE, arr.ind=TRUE) beh_input_ro1 =
2009 Jun 16
1
How to replace outliers by group median?
Dear R-helpers, Very small amount of outliers can greatly affect the mean and many other statistic of a numeric variable. So, usually we must deal with the outliers properly in the process of data analysis. Here, I want to replace outliers with the group median of the variable. But, I can not construct a good way to do that efficiently, because of I am a newbie to R and programming. Can anybody
2011 Nov 22
1
Capping outliers
Hi Experts, I am new to R, using following sample code for capping outliers using percentile information. Working on large data (30000 observations and 150 variables), loop I am using in the below mentioned code for detecting outliers and capping to upper /lower percentile value is taking much time for the execution. Is there anything wrong with code, can anyone suggest improvement in the script
2011 Sep 28
1
removing outliers in non-normal distributions
Hello, I'm seeking ideas on how to remove outliers from a non-normal distribution predictor variable. We wish to reset points deemed outliers to a truncated value that is less extreme. (I've seen many posts requesting outlier removal systems. It seems like most of the replies center around "why do you want to remove them", "you shouldn't remove them", "it
2016 Apr 07
1
identifying outliers
Thanks for writing this great piece of code. x = rnorm(100) boxplot(x) # you shouldn't see any outliers here although sometimes yow will # lets add some outliers intentionally x = c(21, 20, 25, x) # now 10, 15 and 20 are outliers myboxplot <- boxplot(x) # now you should see your three outliers myboxplot$out # it will print the values of the outliers How does one amend
2012 Apr 18
1
Pierce's criterion
Hello all, I would like to rigorously test whether observations in my dataset are outliers. I guess all the main tests in R (Grubbs) impose the assumption of normality. My data is surely not normal, so I would like to use something else. As far as I can tell from wikipedia, Peirce's criterion is just that. The data I am interested in testing is: 1) Continuous on the unit interval 2)
2009 Aug 19
2
mild and extreme outliers in boxplot
dear all, could somebody tell me how I can plot mild outliers as a circle(?) and extreme outliers as an asterisk(*) in a box-whisker plot? Thanks very much in advance -- View this message in context: http://www.nabble.com/mild-and-extreme-outliers-in-boxplot-tp25040545p25040545.html Sent from the R help mailing list archive at Nabble.com.
2012 Sep 05
1
How to effectively remove Outliers from a binary logistic regression in R
Hallo there, greetings from Germany. I have a simple question for you. I have run a binary logistic model, but there are lots of outliers distorting the real results. I have tried to get rid of the outliers using the following commands: remove = -c(56, 303, 365, 391, 512, 746, 859, 940, 1037, 1042, 1138, 1355) MIGRATION.rebuild <- glm(MIGRATION, subset=remove)
2004 Sep 23
6
detection of outliers
Hi, this is both a statistical and a R question... what would the best way / test to detect an outlier value among a series of 10 to 30 values ? for instance if we have the following dataset: 10,11,12,15,20,22,25,30,500 I d like to have a way to identify the last data as an outlier (only one direction). One way would be to calculate abs(mean - median) and if elevated (to what extent ?) delete the
2009 Dec 10
1
Detectar outliers en un gráfico de dispersión
Hola amigos, esta es mi primera duda, espero que no sea demasiado fácil. Tengo unos datos de dos variables y quiero mostrar recta de regresión y valor de correlación serie0 <- c(0.651, 0.712, 0.614, 0.645, 0.559, 0.647, 0.642, 0.534, 0.616, 0.621, 0.623) serie1 <- c(0.572, 0.641, 0.565, 0.596, 0.518, 0.604, 0.602, 0.501, 0.58, 0.589, 0.596) data <- cbind(serie0, serie1) colnames(data)