Displaying 20 results from an estimated 10000 matches similar to: "weird behaviour of summary.default"
2012 Jul 06
2
Maximum number of patterns and speed in grep
Hi,
I am using R's grep function to find patterns in vectors of strings. The
number of patterns I would like to match is 7,700 (of different sizes). I
noticed that I get an error message when I do the following:
data <- integer(length(x))
for (j in seq_along(x))
{
data[j] <- length(grep(paste(patterns[1:7700], collapse = "|"), x[j],
value = TRUE))
}
When I break this up into 4 chunks of
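
A minimal sketch of the chunking idea mentioned above, assuming the goal is to flag which strings match at least one pattern. The function name matches_any, the chunk_size argument, and the toy data are hypothetical and not from the original post; the right chunk size depends on the patterns and the regex engine.

```r
# Hypothetical helper: test each string against the patterns in chunks,
# so that no single alternation regex has to hold every pattern at once.
matches_any <- function(x, patterns, chunk_size = 2000L) {
  # Assign each pattern to a chunk: 1, 1, ..., 2, 2, ...
  chunk_id <- ceiling(seq_along(patterns) / chunk_size)
  hit <- logical(length(x))
  for (chunk in split(patterns, chunk_id)) {
    # grepl() is vectorised over x, so no loop over the strings is needed
    hit <- hit | grepl(paste(chunk, collapse = "|"), x)
  }
  hit
}

# Toy data (illustrative only)
x <- c("apple pie", "banana split", "cherry tart")
patterns <- c("app", "nan", "zzz")
result <- matches_any(x, patterns, chunk_size = 2L)
print(result)
```

Because grepl() works on the whole vector, this also avoids rebuilding the combined pattern once per element of x.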
2000 Nov 14
2
library doubt
Hi all.
I'm trying to install the packages Rstreams and RPgSQL, but with no
success.
I ran R INSTALL with the package name, but when I try library(Rstreams) I get the
following error:
Error in library.dynam("Rstreams",pkg,lib):
dynamic library 'Rstreams' not found
Error in library(Rstreams): .First.lib failed
Where did I go wrong? I'm using R 1.1.1 on a Debian GNU/Linux box.
2016 Aug 19
3
summary.default rounding on numeric seems inconsistent with other R behaviors
I was wondering if it would make sense to change the default behavior of the following:
summary(15555L)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max.
##   15560   15560   15560   15560   15560   15560
summary.default on numeric values rounds values (not just presentation) to getOption("digits")-3L (or four) digits by default, making those values surprising and less suitable for
2006 Sep 14
1
digits in summary.default
Dear all,
the number of significant digits in summary.default is
digits = max(3, getOption("digits") - 3)
which evaluates to 4 on my platform. The point is that if you have,
say, integer data of magnitude greater than 10^3, the summary command
will produce heavily rounded results.
A simple example follows:
> x <- c(123456,234567,345678)
> x
[1] 123456 234567 345678
>
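
A minimal sketch of the rounding described above, assuming the default options(digits = 7) and assuming that the summary method rounds its results with signif(), as the posts in this listing report for the R versions of their time. Newer R versions may behave differently, so the sketch calls signif() directly instead of relying on summary().

```r
x <- c(123456, 234567, 345678)

# Default number of significant digits, assuming options(digits = 7)
digits <- max(3, 7 - 3)

# Rounding to 4 significant digits discards the last two digits
rounded <- signif(x, digits)
print(rounded)

# Asking for more significant digits keeps the values intact
exact <- signif(x, 6)
print(exact)
```

With 4 significant digits, 123456 becomes 123500, which is the kind of "heavily rounded" result the post refers to.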
2009 Aug 30
2
error with summary(vector)??
Hello,
I get
> summary(E)
    level             nodes            ave_nodes           time
Min.   :      1   Min.   :  1.00   Min.   : 10.71   Min.   : 0.0000
1st Qu.: 237414   1st Qu.:  2.00   1st Qu.: 19.70   1st Qu.: 0.0100
Median : 749229   Median :  3.00   Median : 27.01   Median : 0.0100
Mean   : 767902   Mean   : 49.85   Mean   : 98.89   Mean   : 0.2296
3rd
2008 Mar 17
1
summary of summaries
Hi,
I have a few hundred files with numerical information of different lengths
but with the same column structure. I use the following code to get summary
statistics
fplist <- list.files(pattern=".*analysis")
for (fp in fplist){
x2 <- read.delim(fp)
summary(x2)
}
Summary gives something like:
summary (x2)
V1 V2
2009 Sep 29
1
Summary
My data is called xc and has more than 15 variables.
When I used summary(xc) it gave me a detailed description of each
variable.
summary(xc)
Y1 x1 x2
x3 ..
Min. :0.0000 Min. : 1.000 Min. : 1.000 Min. : 1.000
1st Qu. :0.0000 1st Qu.: 1.000 1st Qu.: 1.000 1st Qu.: 2.000
Median :1.0000 Median : 1.000
2006 Jul 03
2
Could I change the output style of summary()?
Dear friends,
summary() doesn't give a good output style, e.g.:
grass soiltem airtem gheight
humidity altitude
diluo :38 Min. :15.90 Min. :17.70 Min. : 8.00 Min. :
0.2360 high: 43
huanghuacai:32 1st Qu.:19.32 1st Qu.:22.60 1st Qu.:40.00 1st Qu.:
0.3190 low :119
hucao :46 Median :20.20 Median :25.30 Median :60.00
2007 Mar 03
2
format of summary.lm for 2-way ANOVA
Hi,
I am performing a two-way ANOVA (2 factors with 4 and 5 levels,
respectively). If I'm interpreting the output of summary correctly,
then the interaction between both factors is significant:
,----
| ## Two-way ANOVA with possible interaction:
| > model1 <- aov(log(y) ~ xForce*xVel, data=mydataset)
|
| > summary(model1)
| Df Sum Sq Mean Sq F value Pr(>F)
|
2001 Sep 13
1
significant figures in summary()
Howdy,
Should summary() print answers with different numbers of
significant figures, as in the following?
> dim(iv)
[1] 3000 2
> summary(iv)
X1 X2
Min. :-4.15261 Min. :-3.271520
1st Qu.:-0.66188 1st Qu.:-0.671205
Median : 0.01233 Median :-0.008175
Mean : 0.01121 Mean : 0.016815
3rd Qu.: 0.67778 3rd Qu.:
2009 May 12
4
different results on linux and windows
Dear R experts,
we are preparing an R-package to compute the Oja Median which contains
some C++ code in which random numbers are needed. To generate the random
numbers we use the following Mersenne-Twister implementation:
// MersenneTwister.h
// Mersenne Twister random number generator -- a C++ class MTRand
// Based on code by Makoto Matsumoto, Takuji Nishimura, and Shawn Cokus
// Richard J.
2009 Aug 10
1
summary(table)
Hi,
Why does a summary of a table cut off the other variables? It says
Other :58 or Other: 120.
how can I get the summary for all the variables under ServLoad.Task and
Server.Load and Avg. CPU and Max.CPU?
Thanks,
summary(probF)
Reboot.Id ServLoad.Task Server.Load
Avg.CPU Max.CPU Event.Log
Min. : 2.00 120067_122395: 5
2016 Feb 08
1
Apparent bug in summary.data.frame() with columns of Date class and NA's present
Hi all,
Based upon an exchange with Göran Broström on R-Help today:
https://stat.ethz.ch/pipermail/r-help/2016-February/435992.html
there appears to be a bug in summary.data.frame() in the case where a data frame contains Date class columns that contain NA's and other columns, if present, do not.
Example, modified from R-Help:
x <- c(18000000, 18810924, 19091227, 19027233, 19310526,
2006 Feb 17
1
Transforming results of the summary function
Hi all,
I have a question about transforming the data from summary function.
Let's say I have a data frame like this:
> x = data.frame(a = c(rep("lev1", 5), rep("lev2", 5)), b = c(rnorm(5)+2, rnorm(5)))
> x
a b
1 lev1 1.5964765
2 lev1 2.2945609
3 lev1 3.5285787
4 lev1 1.4439838
5 lev1 2.2948826
6 lev2 1.7063506
7 lev2 -0.4042742
8 lev2
1999 Oct 19
2
Summary bug?
Hi,
It seems that there's a bug in summary, in the max. output... but max() alone
works fine.
> hw04.dframe$area
...
[41] 1790 1380 1296 2745 798 2306 438649 1481 1559 2450
...
> summary(hw04.dframe)
area
Min. : 798
1st Qu.: 1349
Median : 1690
Mean : 6962
3rd Qu.: 2306
Max. :438600 ### should read 438649
or, to the point,
2016 Aug 19
2
summary.default rounding on numeric seems inconsistent with other R behaviors
It is the old story of defined behaviour and expected outcomes. Hard to
change now.
So I would suggest you do something like this in your ~/.Rprofile:
R> smry <- function(...) summary(..., digits=6)
R> smry(155555L)
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max.
 155555  155555  155555  155555  155555  155555
R>
Maybe call it Summary() instead.
Dirk
--
2013 Apr 05
1
mixed formatting of integer and numeric (e. g., by summary.default())
Hello, eveRybody,
I've been trying to find the origin for the following
formatting-"inconsistency":
E.g., look at the number of digits in summary.default()'s output when
NAs occur: in my example below the number of NA's is displayed as an
integer, the rest as numeric (floating point numbers):
> summary.default( c( 1:2, NA))
Min. 1st Qu. Median Mean 3rd Qu.
2004 Jul 20
1
Accuracy in summary (PR#7121)
Full_Name: Sanford Weisberg
Version: 1.9.1
OS: Win XP
Submission from: (NULL) (160.94.148.2)
> wm <- read.table(url("http://www.stat.umn.edu/~sandy/wmdata0.txt"),
header=TRUE)
> mean(wm$Spd1)
[1] 7.7773
> summary(wm$Spd1)
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max.
  0.222   4.780   7.550   7.780  10.200  21.600
The mean of this variable DOES NOT ROUND to the value
2006 Oct 12
1
Should NA's in summary() output always be reported???
Consider
> summary(1:5)
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max.
      1       2       3       3       4       5
> summary(c(1:5,NA))
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max.    NA's
      1       2       3       3       4       5       1
Wouldn't it be more consistent if "NA's" was also reported in the first case?
Regards
Søren
2003 May 29
2
R summary
Dear all
I have been using R for only a few days and don't understand the difference between
fivenum(x) and summary(x).
> x
[1] 20.77 22.56 22.71 22.99 26.39 27.08 27.32 27.33 27.57 27.81 28.69 29.36
[13] 30.25 31.89 32.88 33.23 33.28 33.40 33.52 33.83 33.95 34.82
> fivenum(x)
[1] 20.770 27.080 29.025 33.280 34.820
> summary(x)
Min. 1st Qu. Median Mean 3rd Qu. Max.
20.77 27.14
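
A short sketch of where the difference comes from, using the data from the post and assuming current base R defaults: fivenum() returns Tukey's hinges, which are always data values or midpoints between two neighbouring data values, while summary() takes its quartiles from quantile(), whose default (type 7) interpolates linearly between order statistics.

```r
x <- c(20.77, 22.56, 22.71, 22.99, 26.39, 27.08, 27.32, 27.33, 27.57, 27.81,
       28.69, 29.36, 30.25, 31.89, 32.88, 33.23, 33.28, 33.40, 33.52, 33.83,
       33.95, 34.82)

# Tukey's five-number summary: min, lower hinge, median, upper hinge, max
hinges <- fivenum(x)

# Quartiles as used by summary(): linear interpolation (default type = 7)
quartiles <- quantile(x, probs = c(0.25, 0.75), names = FALSE)

print(hinges[c(2, 4)])  # lower hinge 27.08, upper hinge 33.28
print(quartiles)        # first quartile 27.14, third quartile 33.2675
```

With 22 observations the lower hinge is the 6th sorted value, whereas the type 7 first quartile lies a quarter of the way from the 6th to the 7th sorted value, which is why the two functions disagree in the second position.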