thr3ads.net - similar to: "weird behaviour of summary.default"

Maximum number of patterns and speed in grep

2012 Jul 06

2

Maximum number of patterns and speed in grep

Hi, I am using R's grep function to find patterns in vectors of strings. The number of patterns I would like to match is 7,700 (of different sizes). I noticed that I get an error message when I do the following: data <- array() for (j in 1:length(x)) { array[j] <- length(grep(paste(patterns[1:7700], collapse = "|"), x[j], value = T)) } When I break this up into 4 chunks of

library doubt

2000 Nov 14

2

library doubt

Hi all. I'm trying to install the packages Rstreams and RPgSQL, but with no success. I did R INSTALL name package but if I try library(Rstreams) I get the following error: Error in library.dynam("Rstreams",pkg,lib): dynamic library 'Rstreams' not found Error in library(Rstreams): .First.lib failed Where did I go wrong? I'm using R 1-1-1 on a Debian GNU/Linux box.

summary.default rounding on numeric seems inconsistent with other R behaviors

2016 Aug 19

3

summary.default rounding on numeric seems inconsistent with other R behaviors

I was wondering if it would make sense to change the default behavior of the following: summary(15555L) ## Min. 1st Qu. Median Mean 3rd Qu. Max. ## 15560 15560 15560 15560 15560 15560 summary.default on numeric values rounds values (not just presentation) to getOption("digits")-3L (or four) digits by default, making those values surprising and less suitable for

digits in summary.default

2006 Sep 14

1

digits in summary.default

Dear all, the number of significant digits in summary default is digits = max(3, getOption("digits") - 3) on my platform this results to be 4. The point is that if you have, say, integer data of magnitude greater than 10^3 the command summary will produce heavily rounded results. A simple example follow: > x <- c(123456,234567,345678) > x [1] 123456 234567 345678 >

error with summary(vector)??

2009 Aug 30

2

error with summary(vector)??

Hello, I get > summary(E) level nodes ave_nodes time Min. : 1 Min. : 1.00 Min. : 10.71 Min. : 0.0000 1st Qu.: 237414 1st Qu.: 2.00 1st Qu.: 19.70 1st Qu.: 0.0100 Median : 749229 Median : 3.00 Median : 27.01 Median : 0.0100 Mean : 767902 Mean : 49.85 Mean : 98.89 Mean : 0.2296 3rd

summary of summaries

2008 Mar 17

1

summary of summaries

Hi, I have a few hundreds files with numerical information of different length but with the same column structure. I use the following code to get summary statistics fplist <- list.files(pattern=".*analysis") for (fp in fplist){ x2 <- read.delim(fp) summary(x2) } Summary gives something like: summary (x2) V1 V2

Summary

2009 Sep 29

1

Summary

My data is called xc and has more than 15 variables. When I used summary(xc) it gave me the detail description of each variable. Summary(xc) Y1 x1 x2 x3 .. Min. :0.0000 Min. : 1.000 Min. : 1.000 Min. : 1.000 1st Qu. :0.0000 1st Qu.: 1.000 1st Qu.: 1.000 1st Qu.: 2.000 Median :1.0000 Median : 1.000

could i change the ouput style on summary?

2006 Jul 03

2

could i change the ouput style on summary?

Dear friends, summary() doesn't give a good ouput style,e.g.: grass soiltem airtem gheight humidity altitude diluo :38 Min. :15.90 Min. :17.70 Min. : 8.00 Min. : 0.2360 high: 43 huanghuacai:32 1st Qu.:19.32 1st Qu.:22.60 1st Qu.:40.00 1st Qu.: 0.3190 low :119 hucao :46 Median :20.20 Median :25.30 Median :60.00

format of summary.lm for 2-way ANOVA

2007 Mar 03

2

format of summary.lm for 2-way ANOVA

Hi, I am performing a two-way ANOVA (2 factors with 4 and 5 levels, respectively). If I'm interpreting the output of summary correctly, then the interaction between both factors is significant: ,---- | ## Two-way ANOVA with possible interaction: | > model1 <- aov(log(y) ~ xForce*xVel, data=mydataset) | | > summary(model1) | Df Sum Sq Mean Sq F value Pr(>F) |

significant figures in summary()

2001 Sep 13

1

significant figures in summary()

Howdy, Should summary() print answers with different numbers of significant figures, as in the following? > dim(iv) [1] 3000 2 > summary(iv) X1 X2 Min. :-4.15261 Min. :-3.271520 1st Qu.:-0.66188 1st Qu.:-0.671205 Median : 0.01233 Median :-0.008175 Mean : 0.01121 Mean : 0.016815 3rd Qu.: 0.67778 3rd Qu.:

different results on linux and windows

2009 May 12

4

different results on linux and windows

Dear R experts, we are preparing an R-package to compute the Oja Median which contains some C++ code in which random numbers are needed. To generate the random numbers we use the following Mersenne-Twister implementation: // MersenneTwister.h // Mersenne Twister random number generator -- a C++ class MTRand // Based on code by Makoto Matsumoto, Takuji Nishimura, and Shawn Cokus // Richard J.

summary(table)

2009 Aug 10

1

summary(table)

Hi, Why when I do a summary on a table it cuts off the other variables? It says Other :58 or Other: 120. how can I get the summary for all the variables under ServLoad.Task and Server.Load and Avg. CPU and Max.CPU? Thanks, summary(probF) Reboot.Id ServLoad.Task Server.Load Avg.CPU Max.CPU Event.Log Min. : 2.00 120067_122395: 5

Apparent bug in summary.data.frame() with columns of Date class and NA's present

2016 Feb 08

1

Apparent bug in summary.data.frame() with columns of Date class and NA's present

Hi all, Based upon an exchange with G?ran Brostr?m on R-Help today: https://stat.ethz.ch/pipermail/r-help/2016-February/435992.html there appears to be a bug in summary.data.frame() in the case where a data frame contains Date class columns that contain NA's and other columns, if present, do not. Example, modified from R-Help: x <- c(18000000, 18810924, 19091227, 19027233, 19310526,

Transforming results of the summary function

2006 Feb 17

1

Transforming results of the summary function

Hi all, I have a question about transforming the data from summary function. Let's say I have a data frame like this: > x = data.frame(a = c(rep("lev1", 5), rep("lev2", 5)), b = c(rnorm(5)+2, rnorm(5))) > x a b 1 lev1 1.5964765 2 lev1 2.2945609 3 lev1 3.5285787 4 lev1 1.4439838 5 lev1 2.2948826 6 lev2 1.7063506 7 lev2 -0.4042742 8 lev2

Summary bug?

1999 Oct 19

2

Summary bug?

Hi, It seems that there's a bug in summary, in the max. output... but max() alone works fine. > hw04.dframe$area ... [41] 1790 1380 1296 2745 798 2306 438649 1481 1559 2450 ... > summary(hw04.dframe) area Min. : 798 1st Qu.: 1349 Median : 1690 Mean : 6962 3rd Qu.: 2306 Max. :438600 ### should read 438649 or, to the point,

summary.default rounding on numeric seems inconsistent with other R behaviors

2016 Aug 19

2

summary.default rounding on numeric seems inconsistent with other R behaviors

It is the old story of defined behaviour and expected outcomes. Hard to change now. So I would suggest you do something like this in your ~/.Rprofile: R> smry <- function(...) summary(..., digits=6) R> smry(155555L) Min. 1st Qu. Median Mean 3rd Qu. Max. 155555 155555 155555 155555 155555 155555 R> Maybe call it Summary() instead. Dirk --

mixed formatting of integer and numeric (e. g., by summary.default())

2013 Apr 05

1

mixed formatting of integer and numeric (e. g., by summary.default())

Hello, eveRybody, I've been trying to find the origin for the following formatting-"inconsistency": E. g., look at the number of digits in summary.defaults()'s output when NAs occur: in my example below the number of NA's is displayed as an integer, the rest as numeric (floating point numbers): > summary.default( c( 1:2, NA)) Min. 1st Qu. Median Mean 3rd Qu.

Accuracy in summary (PR#7121)

2004 Jul 20

1

Accuracy in summary (PR#7121)

Full_Name: Sanford Weisberg Version: 1.9.1 OS: Win XP Submission from: (NULL) (160.94.148.2) > wm <- read.table(url("http://www.stat.umn.edu/~sandy/wmdata0.txt"), header=TRUE) > mean(wm$Spd1) [1] 7.7773 > summary(wm$Spd1) Min. 1st Qu. Median Mean 3rd Qu. Max. 0.222 4.780 7.550 7.780 10.200 21.600 The mean of this variable DOES NOT ROUND to the value

Should NA's in summary() output always be reported???

2006 Oct 12

1

Should NA's in summary() output always be reported???

Consider > summary(1:5) Min. 1st Qu. Median Mean 3rd Qu. Max. 1 2 3 3 4 5 > summary(c(1:5,NA)) Min. 1st Qu. Median Mean 3rd Qu. Max. NA's 1 2 3 3 4 5 1 Wouldn't it be more stringent if "NA's" was also reported in the first case?? Regards S?ren

R summary

2003 May 29

2

R summary

Dear all i use R only a few days and don't understand the difference between fivenum(x) und summary(x). > x [1] 20.77 22.56 22.71 22.99 26.39 27.08 27.32 27.33 27.57 27.81 28.69 29.36 [13] 30.25 31.89 32.88 33.23 33.28 33.40 33.52 33.83 33.95 34.82 > fivenum(x) [1] 20.770 27.080 29.025 33.280 34.820 > summary(x) Min. 1st Qu. Median Mean 3rd Qu. Max. 20.77 27.14

similar to: weird behaviour of summary.default