I've rolled up R-2.3.0.tar.gz a short while ago. This version contains several changes and additions, mostly incremental. See the full list of changes below.

You can get it (in a short while) from

http://cran.r-project.org/src/base/R-2/R-2.3.0.tar.gz

or wait for it to be mirrored at a CRAN site nearer to you. Binaries for various platforms will appear in due course.

There is also a version split for floppies.

For the R Core Team

Peter Dalgaard

These are the md5sums for the freshly created files, in case you wish to check that they are uncorrupted:

eb723b61539feef013de476e68b5c50a  COPYING
a6f89e2100d9b6cdffcea4f398e37343  COPYING.LIB
152bf40b34f471387c623c724e112a58  FAQ
70447ae7f2c35233d3065b004aa4f331  INSTALL
fcb3488d9d8e95e439f4bde1b730a615  NEWS
88bbd6781faedc788a1cbd434194480c  ONEWS
4f004de59e24a52d0f500063b4603bcb  OONEWS
11cc1e9df640ab52e608cf9e695f7354  R-2.3.0.tar.gz
2fb2766d3a35b1c4b525d61dec39f502  R-2.3.0.tar.gz-split.aa
51ac3cd512cbc0f265ca1c8318732c30  R-2.3.0.tar.gz-split.ab
0d5c03adcdc336e2881c1e5a080c8542  R-2.3.0.tar.gz-split.ac
d7a9431dff3a3a7fefd60ce0ac4b39aa  R-2.3.0.tar.gz-split.ad
096386cbc903ea5c5af2a91415b3535b  R-2.3.0.tar.gz-split.ae
7e05f409a33e08df384aa8ae8ec80f90  R-2.3.0.tar.gz-split.af
6b79a851552a70a491454be0cfdfa685  R-2.3.0.tar.gz-split.ag
474a171062b1ea432bfdcb68afd696b7  R-2.3.0.tar.gz-split.ah
08173075ecea19a8cc75a062bf3fa2ac  R-2.3.0.tar.gz-split.ai
c9cdbbed7dce6b1d5a2af4dc4c495fc1  R-2.3.0.tar.gz-split.aj
11cc1e9df640ab52e608cf9e695f7354  R-latest.tar.gz
433182754c05c2cf7a04ad0da474a1d0  README
020479f381d5f9038dcb18708997f5da  RESOURCES

Here is the relevant bit of the NEWS file:

CHANGES IN R VERSION 2.3.0

USER-VISIBLE CHANGES

o In the grid package there are new 'arrow' arguments to grid.line.to(), grid.lines(), and grid.segments() (grid.arrows() has been deprecated). The new 'arrow' arguments have been added BEFORE the 'name', 'gp' and 'vp' arguments so existing code that specifies any of these arguments *by position* (not by name) will fail.
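As an illustration of the grid item above, here is a minimal sketch of a call that names 'name' and 'gp' explicitly, so that it does not depend on where 'arrow' sits in the argument list. It assumes the grid package is available; the grob name "demo-line" and the colour are made up for the example, and draw = FALSE is used only so that no graphics device is needed.

```r
library(grid)

# Name every argument after 'y' so that the call is independent of the
# position of the new 'arrow' argument.
g <- grid.lines(x = c(0.1, 0.9), y = c(0.5, 0.5),
                arrow = arrow(length = unit(0.1, "inches")),
                name = "demo-line",       # hypothetical name
                gp = gpar(col = "blue"),  # arbitrary colour
                draw = FALSE)             # build the grob, do not draw it
```

The same by-name style should apply to grid.line.to() and grid.segments().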
o all.equal() is more stringent, see the PR#8191 bug fix below.
o The data frame argument to transform() is no longer called 'x', but '_data'. Since this is an invalid name, it is less likely to clash with names given to transformed variables. (People were getting into trouble with transform(data, x=y+z).)

NEW FEATURES

o arima.sim() has a new argument 'start.innov' for compatibility with S-PLUS. (If not supplied, the output is unchanged from previous versions in R.)
o arrows() has been changed to be more similar to segments(): for example col=NA omits the arrow rather than as previously (undocumented) using par("col").
o as.list() now accepts symbols (as given by as.symbol() aka as.name()).
o atan2() now allows one complex and one numeric argument.
o The 'masked' warnings given by attach() and library() now only warn for functions masking functions or non-functions masking non-functions.
o New function Axis(), a generic version of axis(), with Date and POSIX[cl]t methods. This is used by most of the standard plotting functions (boxplot, contour, coplot, filled.contour, pairs, plot.default, rug, stripchart) which will thus label x or y axes appropriately.
o pbeta() now uses TOMS708 in all cases and so is more accurate in some (e.g. when lower.tail = FALSE and when one of the shape parameters is very small).
o [qr]beta(), [qr]f() and [qr]t() now have a non-centrality parameter.
o [rc]bind and some more cases of subassignment are implemented for raw matrices. (PR#8529 and PR#8530)
o The number of lines of deparsed calls printed by browser() and traceback() can be limited by the option "deparse.max.lines". (Wish of PR#8638.)
o New canCoerce() utility function in "methods" package.
o [pq]chisq() are considerably more accurate for moderate (up to 80) values of ncp, and lower.tail = FALSE is fully supported in that region. (They are somewhat slower than before.)
o chol(pivot = TRUE) now gives a warning if used on a (numerically) non-positive-definite matrix.
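The transform() change listed above can be illustrated with a small sketch. The data frame 'd' and its columns are invented for the example; the point is only that a new column called 'x' no longer collides with the function's own first argument.

```r
d <- data.frame(y = 1:3, z = 4:6)

# 'x' is now just another new column: the data frame argument of
# transform() is called '_data', not 'x'.
d2 <- transform(d, x = y + z)
d2$x
```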
o chooseCRANmirror() consults the CRAN master (if accessible) to find an up-to-date list of mirrors.
o cov.wt() is more efficient for 'cor = TRUE' and has a new 'method' argument which allows 'Maximum Likelihood'.
o do.call() gains an 'envir' argument.
o eigen() applied to an asymmetric real matrix now uses a tolerance to decide if the result is complex (rather than expecting the imaginary parts of the eigenvalues to be exactly zero).
o New function embedFonts() for embedding fonts in PDF or PostScript graphics files.
o fisher.test() now uses p-values computed via hypergeometric distributions for all 2 by 2 tables. This might be slightly slower for a few cases, but works much better for tables with some large counts. There is a new option to simulate the p-value for larger than 2 x 2 tables.
o for() now supports raw vectors as the set of indices.
o getNativeSymbolInfo() is vectorized for the 'name' argument. It returns a named list of NativeSymbolInfo objects, but is backward compatible by default when called with a character vector of length 1, returning the NativeSymbolInfo object.
o help.search() no longer attempts to handle packages installed prior to R 2.0.0, and reports the current path to the package (rather than where it was originally installed: this information is not shown by the print() method).
o Added "hexmode" to parallel "octmode".
o install.packages() now does tilde expansion on file paths supplied as 'pkgs'.
o install.packages() has additional arguments 'configure.args' and 'clean' which allow the caller to provide additional arguments to the underlying R CMD INSTALL shell command when installing source packages on a Unix-alike.
o is.loaded() has a new argument 'type' to confine the search to symbols for .C, .Fortran, .Call or .External: by default it looks for a symbol which will match any of them. It is now internal and not primitive, so argument matching works in the usual way.
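A few of the smaller items above ("hexmode", for() over raw vectors, and the 'envir' argument of do.call()) can be tried out with the sketch below. The helper name 'add_one' and the environment 'e' are hypothetical, introduced only for illustration.

```r
# "hexmode" parallels "octmode" for displaying integers.
hex <- format(as.hexmode(255L))
oct <- format(as.octmode(8L))

# for() can loop over a raw vector directly.
total <- 0L
for (b in as.raw(c(1, 2, 3))) total <- total + as.integer(b)

# do.call() with 'envir': the function name is looked up from 'e'.
e <- new.env()
assign("add_one", function(x) x + 1, envir = e)  # hypothetical helper
res <- do.call("add_one", list(2), envir = e)
```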
o The symmetry test for matrices used in eigen() has been ``exported'' as the 'matrix' method of a new S3-generic isSymmetric().
o .leap.seconds and the internal adjustment code now know about the 23rd leap second on 2005-12-31: the internal code uses a run-time test to see if the OS does.
o The 'col' argument of legend() now defaults to par("col") (which defaults to "black", the previous default), so that the lines/symbols are shown in the legend in the colour that is used on the plot.
o log2() and log10() call C functions of the same name if available, and will then be more likely to be precise to machine accuracy.
o new.packages() gains a ... argument to pass e.g. 'destdir' to install.packages(). (Wish of PR#8239.)
o nls() now supports 'weights'.
o The vector passed as the first argument of the 'fn' and 'gr' arguments of optim() has the names (if any) given to argument 'par'.
o options(expressions) is temporarily increased by 500 during error-handling. This enables e.g. traceback() to work when the error is reaching the limit on the nesting of expressions.
o page() accepts general R objects, not just names (and previously undocumented) character strings. This allows the object to be specified as a call, for example. More options are allowed in its '...' argument.
o pairs() allows a wider class of inputs, including data frames with date and date-time columns.
o par() and the in-line use of graphical parameters produce more informative error messages, distinguishing between non-existent pars and inappropriate use of valid pars. Graphical parameters 'family', 'lend', 'ljoin' and 'lmitre' can now be set in-line. There is no longer a warning if non-settable pars are used in-line, but there is an appropriate warning if unknown pars are passed. The length limit for the 'family' parameter has been increased to 200 bytes, to allow for the names of some CID-keyed fonts in multi-byte locales.
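The isSymmetric() and optim() items above lend themselves to a short sketch. The matrix, the objective function and the parameter names 'a' and 'b' are invented for the example; method = "BFGS" is an arbitrary choice, not something the NEWS entry prescribes.

```r
m <- matrix(c(2, 1, 1, 3), nrow = 2)
sym <- isSymmetric(m)

# The parameter vector handed to 'fn' carries the names given to 'par',
# so parameters can be picked out by name inside the objective.
seen <- NULL
fn <- function(p) {
  seen <<- names(p)                       # record what optim() passed in
  (p[["a"]] - 1)^2 + (p[["b"]] + 2)^2     # minimum at a = 1, b = -2
}
fit <- optim(c(a = 0, b = 0), fn, method = "BFGS")
```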
o The pdf() device now allows 'family' to be specified in the same generality as postscript().
o The pdf() device writes /FontDescriptor entries for all fonts except the base 14, and does not write font entries for unused fonts.
o Plotmath allows 'vartheta', 'varphi' and 'varsigma' (or 'stigma') as synonyms for 'theta1', 'phi1' and 'sigma1', and the help page has a note for TeX users.
o plot.xy() now takes its default arguments from the corresponding par() settings, so points(type="l") and lines(type="p") behave in the same way (and more obviously, also for type="b").
o poly() has a new argument 'raw', mainly for pedagogical purposes.
o The class "POSIXlt" now supports fractional seconds (as "POSIXct" has always done). The printing of fractional seconds is controlled by the new option "digits.secs", and by default is off.
o postscript() supports family = "ComputerModernItalic" for Computer Modern with italic (rather than slanted) faces.
o The postscript()/pdf() font metrics for the 14 standard fonts (only, not the rest of the common 35) have been updated to versions from late 1999 which cover more glyphs. There are also a few differences in the metrics and hence the output might be slightly different in some cases.
o The way families can be specified for postscript() and pdf() has been expanded to include CID-keyed fonts, with new functions Type1Font() and CIDFont() to set up such font families.
o prettyNum() has new arguments 'preserve.width' and 'zero.print'. When the former is not "none", as in calls from format() and formatC(), the resulting strings are kept at the desired width when possible even after adding of 'big.mark' or 'small.mark'.
o proc.time() and system.time() now record times to 1ms accuracy where available (most Unix-like systems).
o The initialization methods for the quasi() family have been changed to depend on the variance function, and in particular to work better for the "mu(1-mu)" variance function.
  (PR#8486)
o read.table() gains a 'flush' argument passed to scan().
o require() now takes a 'lib.loc' argument.
o The second argument 'size' to sample() is required to have length 1, so that errors when supplying arguments are more easily detected.
o The default is now compress = !ascii in save() (but not save.image).
o scan() and write.table() now have some interruptibility, which may be useful when processing very large files.
o A new heuristic test, seemsS4Object() is supplied, along with a similar C-level test, R_seemsS4Object(object). The test detects probable S4 objects by their class's attribute. See the help page.
o S3 classes can now be made non-virtual S4 classes by supplying a prototype object in the arguments to setOldClass().
o splinefun() returns a function that now also has a 'deriv' argument and can provide up to the 3rd derivative of the interpolating spline, thanks to Berwin Turlach.
o stopifnot(A) now gives a better error message when A has NAs, and uses "not all TRUE" when A has length >= 2.
o str()'s default method has a new argument 'strict.width' which can be used to produce strict 'width' conforming output. A new options(str = list(strict.width = *)) setting allows this to be controlled for a whole session.
o summary.nls() has a new argument 'correlation' that defaults to FALSE (like summary.lm).
o Sys.sleep() has sub-millisecond resolution on Unix-alikes with gettimeofday().
o Sys.time() now has sub-millisecond accuracy on systems supporting the POSIX call gettimeofday, and clock-tick accuracy on Windows.
o The new function timestamp() adds a time stamp to the saved command history on consoles which support it.
o New function tcrossprod() for efficiently computing x %*% t(x) and x %*% t(y).
o The suffix used by tempfile() is now in hex on all platforms and guaranteed to be at least 6 hex digits (usually 8).
o trace() now works more consistently and more like its documentation, in particular the assertions about old tracing being removed for new.
  For debugging purposes (of R) a mechanism for debugging the trace computations themselves was added. See trace.R.
o The implementation of trace() has been made more general by calling a function to do the trace interaction, and recover() now detects trace calls to trim the irrelevant code underneath.
o unserialize() can now also read a byte stream from a raw vector.
o The useDynLib() directive in the NAMESPACE file now accepts the names of the native/foreign symbols that are to be resolved in the DLL for use in .C/.Call/.Fortran/.External calls. These can be used as regular R variables instead of the (routine name, PACKAGE) pairs currently recommended. Alternative names can be given for the R variables mapping to these symbols. The native routine registration information can also be used directly via useDynLib(name, .registration = TRUE). See the 'Writing R Extensions' manual for more details. checkFF() (package 'tools') has been updated accordingly.
o validObject() has an option complete=TRUE that recursively checks the objects in the slots. Not used when new(...) checks validity.
o New Vectorize() function, a wrapper for mapply().
o write.ftable() has gained an argument 'append = FALSE' (thanks to Stephen Weigand).
o On Unix-alikes, X11() now has arguments to request the initial position of the window, and 'gamma' defaults to the value of getOption("gamma"). These changes are consistent with the windows() device.
o X11() and the Unix-alike data entry window can have properties (including geometry) set by X resources: see their help files.
o xy.coords() & xyz.coords() now have NULL defaults for their 'y' or 'y' and 'z' arguments. This is more consistent with their earlier documentation, and may be convenient for using them.
o Non-syntactic names of list elements are now printed quoted by backticks rather than double quotes.
o There is some basic checking for imminent C stack overflow (when the evaluation depth and the user interrupts are checked).
  On systems with suitable OS support (not Windows), segfaults from C stack overflow are caught and treated as an R error. New function Cstack_info() reports on stack size and usage. options(expressions) reverts to the default of 5000 now stack checking is in place.
o Package tcltk does not try to initialize Tk on Unix-alikes unless a DISPLAY variable is present. This allows packages dependent on tcltk to be installed without access to an X server.
o The code used to guess timezone offsets where not supplied by the OS uses a different algorithm that is more likely to guess the summer-time transitions correctly.
o Package tools contains translation tables 'Adobe_glyphs' and 'charset_to_Unicode'.
o Changed the environment tree to be rooted in an empty environment, available as emptyenv(). baseenv() has been modified to return an environment with emptyenv() as parent, rather than NULL.
o gettext has been updated to 0.14.5.
o PCRE has been updated to version 6.4.
o The method $.DLLInfo resolves the specified symbol in the DLL, returning a NativeSymbolInfo object. Use [[ to access the actual values in the DLLInfo object.
o On systems with either vasprintf or both va_copy and a vsnprintf which reports the size of buffer required, connections such as gzfile() and bzfile() can now write arbitrarily long lines, not just 100000 chars.
o The R session temporary directory is now set in C code using the same algorithm whether or not the shell front-end is used and on all platforms. This looks at environment variables TMPDIR, TMP and TEMP in turn, and checks if they point to a writable directory.
o Some of the classical tests put unnecessary restrictions on the LHS in the formula interface (e.g., t.test(x+y ~ g) was not allowed).
o On suitably equipped Unix-alike systems, segfaults, illegal operations and bus errors are caught and there is a simple error-handler which gives the user some choice as to what to do in interactive use. [Experimental.]
  On Windows access violations and illegal instructions are caught with a simple error handler. [Experimental.]
o Tracebacks now include calls such as .C/.Fortran/.Call, which will help if errors occur in R code evaluated by compiled code and in tracebacks presented by the segfault etc handlers.
o Treatment of signature objects and method definition objects has been modified to give cleaner printing and more consistency in the treatment of signatures. A sometimes useful utility, methodSignatureMatrix(), is now exported.
o R refrains from printing a final EOL upon exiting the main loop if the quiet flag is on and if the save action is known (e.g. this is true for --slave).

DEPRECATED & DEFUNCT

o The deprecated and undocumented use of atan() with two arguments has been removed: instead use atan2().
o write.table0() is defunct in favour of write.table().
o format.char() is defunct in favour of format.default().
o Support for the long-deprecated (and no longer documented) arguments --min-vsize --min-nsize --max-vsize --max-nsize --vsize --nsize of R CMD BATCH has been removed.
o The 'debian' subdirectory has been removed from the sources.
o The 'vfont' argument of axis() and mtext() has been removed: use par(family=) instead.
o The unused graphical parameter "type" has been removed: it invited confusion with the 'type' argument to default methods of plot(), points() and lines().
o nlsMethod() and profiler() are no longer exported from the stats namespace (and nlsMethod.plinear() is no longer registered as a method, as nlsMethod() was not generic).
o The re-named tcltk functions tkcmd, tkfile.tail, tkfile.dir, tkopen, tkclose, tkputs, tkread are now formally deprecated.
o Argument 'col' of bxp() is now formally deprecated.
o Use of NULL as an environment is deprecated and gives a warning.
o postscriptFont() is deprecated in favour of Type1Font() (which is just a change of name).
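Two of the deprecations above have direct replacements that can be sketched briefly: atan2() for two-argument atan(), and an explicit environment such as baseenv() or emptyenv() where NULL used to be passed. The expression evaluated below is arbitrary and chosen only for illustration.

```r
# Two-argument atan() is gone; atan2(y, x) is the replacement.
angle <- atan2(1, 1)

# Instead of NULL, name the environment explicitly.
three <- eval(quote(sum(1, 2)), envir = baseenv())

# baseenv() now has the empty environment as its parent.
root <- parent.env(baseenv())
```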
o La.chol() and La.chol2inv() are deprecated (they have since R 1.7.0 been the same as the default options of chol() and chol2inv).
o La.svd(method = "dgesvd") is deprecated.
o The use of install.R and R_PROFILE.R files in packages is deprecated: use the DESCRIPTION file instead to arrange to save an image or to load dependent packages. The following command-line options to INSTALL are deprecated (use the fields in the DESCRIPTION file instead): -s --save --no-save --lazy --no-lazy --lazy-data --no-lazy-data
o Graphical parameter 'tmag' (which is long unused) is deprecated.

INTERNATIONALIZATION

A set of patches supplied by Ei-ji Nakama has been incorporated.

o New postscript encodings for CP1253, CP1257 and Greek (ISO 8859-7).
o Support for East Asian CID-keyed fonts in pdf() and postscript(). Although these usually contain Latin characters no accurate AFMs are available and so CID-keyed fonts are intended only for use with CJK characters.
o Wide-character width functions wc[s]width are provided that overcome problems found with OS-supplied ones (and those previously used by R on Windows). This means that double-width CJK characters are now supported on all platforms. It seems that the width of some characters (and not just CJK characters) depends on which CJK locale's fonts are in use and also on the OS. Revised wide-character classification functions are provided for use on Windows, AIX and MacOS X to replace deficient OS-supplied ones.
o There is support for MBCS charsets in the pictex() graphics device, and rotated (by 90 degrees) text may work better.
o The \u (and \U except on Windows) notation for characters which is supported by the parser in all MBCS charsets is now always interpreted as a Unicode point, even on platforms which do not encode wchar_t in Unicode. These are now a syntax error in single-byte locales.
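For code that still calls the deprecated La.chol() and La.chol2inv() mentioned at the start of the passage above, the plain chol() and chol2inv() calls should be drop-in replacements. A minimal sketch, with an arbitrary positive-definite matrix made up for the example:

```r
A <- matrix(c(4, 2, 2, 3), nrow = 2)   # small positive-definite matrix

R <- chol(A)          # upper-triangular factor, A = t(R) %*% R
Ainv <- chol2inv(R)   # inverse of A computed from its Cholesky factor

# Sanity check: the product should be the identity up to rounding.
ok <- isTRUE(all.equal(Ainv %*% A, diag(2)))
```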
o The default encoding for postscript() and pdf() is chosen to be suitable for the current locale, if that is a single-byte locale which is supported. This covers European (including Greek) and Cyrillic languages. In UTF-8 locales, a suitable single-byte encoding is chosen for postscript() and pdf(), and text translated to it.
o xfig() gains an 'encoding' argument.
o There are some message translations into Spanish.

INSTALLATION CHANGES

o The encoding files for pdf()/postscript() have been moved to directory 'enc' in package 'grDevices'.
o Support for MBCS is only enabled if iconv is found and it supports enough conversions. (libiconv does.)
o In an MBCS locale, make check now translates the graphics examples from Latin-1. This ensures that they will work correctly in UTF-8: it is possible that in other MBCS locales they will now fail (rather than work completely incorrectly).
o There is a new test, 'test-Docs', which as part of 'make check-devel' tests the code in the documentation. Currently it runs doc/manual/R-{exts,intro}.R and the compiled code in R-exts.c.
o The workaround to allow an external LAPACK-containing BLAS such as libsunperf to be used with the internal LAPACK has been removed. If you have such a library you may now need to use --with-lapack. It is no longer possible to use some older versions of libsunperf, e.g. Forte 7 on 64-bit builds.
o A substitute for mkdtemp is provided, so it is now always used for R_TempDir.
o Most of the functions checked for by 'configure' also have declarations checked for in the appropriate header.
o The top-level documentation files AUTHORS COPYING.LIB COPYRIGHTS FAQ RESOURCES THANKS have been moved to doc, and COPYING and NEWS are installed there. The file Y2K has been removed from the distribution.
o The extension .lo is no longer used in building R (only in the optional build of libRmath.so): this allows a considerable simplification of the Makefiles.
o Direct support for f2c has been removed: it can still be used via a script which makes it look like a Fortran compiler. (src/scripts/f77_f2c is an example of such a script.)
o There is a new flag SAFE_FFLAGS which is used for the compilation of dlamc.f. It is set by configure for known problem cases (recent g77 and gfortran), but can be overridden by the user.
o The standard autoconf macros for large-file support are now used, and these are enabled unless --disable-largefile is specified. This replaces --enable-linux-lfs (and is now selected by default).
o Visibility attributes are used where supported (gcc4/gfortran on some platforms, also gcc3/g77 on FC3 and partially elsewhere). The main benefit should be faster loading (and perhaps better optimized code) in some of the dynamic shared objects (e.g. libR.so and stats.so).
o The *PICFLAGS are taken to be -fpic rather than -fPIC where possible. This will make no difference on most platforms: -fPIC is needed on Sparc (and still used there), but -fpic should give slightly better performance on PowerPC (although -fPIC is used on PPC64 as it is needed to build libR.so there).
o More use is made of inlining for small utility functions such as isReal. Because this can only be done portably with C99 constructs (and we know of no actual implementation), this is only done for the GNU C compiler.
o There is an experimental feature to allow shared installations of sub-architectures. See the R-admin manual.
o All platforms now use R's internal implementation of strptime, which allows fractional seconds. (The major platforms were already using it.)
o The dlcompat work-around for old Mac OS X systems (<= 10.2) has been removed. External dlcompat must be installed if needed.

UTILITIES

o R CMD check now uses an install log by default.
o R CMD check works for packages whose package name is different from the directory name in which it is located.
o R CMD INSTALL now uses more randomness in the temporary directory name even on systems without mktemp -d.
o R CMD f77 has been removed now f2c is no longer supported.
o The version string shown in the startup message and by "R --version", and that stored in variable R.version.string are now in exactly the same format.
o The base name of a help file needs to be valid as part of a file:// URL, so R CMD check now checks the names are ASCII and do not contain %.
o R CMD check now warns about unknown sections in Rd files, and invalid names for help, demo and R files, as well as unlikely file names in the 'src' directory. The latter is controlled by option --check-subdirs and by default is done if checking a tarball without a configure script. R CMD build excludes invalid files in the 'man', 'R' and 'demo' subdirectories.
o \usepackage[noae]{Sweave} in the header of an Sweave file suppresses auto-usage of the ae package ("almost European" fonts) and T1 input encoding.

DOCUMENTATION

o Rd format now allows \var{} markup inside \code{} and \examples{}.
o Markup such as --, ---, < and > is handled better when converting .Rd files to [C]HTML.
o There is new markup \link[=dest]{name} to generate a link to topic 'dest' which is shown as 'name', and \linkS4class{abc} which expands to \link[=abc-class]{abc}, for cross-referencing the recommended form of documentation for S4 classes.

PACKAGE INSTALLATION

o There is now some support for Fortran 90/95 code in packages: see `Writing R Extensions'.
o Installation of man sources and demos is now done by R code. The restrictions on the names of help files, R files and of demos are now enforced (see `Writing R Extensions').
o Packages which contain compiled code can now have more than one dot in their name even on Windows.
o The Meta/hsearch.rds database saved now contains LibPath="". This information is now always recreated when help.search() is run, but the field is retained for back-compatibility.
o update.packages() now has a '...' argument to be passed to install.packages(), including the formerly separate arguments 'destdir' and 'installWithVers'.
o Make macros AR and RANLIB are now declared in etc/Makeconf for use by packages which wish to make static libraries.

C-LEVEL FACILITIES

o qgamma and rgamma in Rmath.h now check for non-positive arguments.
o The BLAS which ships with R now contains the complete set of double-complex BLAS routines, rather than just those used in R. <R_ext/BLAS.h> has been corrected to add the missing double-precision BLAS functions drotmg and drotm, and to exclude lsame (which is a Lapack auxiliary function and is now declared in <R_ext/Lapack.h>). It also includes the double complex routines added for this release of R provided Fortran doublecomplex is usable on the platform.
o <R_ext/BLAS.h> and <R_ext/Lapack.h> now declare all the entry points as 'extern'.
o The flag SAFE_FFLAGS is made available to packages via etc/Makeconf and R CMD config. It can be used where optimization needs to be defeated, e.g. in LAPACK setup.
o getNativeSymbolInfo has a withRegistrationInfo argument which causes the address field to be a reference to the registration information if it is available for that symbol. If the registration information is not available, the address is a reference to the native symbol. The default is FALSE which is backward compatible, returning just the address of the symbol and ignoring registration information.
o errorcall and warningcall are now declared in <Rinternals.h> (they might be needed in front-ends).
o R_FlushConsole and R_ProcessEvents are now declared in <R.h>.
o The R_Sock* functions supporting socket connections are no longer declared in R-ftp-http.h as they are not loaded into R itself, and are now hidden in the module's DLL on suitable systems.

BUG FIXES

o Quoted arguments to the R script after --args are now passed quoted to the R executable and so will be shown as expected by commandArgs().
  (They were previously split at whitespace even inside quotes on Unix-alikes but not on Windows.)
o axis() now supports pars 'xaxp'/'yaxp' as inline arguments.
o sort() now does not return inappropriate attributes such as "dim" and "tsp": it only returns names. sort(x, partial=) no longer returns unsorted names, and drops names (since it is supplied for efficiency).
o Use of non-central F in pf() gives accurate values for larger ncp.
o R CMD build --binary does a better job of cleaning up after failure to re-make vignettes.
o reg-test-1.R tested system(intern=TRUE) which depends on popen and so is not supported on all platforms.
o Changed apparent mis-spelling of "Gibraltar" in dataset 'eurodist'.
o sysconf() is now used to find the number of clock ticks/second: under some circumstances glibc reported CLK_TCK = 60 when the true value was 100.
o identical() was not allowing for embedded nuls in character strings. (NB: the comparison operators including == do not, and never will.)
o The profile() and profiler() methods for "nls" objects now support algorithm = "plinear" and algorithm = "port".
o The signal handlers for signals USR1 and USR2 were not restored if the signal arrived when interrupts were suspended.
o Certain combinations of S4 inheritance could cause inherited methods to override some directly specified methods.
o Some cases of named signatures in calls to setMethod() caused errors.
o all.equal() is now more consistent and "picky" about mismatching attributes, in particular names(); this is a part of the propositions by Andy Piskorkski (PR#8191).
o load() when applied to a connection leaves it open/not as it found it, and checks explicitly for having a binary readable connection.
o The p-values given by stat.anova() (called from several anova() methods) are now NA (rather than spurious) if non-nested models give rise to changes in deviance with a different sign from changes in degrees of freedom.
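The sort() and all.equal() fixes above change what existing code may see, so a small sketch may help. The vectors and the matrix are invented for the example, and the outcomes noted in the comments describe the behaviour as stated in the entries above.

```r
m <- matrix(c(3, 1, 2, 4), nrow = 2)
s <- sort(m)    # a plain sorted vector: the "dim" attribute is dropped

# Same values, different names: all.equal() now reports the mismatch.
x <- c(a = 1, b = 2)
y <- c(a = 1, c = 2)
same_with_names    <- isTRUE(all.equal(x, y))
same_without_names <- isTRUE(all.equal(unname(x), unname(y)))
```

Wrapping the call in isTRUE() is the usual way to use all.equal() in a condition, since it returns a character description rather than FALSE when the objects differ.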
o Built-ins were reported as the relevant call in C-level error()s iff R profiling was in progress. Now they are never reported.
o Too-long signatures (with no names) were not being caught in setMethod().
o Slot names in prototype() are being more thoroughly checked.
o signif() is more likely to follow the 'round to even' rule for exactly representable numbers, e.g. signif(0.25, 1). (Related to PR#8452.)
o nls() now works correctly with some low-dimensional fits, e.g. with one or zero non-linear parameters.
o glm() could give an inappropriate error message if all possible coefficients were invalid (e.g. a log-linear binomial model with no intercept and a not all positive predictor).
o solve() gives clearer error messages for some incorrect usages. (PR#8494 and similar)
o The gaussian() family was missing the 'valideta' component (which could be needed for the "inverse" link function). The starting values supplied by the gaussian family could be invalid for the "log" and "inverse" link functions. This is now reported.
o data.matrix() did not work correctly on zero-row data frames. (PR#8496 and other problems.)
o The DSC comments in the files from postscript(onefile=FALSE) now label all files as having page 1 of 1, as some other software seems to expect that.
o The axis labels chosen for logarithmic axis are now less likely to be linear and inappropriate (when the range is more than 10 and less than 100). (PR#1235)
o Staircase lines (types "s" and "S") are now drawn continuously rather than a point at a time and so line types, mitring and so on work. (PR#2630)
o Calling par(mfg) before doing any plotting resulted in NewPage never being called on the device, which in turn resulted in incorrect output for postscript() and pdf() devices. (Reported by Marc Schwartz in discussion of the non-bug PR#7820.)
o terms.formula needed to add parentheses to formulae with terms containing '|'.
  (PR#8462)
o pbirthday() and qbirthday() now also work for very improbable events {those you are typically *not* interested in}.
o Only source help files starting with an upper- or lower-case letter or digit and extension .Rd or .rd are documented to be processed. This is more liberal in that starting with a digit is now also allowed, but the rule is now enforced.
o nls(algorithm="port") was always taking positive numeric differences and so could exceed the upper bounds.
o methods:::.asEnvironmentPackage() was not allowing for versioned installs.
o .find.package() now reports which package(s) it cannot find in the case it stops with an error.
o The standard Unix-alike version of file.show() gives an informative message if it cannot open a file rather than the (possibly incorrect) 'NO FILE'.
o window() did not allow non-overlapping ranges with extend = TRUE. (PR#8545)
o pbinom(size = 0) now returns correct values (not NaN). (PR#8560)
o [dp]binom(x, *) for x < 0 now always returns 0. (PR#8700) There is an analogous change in pgeom(), pnbinom() and ppois().
o [dqpr]geom and [dpqr]nbinom() now all consistently accept prob = 1 but not prob = 0. qgeom(prob=1) now gives the correct values (not -1).
o INSTALL on Unix-alikes was not loading dependent packages when preparing for lazy-loading.
o qcauchy(1) now gives +Inf instead of just a very large number.
o df(0, f1, *) now properly returns Inf, 1, or 0 for f1 < , = , or > 2.
o qbinom(), qnbinom() and qpois() now use a better search and normally reach the answer very quickly when it is large (instead of being slow or infinite-looping).
o pt(x, df) lost accuracy in the far tails (when |x| > 1e154) for small df (like df = 0.001 for which such extremes are not unlikely).
o dbeta(x, a, b) underflowed internally and incorrectly gave 0 for very small x and a.
o None of the warnings about convergence failures or loss of precision in nmath (distribution and special functions) were being reported to the R user.
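Several of the distribution-function fixes above concern boundary cases that are easy to check directly. The sketch below simply evaluates a few of them; the particular argument values (size = 5, prob = 0.5, and so on) are arbitrary choices for illustration.

```r
p_zero_size <- pbinom(0, size = 0, prob = 0.5)   # size = 0 is handled
d_neg <- dbinom(-1, size = 5, prob = 0.5)        # x < 0 gives 0
p_neg <- pbinom(-1, size = 5, prob = 0.5)        # x < 0 gives 0
q_top <- qcauchy(1)                              # upper limit is +Inf
q_geo <- qgeom(0.5, prob = 1)                    # prob = 1 is accepted
```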
o dnt was missing from standalone nmath (under Unix-alikes).

o split() now accepts factors with numeric (but not storage mode integer) codes.

o The utilities such as 'check' now report active version numbers again, as SVN 'last changed revision' numbers.

o addmargins() did not accept a name for 'FUN', only an expression.

o '+' for POSIXt objects now takes the tzone from whichever object has it, so date+x is the same as x+date if x is numeric.

o mean.default() and var() compute means with an additional pass and so are often more accurate, e.g. the variance of a constant vector is (almost) always zero and the mean of such a vector will be equal to the constant value to machine precision. (PR#1228) sum(), prod(), mean(), rowSums() and friends use a long double accumulator where available and so may be more accurate. (This is particularly helpful on systems such as Sparc and AMD64 where long double gives considerably greater exponent range and precision than double.)

o read.dcf() now gives a warning on malformed lines.

o add1.[g]lm now try harder to use the environment of the formula in the original fit to look for objects such as the 'data' and 'subset' arguments.

o gaussian()$aic was inconsistent with e.g. the lm results from AIC() and extractAIC() for weighted fits: it treated the weights as case weights and not variance factors.

o system() on Unix-alikes ignored non-logical values of 'intern' and treated 'intern = NA' as true.

o as.table() now produces non-NA rownames when converting a matrix of more than 26 rows. (PR#8652)

o Partial sorting used an algorithm that was intended only for a few values of 'partial' and so could be far slower than a full sort. It now switches to a barebones full sort for more than 10 values of 'partial' and uses a more efficient recursive implementation for 2...10.

o summary.glm() returned an estimate of dispersion of Inf for a gaussian glm with zero residual degrees of freedom and then treated that as a known value.
It now uses the estimate NaN, which is consistent with summary.lm().

o Sys.sleep() on Unix-alikes was restricted to about 2147 seconds and otherwise might never have returned. (PR#8678)

o is(obj, Cl) could wrongly report TRUE when Cl was a classUnion and multiple inheritance was involved.

o confint[.lm / .default] used label "100 %" for level = 0.999.

o Empty entries (i.e., extraneous ",") in NAMESPACE files now give a better error message early at parsing time instead of a less comprehensible one later at load time.

o all.equal(n1, n2) could erroneously return NA when n1, n2 contained large integers.

o anova.mlm() didn't handle multi-df effects properly in the single-model case. (PR#8679)

o anova.mlm() had its colnames mangled by data.frame() (needed check.names=FALSE).

o summary.glm() gave an NA estimate of dispersion for fits with zero weights. (PR#8720)

o qhyper() had too small a tolerance for right-continuity on some platforms so was not always an inverse to phyper().

o rownames<-.data.frame() and dimnames<-.data.frame() tested the length(s) of the replacement value(s) before coercion, which can change the length (e.g. for class "POSIXlt").

o max() and min() ignored the largest/smallest representable integer, as well as Inf/-Inf. (PR#8731)

o write.table() assumed factors had integer codes: it now allows malformed factors with numeric codes (and otherwise throws an error).

o Worked around a Solaris restriction which meant that Sys.sleep() was only effective for times of up to one second.

o sink(, split=TRUE) now works correctly, but is allowed only on platforms that support va_copy or __va_copy. (PR#8716)

o factanal(), prcomp() and princomp() now only check that columns in the model frame that will be used are numeric (they previously also checked columns which were part of negative terms in the formula).

o Misuse of $ in apply could corrupt memory. (PR#8718)

o apply() could fail if the function returned NULL (e.g. if there was a single row).
o registerS3method() failed due to a typo. (It was almost never used.)

o Registering an S3 method for an S3 generic in another package that was converted to an S4 generic in the same package as the S3 method, registered the method in the wrong place.

o Recall() used lookup for the function in use and so could fail if that was an S3 method not on the search path.

o Rdconv -t Ssgm failed if it encountered \link[opt]{arg}.

o uniroot() did not give a warning (as documented) if it failed to converge in 'maxiter' steps. (PR#8751)

o eapply (and as.list.environment) did not work for the base environment/namespace. (PR#8761)

o Added protection in configure against systems for which using xmkmf fails to report a C or C++ compiler.

o expand.grid() was constructing a data frame 'by hand' and so setting integer row.names (which are documented to be character). It now sets character row names, and row.names.data.frame() coerces to character.

o qbeta() used == on volatile doubles for its convergence test, which failed with gcc 3.3.x on ix86 Linux. We now use a less fragile test (and lose a negligible amount of accuracy).

o ls.str() was missing inherits=FALSE, and so could have reported on an object of the same name but a different mode in the enclosure of the given environment.

o logLik.nls assumed that sigma^2 had been estimated, but did not count this in the 'df' attribute.


**************************************************
*                                                *
*                2.2 SERIES NEWS                 *
*                                                *
**************************************************


CHANGES IN R VERSION 2.2.1 patched

INSTALLATION CHANGES

o The macro SOCKLEN_T has been replaced by R_SOCKLEN_T to work around a problem with the headers of AIX 5.3.

BUG FIXES

o sub(fixed = TRUE) could get wrong the length of the character string for elements of the result after the first.

o legend() worked out which elements of 'lty' were valid before resizing 'lty', and so could fail if 'lty' was a different length from 'legend'.
o str(<dendrogram>) sometimes used far too many spaces (in 2.2.x).

o eigen(eispack=TRUE) accessed areas off the matrix in some circumstances (some asymmetric matrices with both complex conjugate pair and real eigenvalues).

o strptime() in 2.2.1 sometimes did not set $isdst when it was previously set.

o Another case of infinite influence has been worked around. (An addendum to PR#8367.)

o qr.coef() worked incorrectly with multiple rhs in the LAPACK-using cases. (PR#8476/8)

o rbind.data.frame() gave a corrupt data frame if one of the named arguments was a zero-row data frame. (PR#8506)

o Checks for NULL in the rho argument of the C-level findVar function have been added.

o The C-level substitute function was handling NULL in its 'rho' argument incorrectly.

o The code for pgamma() introduced in 2.1.0 failed for large values of 'shape' where the previous code was perfectly acceptable, despite the claim to be uniformly better. For example, pgamma(0.9e100, 1e100) was NaN. (PR#8528)

o There was no command 'ls' in browser() nor 'next' in debug(), despite the documentation (which has been corrected). Command 'where' in the browser() no longer changes to step-through mode.

o factor.scope() could report incorrectly that interaction terms were not in the upper scope when such terms in the model and the upper scope had different orders for the main effects. (Another manifestation of PR#7842.)

o The "lm" method of drop1() was giving incorrect results for weighted fits (since deviance.lm() was called on a non-"lm" object).

o dotchart() was miscalculating the space for the labels in the left margin. (PR#8681)

o r <- glm(.....); all.equal(r,r) # now gives TRUE instead of an error

o plot.acf() with a multiple time series was sometimes miscalculating the 'ylim' value for the plot after the first. (PR#8705)

--
   O__  ---- Peter Dalgaard             Øster Farimagsgade 5, Entr.B
  c/ /'_ --- Dept. of Biostatistics     PO Box 2099, 1014 Cph. K
 (*) \(*) -- University of Copenhagen   Denmark          Ph:  (+45) 35327918
~~~~~~~~~~ - (p.dalgaard at biostat.ku.dk)                  FAX: (+45) 35327907

_______________________________________________
R-announce at stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-announce