Torsten.Hothorn at stat.uni-muenchen.de
2009-Jul-16 15:20 UTC
[Rd] model.matrix memory problem (PR#13838)
Hi,
`model.matrix' might kill R with a segfault (on a illposed problem, but
anyway):
mydf <- as.data.frame(sapply(1:40, function(i) gl(2, 100)))
f <- as.formula(paste("~ - 1 + ", paste(names(mydf), collapse =
":"), sep = ""))
X <- model.matrix(f, data = mydf)
*** caught segfault ***
address 0x18, cause 'memory not mapped'
Segmentation fault
Best wishes,
Torsten
--please do not edit the information below--
Version:
platform = i686-pc-linux-gnu
arch = i686
os = linux-gnu
system = i686, linux-gnu
status major = 2
minor = 9.0
year = 2009
month = 04
day = 17
svn rev = 48333
language = R
version.string = R version 2.9.0 (2009-04-17)
Locale:
LC_CTYPE=en_GB.UTF-8;LC_NUMERIC=C;LC_TIME=en_GB.UTF-8;LC_COLLATE=en_GB.UTF-8;LC_MONETARY=C;LC_MESSAGES=en_GB.UTF-8;LC_PAPER=en_GB.UTF-8;LC_NAME=C;LC_ADDRESS=C;LC_TELEPHONE=C;LC_MEASUREMENT=en_GB.UTF-8;LC_IDENTIFICATION=C
Search Path:
.GlobalEnv, package:stats, package:graphics, package:grDevices, package:utils,
package:datasets, package:methods, Autoloads, package:base
Hi, Excerpts from Torsten.Hothorn's message of Thu Jul 16 17:20:10 +0200 2009:> `model.matrix' might kill R with a segfault (on a illposed problem, but anyway): > > mydf <- as.data.frame(sapply(1:40, function(i) gl(2, 100))) > f <- as.formula(paste("~ - 1 + ", paste(names(mydf), collapse = ":"), sep = "")) > X <- model.matrix(f, data = mydf) > > *** caught segfault *** > address 0x18, cause 'memory not mapped' > Segmentation faultI've taken a look at this. The problem lies in lines 1784 - 1798 of src/main/model.c. What happens is that 'k' overflows (signed int). That means k is 0 after the loop an nc is set to 0. That means the allocated model matrix 'x' is too small which results in the observed segfault. I can provide a patch which checks for overflow and throws an error if that is the desired behaviour. Greetings, Olaf Mersmann