Hilmar Berger
2019-Sep-11 11:26 UTC
[Rd] '==' operator: inconsistency in data.frame(...) == NULL
Sorry, I can't reproduce the example below even on the same machine. However, the following example produces the same error as NULL values in prior examples: > setClass("FOOCLASS", +????????? representation("list") + ) > ma = new("FOOCLASS", list(M=matrix(rnorm(300), 30,10))) > isS4(ma) [1] TRUE > data.frame(a=1:3) == ma Error in matrix(unlist(value, recursive = FALSE, use.names = FALSE), nrow = nr,? : ? length of 'dimnames' [2] not equal to array extent Best, Hilmar On 11/09/2019 12:24, Hilmar Berger wrote:> Another example where a data.frame is compared to (here non-null, > non-empty) non-atomic values in Ops.data.frame, resulting in an error > message: > > setClass("FOOCLASS2", > ???????? slots = c(M="matrix") > ) > ma = new("FOOCLASS2", M=matrix(rnorm(300), 30,10)) > > > isS4(ma) > [1] TRUE > > ma == data.frame(a=1:3) > Error in eval(f) : dims [product 1] do not match the length of object [3] > > As for the NULL/logical(0) cases I would suggest to explicitly test > for invalid conditions in Ops.data.frame and generate a comprehensible > message (e.g. "comparison is possible only for atomic and list types") > if appropriate. > > Best regards, > Hilmar > > > On 11/09/2019 11:55, Hilmar Berger wrote: >> >> In the data.frame()==NULL cases I have the impression that the fact >> that both sides are non-atomic is not properly detected and therefore >> R tries to go on with the == method for data.frames. >> >> From a cursory check in Ops.data.frame() and some debugging I have >> the impression that the case of the second argument being non-atomic >> or empty is not handled at all and the function progresses until the >> end, where it fails in the last step on an empty value: >> >> matrix(unlist(value, recursive = FALSE, use.names = FALSE), >> ??? nrow = nr, dimnames = list(rn, cn)) >-- Dr. Hilmar Berger, MD Max Planck Institute for Infection Biology Charit?platz 1 D-10117 Berlin GERMANY Phone: + 49 30 28460 430 Fax: + 49 30 28460 401 E-Mail: berger at mpiib-berlin.mpg.de Web : www.mpiib-berlin.mpg.de
Hilmar Berger
2019-Sep-14 11:31 UTC
[Rd] '==' operator: inconsistency in data.frame(...) == NULL
Dear all, I did some more tests regarding the == operator in Ops.data.frame (see below).? All tests done in R 3.6.1 (x86_64-w64-mingw32). I find that errors are thrown also when comparing a zero length data.frame to atomic objects with length>0 which should be a valid case according to the documentation. This can be traced to a check in the last line of Ops.data.frame which tests for the presence of an empty result value (i.e. list() ) but does not handle a list of empty values (i.e. list(logical(0))) which in fact is generated in those cases. There is a simple fix (see also below). There are other issues with the S4 class example (i.e. data.frame() == <s4_object with representation as list>) which fails for different reasons. ############################################################################## d_0 = data.frame(a = numeric(0)) # zero length data.frame d_00 = data.frame(numeric(0)) # zero length data.frame without names names(d_00) <- NULL # remove names to obtain value being an empty list() at the end of Ops.data.frame d_3 = data.frame(a=1:3) # non-empty data.frame m_0 = matrix(logical(0)) # zero length matrix #------------------------ # error A: # Error in matrix(if (is.null(value)) logical() else value, nrow = nr, dimnames = list(rn,? : # length of 'dimnames' [2] not equal to array extent d_0 == 1?? # error A d_00 == 1? # <0 x 0 matrix> d_3 == 1?? # <3 x 1 matrix> d_0 == logical(0) # error A d_00 == logical(0) # <0 x 0 matrix> d_3 == logical(0) # error A d_0 == NULL # error A d_00 == NULL # <0 x 0 matrix> d_3 == NULL # error A m_0 == d_0? # error A m_0 == d_00 # <0 x 0 matrix> m_0 == d3?? # error A # empty matrix for comparison m_0 == 1 # < 0 x 1 matrix> m_0 == logical(0) # < 0 x 1 matrix> m_0 == NULL # < 0 x 1 matrix> # All errors above could be solved by changing the last line in Ops.data.frame from # matrix(if (is.null(value)) logical() else value, nrow = nr, dimnames = list(rn, cn)) # to # matrix(if (length(value)==0) logical() else value, nrow = nr, dimnames = list(rn, cn)) # Alternatively or in addition one could add an explicit test for data.frame() == NULL if desired and raise an error ######################################################################################### # non-empty return value but failing in the same code line due to incompatible dimensions. # should Ops.data.frame at all be dispatched for <data.frame> == <S4 object> ? setClass("FOOCLASS", ????????? representation("list") ) ma = new("FOOCLASS", list(M=matrix(rnorm(300), 30,10))) isS4(ma) d_3 == ma # error A ########################################################################################## Best regards, Hilmar Am 11/09/2019 um 13:26 schrieb Hilmar Berger:> Sorry, I can't reproduce the example below even on the same machine. > However, the following example produces the same error as NULL values > in prior examples: > > > setClass("FOOCLASS", > +????????? representation("list") > + ) > > ma = new("FOOCLASS", list(M=matrix(rnorm(300), 30,10))) > > isS4(ma) > [1] TRUE > > data.frame(a=1:3) == ma > Error in matrix(unlist(value, recursive = FALSE, use.names = FALSE), > nrow = nr,? : > ? length of 'dimnames' [2] not equal to array extent > > Best, > Hilmar > > > On 11/09/2019 12:24, Hilmar Berger wrote: >> Another example where a data.frame is compared to (here non-null, >> non-empty) non-atomic values in Ops.data.frame, resulting in an error >> message: >> >> setClass("FOOCLASS2", >> ???????? slots = c(M="matrix") >> ) >> ma = new("FOOCLASS2", M=matrix(rnorm(300), 30,10)) >> >> > isS4(ma) >> [1] TRUE >> > ma == data.frame(a=1:3) >> Error in eval(f) : dims [product 1] do not match the length of object >> [3] >> >> As for the NULL/logical(0) cases I would suggest to explicitly test >> for invalid conditions in Ops.data.frame and generate a >> comprehensible message (e.g. "comparison is possible only for atomic >> and list types") if appropriate. >> >> Best regards, >> Hilmar >> >> >> On 11/09/2019 11:55, Hilmar Berger wrote: >>> >>> In the data.frame()==NULL cases I have the impression that the fact >>> that both sides are non-atomic is not properly detected and >>> therefore R tries to go on with the == method for data.frames. >>> >>> From a cursory check in Ops.data.frame() and some debugging I have >>> the impression that the case of the second argument being non-atomic >>> or empty is not handled at all and the function progresses until the >>> end, where it fails in the last step on an empty value: >>> >>> matrix(unlist(value, recursive = FALSE, use.names = FALSE), >>> ??? nrow = nr, dimnames = list(rn, cn)) >> >
Martin Maechler
2019-Sep-18 08:35 UTC
[Rd] '==' operator: inconsistency in data.frame(...) == NULL
>>>>> Hilmar Berger >>>>> on Sat, 14 Sep 2019 13:31:27 +0200 writes:> Dear all, > I did some more tests regarding the == operator in Ops.data.frame (see > below).? All tests done in R 3.6.1 (x86_64-w64-mingw32). > I find that errors are thrown also when comparing a zero length > data.frame to atomic objects with length>0 which should be a valid case > according to the documentation. This can be traced to a check in the > last line of Ops.data.frame which tests for the presence of an empty > result value (i.e. list() ) but does not handle a list of empty values > (i.e. list(logical(0))) which in fact is generated in those cases. > There is a simple fix (see also below). I'm pretty sure what you write above is wrong: For some reason you must have changed more in your own version of Ops.data.frame : Because there's a line value <- unlist(value, ...) there, value is *not* list(logical(0)) there, but rather logical(0) and then indeed, your proposed line change (at the end of Ops.data.frame) has no effect for the examples you give. Note also that your analysis -- treating all 0-extent data frames or matrices the same -- is very incomplete. A 0 x 0 matrix is not the same as a 0 x 1 matrix etc, and similar for data frames. Here's an extended "testing" script which takes into account some of the above : ##----------------------------------------------------------- d0 <- data.frame(a = numeric(0)) # zero length data.frame d00 <- unname(d0) # zero length data.frame __without names__ d3 <- data.frame(a=1:3) # non-empty data.frame d30. <- d3[,FALSE] # 3 x 0 -- take into account, too ! d30 <- unname(d30.) m01. <- matrix(,0,1, dimnames=list(NULL,"a")) # 0 x 1 matrix with dimnames m01 <- unname(m01.) m00. <- matrix(,0,0, dimnames=list(NULL,NULL)) # 0 x 0 matrix with dimnames m00 <- unname(m00.) m3 <- data.matrix(d3) ##------------------------ ## error A: ## Error in matrix(if (is.null(value)) logical() else value, nrow = nr, dimnames = list(rn, : ## length of 'dimnames' [2] not equal to array extent d0 == 1 # error A d00 == 1 # <0 x 0 matrix> d30. == 1 # <3 x 0 matrix> d30 == 1 # <3 x 0 matrix> d3 == 1 # <3 x 1 matrix> d0 == logical(0) # error A d00 == logical(0) # <0 x 0 matrix> d30. == logical() # <3 x 0 matrix> d30 == logical() # <3 x 0 matrix> d3 == logical(0) # error A d0 == NULL # error A d00 == NULL # <0 x 0 matrix> d30. == NULL # <3 x 0 matrix> d30 == NULL # <3 x 0 matrix> d3 == NULL # error A m00 == d0 # error A m00 == d00 # <0 x 0 matrix> m00 == d3 # error A # 0-length matrix for comparison : identical(m00., m00. == 1) ## 0 x 0 matrix *with* "invisible" dimnames [ NULL, NULL ] identical(m00., m00. == logical(0)) identical(m00., m00. == NULL) identical(m00, m00 == 1) ## 0 x 0 matrix w/o dimnames identical(m00, m00 == logical(0)) identical(m00, m00 == NULL) ## 0 x 1 --------------------- identical(m01., m01. == 1) # < 0 x 1 matrix> *with* dimnames identical(m01., m01. == logical(0)) # " " " identical(m01., m01. == NULL) # " " " identical(m01, m01 == 1) # < 0 x 1 matrix> w/o dimnames identical(m01, m01 == logical(0)) # < 0 x 1 matrix> identical(m01, m01 == NULL) # < 0 x 1 matrix> ##----------------------------------------------------------- Best regards, Martin
Apparently Analagous Threads
- '==' operator: inconsistency in data.frame(...) == NULL
- '==' operator: inconsistency in data.frame(...) == NULL
- '==' operator: inconsistency in data.frame(...) == NULL
- '==' operator: inconsistency in data.frame(...) == NULL
- '==' operator: inconsistency in data.frame(...) == NULL