thr3ads.net - R devel - [Rd] pbinom with size argument 0 (PR#8560) [Feb 2006]


uht@dfu.min.dk

2006-Feb-03 11:55 UTC

[Rd] pbinom with size argument 0 (PR#8560)

Full_Name: Uffe Høgsbro Thygesen
Version: 2.2.0
OS: linux
Submission from: (NULL) (130.226.135.250)


Hello all.

  pbinom(q=0,size=0,prob=0.5)

returns the value NaN. I had expected the result 1. In fact any value for q
seems to give an NaN. Note that

  dbinom(x=0,size=0,prob=0.5)

returns the value 1.

Cheers,

Uffe
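
A minimal sketch of how the inconsistency described above can be checked: compare pbinom() with the running sum of dbinom() for a few small sizes, including size 0. The helper name `cdf_matches_pmf` is hypothetical and exists only for this illustration. The report above says pbinom() returned NaN under R 2.2.0. On an R build that treats Binomial(0, p) as a point mass at zero, every row should come out as consistent.

```r
# Compare the CDF reported by pbinom() with the cumulative sum of dbinom().
# `cdf_matches_pmf` is a hypothetical helper, not part of R itself.
cdf_matches_pmf <- function(size, prob) {
  q <- 0:(size + 2)                       # include points beyond the support
  cdf_direct <- pbinom(q, size = size, prob = prob)
  cdf_summed <- cumsum(dbinom(q, size = size, prob = prob))
  # all.equal() returns a message (not TRUE) if either vector contains NaN
  isTRUE(all.equal(cdf_direct, cdf_summed))
}

sizes <- 0:5
results <- vapply(sizes, cdf_matches_pmf, logical(1), prob = 0.5)
print(data.frame(size = sizes, consistent = results))
```

A FALSE in the size 0 row would indicate the behaviour reported here, where the two functions disagree.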

(Ted Harding)

2006-Feb-03 14:34 UTC


On 03-Feb-06 uht at dfu.min.dk wrote:
> Full_Name: Uffe Høgsbro Thygesen
> Version: 2.2.0
> OS: linux
> Submission from: (NULL) (130.226.135.250)
> 
> 
> Hello all.
> 
>   pbinom(q=0,size=0,prob=0.5)
> 
> returns the value NaN. I had expected the result 1. In fact any
> value for q seems to give an NaN.

Well, "NaN" can make sense since "q=0" refers to a single sampled
value, and there is no value which you can sample from "size=0";
i.e. sampling from "size=0" is a non-event. I think the probability
of a non-event should be NaN, not 1! (But maybe others might argue
that if you try to sample from an empty urn you necessarily get
zero "successes", so p should be 1; but I would counter that you
also necessarily get zero "failures" so q should be 1. I suppose
it may be a matter of whether you regard the "r" of the binomial
distribution as referring to the "identities" of the outcomes
rather than to how many you get of a particular type. Hmmm.)

> Note that
> 
>   dbinom(x=0,size=0,prob=0.5)
> 
> returns the value 1.
That is probably because the .Internal code for pbinom may do
a preliminary test for "x >= size". This also makes sense, for
the cumulative p<dist> for any <dist> with a finite range,
since the answer must then be 1 and a lot of computation would
be saved (likewise returning 0 when x < 0). However, it would
make even more sense to have a preceding test for "size<=0"
and return NaN in that case since, for the same reasons as
above, the result is the probability of a non-event.

(But it depends on your point of view, as above ... However,
surely the two should be consistent with each other.)

Best wishes,
Ted.

--------------------------------------------------------------------
E-Mail: (Ted Harding) <Ted.Harding at nessie.mcc.ac.uk>
Fax-to-email: +44 (0)870 094 0861
Date: 03-Feb-06                                       Time: 14:34:28
------------------------------ XFMail ------------------------------

ehlers@math.ucalgary.ca

2006-Feb-04 22:05 UTC


(Ted Harding) wrote:
> On 03-Feb-06 Peter Dalgaard wrote:
> 
>>(Ted Harding) <Ted.Harding at nessie.mcc.ac.uk> writes:
>>
>>
>>>On 03-Feb-06 uht at dfu.min.dk wrote:
>>>
>>>>Full_Name: Uffe Høgsbro Thygesen
>>>>Version: 2.2.0
>>>>OS: linux
>>>>Submission from: (NULL) (130.226.135.250)
>>>>
>>>>
>>>>Hello all.
>>>>
>>>>  pbinom(q=0,size=0,prob=0.5)
>>>>
>>>>returns the value NaN. I had expected the result 1. In fact any
>>>>value for q seems to give an NaN.
>>>
>>>Well, "NaN" can make sense since "q=0" refers to a single sampled
>>>value, and there is no value which you can sample from "size=0";
>>>i.e. sampling from "size=0" is a non-event. I think the probability
>>>of a non-event should be NaN, not 1! (But maybe others might argue
>>>that if you try to sample from an empty urn you necessarily get
>>>zero "successes", so p should be 1; but I would counter that you
>>>also necessarily get zero "failures" so q should be 1. I suppose
>>>it may be a matter of whether you regard the "r" of the binomial
>>>distribution as referring to the "identities" of the outcomes
>>>rather than to how many you get of a particular type. Hmmm.)
>>>
>>>
>>>>Note that
>>>>
>>>>  dbinom(x=0,size=0,prob=0.5)
>>>>
>>>>returns the value 1.
>>>
>>>That is probably because the .Internal code for pbinom may do
>>>a preliminary test for "x >= size". This also makes sense, for
>>>the cumulative p<dist> for any <dist> with a finite range,
>>>since the answer must then be 1 and a lot of computation would
>>>be saved (likewise returning 0 when x < 0). However, it would
>>>make even more sense to have a preceding test for "size<=0"
>>>and return NaN in that case since, for the same reasons as
>>>above, the result is the probability of a non-event.
>>
>>Once you get your coffee, you'll likely realize that you got
>>your p's and d's mixed up...
> 
> 
> You're right about the mix-up! (I must mend the pipeline.)
> 
> 
>>I think Uffe is perfectly right: The result of zero experiments will
>>be zero successes (and zero failures) with probability 1, so the
>>cumulative distribution function is a step function with one step at
>>zero ( == as.numeric(x>=0) ).
> 
> 
> I'm perfectly happy with this argument so long as it leads to
> dbinom(x=0,size=0,prob=p)=1 and also pbinom(q=0,size=0,prob=p)=1
> (which seems to be what you are arguing too). And I think there
> are no traps if p=0 or p=1.
> 
> 
>>>(But it depends on your point of view, as above ... However,
>>>surely the two  should be consistent with each other.)
> 
> 
> Ted.
I prefer a (consistent) NaN. What happens to our notion of a
Binomial RV as a sequence of Bernoulli RVs if we permit n=0?
I have never seen (nor contemplated, I confess) the definition
of a Bernoulli RV as anything other than some dichotomous-outcome
one-trial random experiment. Not n trials, where n might equal zero,
but _one_ trial. I can't see what would be gained by permitting a
zero-trial experiment. If we assign probability 1 to each outcome,
we have a problem with the sum of the probabilities.

Peter Ehlers

> 
> --------------------------------------------------------------------
> E-Mail: (Ted Harding) <Ted.Harding at nessie.mcc.ac.uk>
> Fax-to-email: +44 (0)870 094 0861
> Date: 03-Feb-06                                       Time: 15:07:49
> ------------------------------ XFMail ------------------------------

p.dalgaard@biostat.ku.dk

2006-Feb-05 00:33 UTC


P Ehlers <ehlers at math.ucalgary.ca> writes:
> I prefer a (consistent) NaN. What happens to our notion of a
> Binomial RV as a sequence of Bernoulli RVs if we permit n=0?
> I have never seen (nor contemplated, I confess) the definition
> of a Bernoulli RV as anything other than some dichotomous-outcome
> one-trial random experiment.

What's the problem ??

An n=0 binomial is the sum of an empty set of Bernoulli RV's, and the
sum over an empty set is identically 0.

> Not n trials, where n might equal zero,
> but _one_ trial. I can't see what would be gained by permitting a
> zero-trial experiment. If we assign probability 1 to each outcome,
> we have a problem with the sum of the probabilities.

Consistency is what you gain. E.g.

 binom(.,n=n1+n2,p) == binom(.,n=n1,p) * binom(.,n=n2,p)

where * denotes convolution. This will also hold for n1=0 or n2=0 if
the binomial in that case is defined as a one-point distribution at
zero. Same thing as any(logical(0)) etc., really.

-- 
   O__  ---- Peter Dalgaard             Øster Farimagsgade 5, Entr.B
  c/ /'_ --- Dept. of Biostatistics     PO Box 2099, 1014 Cph. K
 (*) \(*) -- University of Copenhagen   Denmark          Ph:  (+45) 35327918
~~~~~~~~~~ - (p.dalgaard at biostat.ku.dk)                  FAX: (+45) 35327907
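
The convolution identity in the message above can be checked numerically. The following is a rough sketch, not code from the thread: `convolve_pmf` and `binom_pmf` are hypothetical helper names, and p = 0.3 is an arbitrary choice. It assumes an R build in which dbinom(0, size = 0, prob = p) returns 1, as reported at the start of the thread.

```r
# Discrete convolution of two PMFs stored on supports 0:(length - 1).
# `convolve_pmf` is a hypothetical helper written for this sketch.
convolve_pmf <- function(f, g) {
  out <- numeric(length(f) + length(g) - 1)
  for (i in seq_along(f)) {
    idx <- i + seq_along(g) - 1
    out[idx] <- out[idx] + f[i] * g
  }
  out
}

# PMF of Binomial(n, p) on its full support 0:n (hypothetical helper).
binom_pmf <- function(n, p) dbinom(0:n, size = n, prob = p)

p <- 0.3
for (n1 in 0:3) for (n2 in 0:3) {
  lhs <- binom_pmf(n1 + n2, p)
  rhs <- convolve_pmf(binom_pmf(n1, p), binom_pmf(n2, p))
  stopifnot(isTRUE(all.equal(lhs, rhs)))  # also covers n1 = 0 and n2 = 0
}

# Empty-set analogues mentioned in the message.
sum(numeric(0))   # 0
any(logical(0))   # FALSE
```

With n1 = 0 the first factor is the one-point distribution at zero, which acts as the identity element for convolution, so the check reduces to the PMF of Binomial(n2, p) being unchanged.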

Uffe Høgsbro Thygesen

2006-Feb-05 20:39 UTC


Hello all

A pragmatic argument for allowing size==0 is the situation where the size is in
itself a random variable (that's how I stumbled over the inconsistency, by
the way).

For example, in textbooks on probability it is stated that:

  If X is Poisson(lambda), and the conditional 
  distribution of Y given X is Binomial(X,p), then 
  Y is Poisson(lambda*p).

(cf. e.g. Pitman's "Probability", p. 400)

Clearly this statement requires Binomial(0,p) to be a well-defined distribution.

Such statements would be quite convoluted if we did not define Binomial(0,p) as
a legal (but degenerate) distribution. The same applies to codes where the size
parameter may attain the value 0.

Just my 2 cents.

Cheers,

Uffe
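
To illustrate the textbook statement quoted above, here is a small numerical sketch. The values lambda = 4 and p = 0.25, the truncation point `x_max`, and the helper `marginal_y` are all assumptions made for the example. It mixes Binomial(x, p) over a Poisson(lambda) size and compares the result with Poisson(lambda*p). The x = 0 term only contributes correctly if Binomial(0, p) is treated as a point mass at zero.

```r
lambda <- 4
p <- 0.25
x_max <- 200   # truncation point; the Poisson(4) mass beyond it is negligible

# P(Y = y) = sum over x of P(X = x) * P(Y = y | X = x)
# `marginal_y` is a hypothetical helper for this sketch.
marginal_y <- function(y) {
  x <- 0:x_max                       # includes size 0
  sum(dpois(x, lambda) * dbinom(y, size = x, prob = p))
}

y <- 0:15
mixed <- vapply(y, marginal_y, numeric(1))
direct <- dpois(y, lambda * p)

# Largest absolute discrepancy between the two PMFs.
max(abs(mixed - direct))
```

If dbinom() returned NaN for size 0, the sum for every y would itself be NaN, which is the practical problem described in this message.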

-----Original message-----
From: pd at pubhealth.ku.dk on behalf of Peter Dalgaard
Sent: Sun 05-02-2006 01:33
To: P Ehlers
Cc: ted.harding at nessie.mcc.ac.uk; Peter Dalgaard; R-bugs at biostat.ku.dk;
r-devel at stat.math.ethz.ch; Uffe Høgsbro Thygesen
Subject: Re: [Rd] pbinom with size argument 0 (PR#8560)

P Ehlers <ehlers at math.ucalgary.ca> writes:
> I prefer a (consistent) NaN. What happens to our notion of a
> Binomial RV as a sequence of Bernoulli RVs if we permit n=0?
> I have never seen (nor contemplated, I confess) the definition
> of a Bernoulli RV as anything other than some dichotomous-outcome
> one-trial random experiment. 
What's the problem ??

An n=0 binomial is the sum of an empty set of Bernoulli RV's, and the
sum over an empty set is identically 0.
> Not n trials, where n might equal zero,
> but _one_ trial. I can't see what would be gained by permitting a
> zero-trial experiment. If we assign probability 1 to each outcome,
> we have a problem with the sum of the probabilities.
Consistency is what you gain. E.g. 

 binom(.,n=n1+n2,p) == binom(.,n=n1,p) * binom(.,n=n2,p)

where * denotes convolution. This will also hold for n1=0 or n2=0 if
the binomial in that case is defined as a one-point distribution at
zero. Same thing as any(logical(0)) etc., really.

-- 
   O__  ---- Peter Dalgaard             Øster Farimagsgade 5, Entr.B
  c/ /'_ --- Dept. of Biostatistics     PO Box 2099, 1014 Cph. K
 (*) \(*) -- University of Copenhagen   Denmark          Ph:  (+45) 35327918
~~~~~~~~~~ - (p.dalgaard at biostat.ku.dk)                  FAX: (+45) 35327907

