thr3ads.net - R help - [R] diff question [Jan 2015]

If this information is useful, please help other people find it:
Share via:

Troels Ring

2015-Jan-11 08:29 UTC

[R] diff question

R version 3.1.1 (2014-07-10) -- "Sock it to Me"
Copyright (C) 2014 The R Foundation for Statistical Computing
Platform: x86_64-w64-mingw32/x64 (64-bit)

Dear friends - I have a small problem with diff (I guess)
I made a sequence with fixed interval between consecutive elements - and 
hence thought the  diff would be as specified
but had a vector with apparently identical 12 elements returned from diff
tt <- seq(0,20,by=0.02)
unique(diff(tt)) #[1] 0.02 0.02 0.02 0.02 0.02 0.02 0.02 0.02 0.02 0.02 
0.02 0.02
Trying to see if these elements in diff were duplicated
duplicated(diff(tt))
#[1] FALSE  TRUE FALSE FALSE  TRUE FALSE FALSE  TRUE  TRUE and from
sum(duplicated(diff(tt)))
[1] 988
saw that 12 of the elements in duplicated(diff(tt)) were FALSE. Would it 
be expected that the first was FALSE and the rest TRUE?
|duplicated()|determines which elements of a vector or data frame are 
duplicates of elements with smaller subscripts, and returns a logical 
vector indicating which elements (rows) are duplicates.

All best wishes
Troels
Aalborg, Denmark

	[[alternative HTML version deleted]]

(Ted Harding)

2015-Jan-11 08:48 UTC

head link

[R] diff question

Troels, this is due to the usual tiny difference between numbers
as computed by R and the numbers that you think they are!

  tt  <- seq(0,20,by=0.02)
  dtt <- diff(tt)
  length(dtt)
  # [1] 1000
  r02 <- rep(0.02,1000)
  unique(r02 - dtt)
  # [1]  0.000000e+00  3.469447e-18 -3.469447e-18  1.040834e-17
  # [5] -1.734723e-17  3.816392e-17  9.367507e-17  2.046974e-16 
  # [9]  4.267420e-16 -4.614364e-16 -1.349615e-15 -3.125972e-15

Hoping this helps!
Ted.

On 11-Jan-2015 08:29:26 Troels Ring wrote:> R version 3.1.1 (2014-07-10) -- "Sock it to Me"
> Copyright (C) 2014 The R Foundation for Statistical Computing
> Platform: x86_64-w64-mingw32/x64 (64-bit)
> 
> Dear friends - I have a small problem with diff (I guess)
> I made a sequence with fixed interval between consecutive elements - and 
> hence thought the  diff would be as specified
> but had a vector with apparently identical 12 elements returned from diff
> tt <- seq(0,20,by=0.02)
> unique(diff(tt)) #[1] 0.02 0.02 0.02 0.02 0.02 0.02 0.02 0.02 0.02 0.02 
> 0.02 0.02
> Trying to see if these elements in diff were duplicated
> duplicated(diff(tt))
>#[1] FALSE  TRUE FALSE FALSE  TRUE FALSE FALSE  TRUE  TRUE and from
> sum(duplicated(diff(tt)))
> [1] 988
> saw that 12 of the elements in duplicated(diff(tt)) were FALSE. Would it 
> be expected that the first was FALSE and the rest TRUE?
>|duplicated()|determines which elements of a vector or data frame are 
> duplicates of elements with smaller subscripts, and returns a logical 
> vector indicating which elements (rows) are duplicates.
> 
> All best wishes
> Troels
> Aalborg, Denmark
> 
>       [[alternative HTML version deleted]]
> 
> ______________________________________________
> R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
-------------------------------------------------
E-Mail: (Ted Harding) <Ted.Harding at wlandres.net>
Date: 11-Jan-2015  Time: 08:48:03
This message was sent by XFMail

Rolf Turner

2015-Jan-11 08:57 UTC

head link

[R] diff question

See FAQ 7.31.

cheers,

Rolf Turner

On 11/01/15 21:29, Troels Ring wrote:> R version 3.1.1 (2014-07-10) -- "Sock it to Me"
> Copyright (C) 2014 The R Foundation for Statistical Computing
> Platform: x86_64-w64-mingw32/x64 (64-bit)
>
> Dear friends - I have a small problem with diff (I guess)
> I made a sequence with fixed interval between consecutive elements - and
> hence thought the  diff would be as specified
> but had a vector with apparently identical 12 elements returned from diff
> tt <- seq(0,20,by=0.02)
> unique(diff(tt)) #[1] 0.02 0.02 0.02 0.02 0.02 0.02 0.02 0.02 0.02 0.02
> 0.02 0.02
> Trying to see if these elements in diff were duplicated
> duplicated(diff(tt))
> #[1] FALSE  TRUE FALSE FALSE  TRUE FALSE FALSE  TRUE  TRUE and from
> sum(duplicated(diff(tt)))
> [1] 988
> saw that 12 of the elements in duplicated(diff(tt)) were FALSE. Would it
> be expected that the first was FALSE and the rest TRUE?
> |duplicated()|determines which elements of a vector or data frame are
> duplicates of elements with smaller subscripts, and returns a logical
> vector indicating which elements (rows) are duplicates.

-- 
Rolf Turner
Technical Editor ANZJS
Department of Statistics
University of Auckland
Phone: +64-9-373-7599 ext. 88276
Home phone: +64-9-480-4619

(Ted Harding)

2015-Jan-11 11:41 UTC

head link

[R] diff question

I should have added an extra line to the code below, to complete
the picture. Here it is (see below line "##########".
Ted.

On 11-Jan-2015 08:48:06 Ted Harding wrote:> Troels, this is due to the usual tiny difference between numbers
> as computed by R and the numbers that you think they are!
  tt  <- seq(0,20,by=0.02)
  dtt <- diff(tt)
  length(dtt)
  # [1] 1000
  r02 <- rep(0.02,1000)
  unique(r02 - dtt)
  # [1]  0.000000e+00  3.469447e-18 -3.469447e-18  1.040834e-17
  # [5] -1.734723e-17  3.816392e-17  9.367507e-17  2.046974e-16
  # [9]  4.267420e-16 -4.614364e-16 -1.349615e-15 -3.125972e-15
  ##########
  sum(dtt != 0.02)
  # [1] 998

So only 2 values among the 1000 in diff(tt) are exactly equal to
[R's representation of] 0.2!
> Hoping this helps!
> Ted.
> 
> On 11-Jan-2015 08:29:26 Troels Ring wrote:
>> R version 3.1.1 (2014-07-10) -- "Sock it to Me"
>> Copyright (C) 2014 The R Foundation for Statistical Computing
>> Platform: x86_64-w64-mingw32/x64 (64-bit)
>> 
>> Dear friends - I have a small problem with diff (I guess)
>> I made a sequence with fixed interval between consecutive elements -
and
>> hence thought the  diff would be as specified
>> but had a vector with apparently identical 12 elements returned from
diff
>> tt <- seq(0,20,by=0.02)
>> unique(diff(tt)) #[1] 0.02 0.02 0.02 0.02 0.02 0.02 0.02 0.02 0.02 0.02
>> 0.02 0.02
>> Trying to see if these elements in diff were duplicated
>> duplicated(diff(tt))
>>#[1] FALSE  TRUE FALSE FALSE  TRUE FALSE FALSE  TRUE  TRUE and from
>> sum(duplicated(diff(tt)))
>> [1] 988
>> saw that 12 of the elements in duplicated(diff(tt)) were FALSE. Would
it
>> be expected that the first was FALSE and the rest TRUE?
>>|duplicated()|determines which elements of a vector or data frame are 
>> duplicates of elements with smaller subscripts, and returns a logical 
>> vector indicating which elements (rows) are duplicates.
>> 
>> All best wishes
>> Troels
>> Aalborg, Denmark
>> 
>>       [[alternative HTML version deleted]]
>> 
>> ______________________________________________
>> R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
> 
> -------------------------------------------------
> E-Mail: (Ted Harding) <Ted.Harding at wlandres.net>
> Date: 11-Jan-2015  Time: 08:48:03
> This message was sent by XFMail
> 
> ______________________________________________
> R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
-------------------------------------------------
E-Mail: (Ted Harding) <Ted.Harding at wlandres.net>
Date: 11-Jan-2015  Time: 11:41:27
This message was sent by XFMail

R help - Jan 2015 - diff question

[R] diff question

[R] diff question

[R] diff question

[R] diff question