thr3ads.net - R help - [R] Joining uneven datasets [May 2008]

If this information is useful, please help other people find it:
Share via:

Neil Gupta

2008-May-29 14:40 UTC

[R] Joining uneven datasets

Hello,

I have quite a simple problem that I believe can be solved quite easily. I
have a dataframe as such:
Symbol       Date     Time Exchange TickType ReferenceNumber Price Size
1 3:YMZ7.EC 12/03/2007 08:30:00       EC        B        83916044 13387    9
2 3:YMZ7.EC 12/03/2007 08:30:00       EC        A        83916045 13388    1
3 3:YMZ7.EC 12/03/2007 08:30:00       EC        B        83916054 13387    9
4 3:YMZ7.EC 12/03/2007 08:30:00       EC        A        83916055 13388    1
5 3:YMZ7.EC 12/03/2007 08:30:00       EC        B        83916533 13386   39
6 3:YMZ7.EC 12/03/2007 08:30:00       EC        A        83916534 13388    1

I wanted the average of the B's and A's. I wrote this to perform that.
 NPrice <-
(YM1207$Price[which(YM1207$TickType=="B")]+YM1207$Price[which(YM1207$TickType=="A")])/2

head(NPrice)
[1] 13387.5 13387.5 13387.0 13386.5 13386.5 13387.0

Now since NPrice is much smaller than the original dataframe, YM1207 I can
not just add NPrice to the set.
How can I put each of those averages back into their corresponding row? I
would even prefer repeating the values for A's as well..

I would like to do it as such..

Symbol       Date     Time Exchange TickType ReferenceNumber Price Size
NPrice
1 3:YMZ7.EC 12/03/2007 08:30:00       EC        B        83916044 13387
9   13387.5
2 3:YMZ7.EC 12/03/2007 08:30:00       EC        A        83916045 13388
1    13387.5
3 3:YMZ7.EC 12/03/2007 08:30:00       EC        B        83916054 13387
9    13387.5
4 3:YMZ7.EC 12/03/2007 08:30:00       EC        A        83916055 13388
1     13387.5
5 3:YMZ7.EC 12/03/2007 08:30:00       EC        B        83916533 13386
39   13387.0
6 3:YMZ7.EC 12/03/2007 08:30:00       EC        A        83916534 13388
1    13387.0

Thankyou much

Neil

	[[alternative HTML version deleted]]

Peter Dalgaard

2008-May-29 14:57 UTC

head link

[R] Joining uneven datasets

Neil Gupta wrote:> Hello,
>
> I have quite a simple problem that I believe can be solved quite easily. I
> have a dataframe as such:
> Symbol       Date     Time Exchange TickType ReferenceNumber Price Size
> 1 3:YMZ7.EC 12/03/2007 08:30:00       EC        B        83916044 13387   
9
> 2 3:YMZ7.EC 12/03/2007 08:30:00       EC        A        83916045 13388   
1
> 3 3:YMZ7.EC 12/03/2007 08:30:00       EC        B        83916054 13387   
9
> 4 3:YMZ7.EC 12/03/2007 08:30:00       EC        A        83916055 13388   
1
> 5 3:YMZ7.EC 12/03/2007 08:30:00       EC        B        83916533 13386  
39
> 6 3:YMZ7.EC 12/03/2007 08:30:00       EC        A        83916534 13388   
1
>
> I wanted the average of the B's and A's. I wrote this to perform
that.
>  NPrice <-
>
(YM1207$Price[which(YM1207$TickType=="B")]+YM1207$Price[which(YM1207$TickType=="A")])/2
>
> head(NPrice)
> [1] 13387.5 13387.5 13387.0 13386.5 13386.5 13387.0
>
> Now since NPrice is much smaller than the original dataframe, YM1207 I can
> not just add NPrice to the set.
> How can I put each of those averages back into their corresponding row? I
> would even prefer repeating the values for A's as well..
>
> I would like to do it as such..
>
> Symbol       Date     Time Exchange TickType ReferenceNumber Price Size
> NPrice
> 1 3:YMZ7.EC 12/03/2007 08:30:00       EC        B        83916044 13387
> 9   13387.5
> 2 3:YMZ7.EC 12/03/2007 08:30:00       EC        A        83916045 13388
> 1    13387.5
> 3 3:YMZ7.EC 12/03/2007 08:30:00       EC        B        83916054 13387
> 9    13387.5
> 4 3:YMZ7.EC 12/03/2007 08:30:00       EC        A        83916055 13388
> 1     13387.5
> 5 3:YMZ7.EC 12/03/2007 08:30:00       EC        B        83916533 13386
> 39   13387.0
> 6 3:YMZ7.EC 12/03/2007 08:30:00       EC        A        83916534 13388
> 1    13387.0
>
>   What can be assumed here? If the alternating B,A pattern is consistent, 
I'd go for (something like)

N <- nrow(YM1207)
ix <- gl(N/2,2)
YM1207$NPrice <- ave(YM1207, ix)

-- 
   O__  ---- Peter Dalgaard             ?ster Farimagsgade 5, Entr.B
  c/ /'_ --- Dept. of Biostatistics     PO Box 2099, 1014 Cph. K
 (*) \(*) -- University of Copenhagen   Denmark      Ph:  (+45) 35327918
~~~~~~~~~~ - (p.dalgaard at biostat.ku.dk)              FAX: (+45) 35327907

Maybe Matching Threads

Search for more seemingly similar threads

R help - May 2008 - Joining uneven datasets

[R] Joining uneven datasets

[R] Joining uneven datasets

Maybe Matching Threads