Hello,
Today, Brian Ripley committed revision 50000 to R's svn repository.
I took this as an opportunity to do some data analysis of the log and
posted some code and graphics on my blog:
http://romainfrancois.blog.free.fr/index.php?post/2009/10/09/celebrating-R-commit-50000
The plots of the number of commits per day (and per month) indicate a
peak in 2002.
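For reference, counts like those behind the per-day and per-month plots could be computed along the following lines. This is only a sketch: the toy `simple` data frame and its `date` column are assumptions made for illustration, not the actual log data, and the code on the blog may do this differently.

```r
# Toy stand-in for the parsed svn log: one row per commit.
# The `date` column name and all values here are made up for illustration.
simple <- data.frame(
  author = c("ripley", "maechler", "ripley", "hornik"),
  date   = as.Date(c("2002-03-01", "2002-03-15", "2002-04-02", "2003-01-10")),
  stringsAsFactors = FALSE
)

# Commits per day and per month, using the same table() idiom
# as the rest of this post
per.day   <- table(simple$date)
per.month <- table(format(simple$date, "%Y-%m"))

# The busiest month in the toy data
busiest <- names(per.month)[which.max(per.month)]
busiest
```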
Here are the top 20 modified files :
> most.updated <- head( sort( table( data$file ), decreasing=TRUE ), 20 )
> cat( sprintf( "%40s : %6d", names(most.updated), most.updated ), sep = "\n" )
/trunk/NEWS : 5449
/trunk/date-stamp : 2219
/trunk/BUGS : 1485
/trunk/src/gnuwin32/CHANGES : 994
/trunk/configure : 979
/trunk/doc/manual/R-FAQ.texi : 948
/trunk/doc/manual/R-exts.texi : 814
/trunk/doc/manual/R-admin.texi : 666
/trunk/src/main/names.c : 634
/trunk/tests/reg-tests-1.R : 621
/trunk/src/gnuwin32/Makefile : 542
/trunk/FAQ : 471
/trunk/doc/html/faq.html : 458
/trunk/src/include/Defn.h : 455
/trunk/configure.ac : 447
/trunk/src/scripts/check.in : 433
/trunk/src/library/tools/R/QC.R : 397
/branches/unlabeled-1.1139.4/date-stamp : 386
/trunk/src/main/connections.c : 349
/trunk/src/main/plot.c : 325
The most modified file extensions (the second element indicates files
without extensions) :
> head( sort( table( data$extension ), decreasing=T ) , 24 )
      Rd                 c        R       po       in       mo        h
   36766    33999    29849    25221    10215     8806     6280     6147
    texi      gmo     html     save      pot        f      win       m4
    5274     3705     1920     1668     1285     1148     1060     1041
      pl       ac      afm       pm      mac        y packages       sh
     910      527      482      461      402      384      311      215
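One way a column like `data$extension` could be derived from the file paths is sketched here. It assumes `tools::file_ext()` for the extraction, which may not be what the code on the blog does. The sample paths are taken from the top 20 list above. Files without an extension come out as the empty string, which is why one entry in the table has no name.

```r
# A few paths from the top 20 list above
files <- c("/trunk/NEWS", "/trunk/src/main/names.c",
           "/trunk/doc/manual/R-exts.texi", "/trunk/src/scripts/check.in",
           "/trunk/date-stamp")

# tools::file_ext() returns "" when a path has no extension
extension <- tools::file_ext(files)

# Same counting idiom as above
sort(table(extension), decreasing = TRUE)
```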
The number of commits per author :
> sort( table( simple$author ), decreasing=T )
     ripley    maechler      hornik          pd     murdoch       iacus
      20523       10207        6744        4655        1370         915
     leisch         jmc        luke       ihaka     murrell       bates
        743         643         618         481         379         350
   rgentlem     tlumley    urbaneks (no author)      duncan           r
        299         274         274         203         172          96
     thomas      lyndon        paul       guido      martyn     plummer
         63          54          48          39          39          26
   deepayan      falcon        mike
         20          18           1
Many thanks to the R core team for these 50 000 commits.
Romain
--
Romain Francois
Professional R Enthusiast
+33(0) 6 28 91 30 30
http://romainfrancois.blog.free.fr
|- http://tr.im/BcPw : celebrating R commit #50000
|- http://tr.im/ztCu : RGG #158:161: examples of package IDPmisc
`- http://tr.im/yw8E : New R package : sos
Thanks to Dirk for pointing this out on my blog: "thomas" and "tlumley"
are the same person, as are "martyn" and "plummer", and "paul" and
"murrell". The updated number of commits per author is below:
> sort( table( simple$author ), decreasing=T )
     ripley    maechler      hornik          pd     murdoch       iacus
      20523       10207        6744        4655        1370         915
     leisch         jmc        luke       ihaka     murrell       bates
        743         643         618         481         427         350
    tlumley    rgentlem    urbaneks (no author)      duncan           r
        337         299         274         203         172          96
    plummer      lyndon       guido    deepayan      falcon        mike
         65          54          39          20          18           1
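The merge of duplicate author names can be sketched as a lookup from alias to canonical name followed by a sum per name. The `alias` vector and the other variable names are hypothetical helpers; the counts are the ones posted in the first message for the six affected names. On the real data the same lookup could be applied to `simple$author` before calling `table()`.

```r
# Counts as first posted, for the affected names only
commits <- c(tlumley = 274, thomas = 63, plummer = 26, martyn = 39,
             murrell = 379, paul = 48)

# Alias -> canonical name, following Dirk's remark
alias <- c(thomas = "tlumley", martyn = "plummer", paul = "murrell")

# Replace aliases by the canonical name, leave other names untouched
canonical <- ifelse(names(commits) %in% names(alias),
                    alias[names(commits)], names(commits))

# Sum the counts per canonical name
merged <- sort(tapply(commits, canonical, sum), decreasing = TRUE)
merged
```

The totals for murrell, tlumley and plummer agree with the updated table above.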
I am not quite sure who "r" (96 commits) and "mike" (1 commit) are. There
are also 203 commits recorded as "(no author)".
The other caveat is that this analysis does not include data from the pre-svn era.
Romain
On 10/09/2009 03:41 PM, Romain Francois wrote: