thr3ads.net - R devel - [R] The hidden costs of GPL software? [Nov 2004]

If this information is useful, please help other people find it:
Share via:

Philippe Grosjean

2004-Nov-17 09:53 UTC

[R] The hidden costs of GPL software?

Hello,

In the latest 'Scientific Computing World' magazine (issue 78, p. 22),
there
is a review on free statistical software by Felix Grant ("doesn't have
to
pay good money to obtain good statistics software"). As far as I know, this
is the first time that R is even mentioned in this magazine, given that it
usually discuss commercial products.

In this article, the analysis of R is interesting. It is admitted that R is
a great software with lots of potentials, but: "All in all, R was a good
lesson in the price that may have to be paid for free software: I spent many
hours relearning some quite basic things taken for granted in the commercial
package." Those basic things are releated with data import, obtention of
basic plots, etc... with a claim for a missing more intuitive GUI in order
to smooth a little bit the learning curve.

There are several R GUI projects ongoing, but these are progressing very
slowly. The main reason is, I believe, that a relatively low number of
programmers working on R are interested by this field. Most people wanting
such a GUI are basic user that do not (cannot) contribute... And if they
eventually become more knowledgeable, they tend to have other interests.

So, is this analysis correct: are there hidden costs for free software like
R in the time required to learn it? At least currently, for the people I
know (biologists, ecologists, oceanographers, ...), this is perfectly true.
This is even an insurmountable barrier for many of them I know, and they
have given up (they come back to Statistica, Systat, or S-PLUS using
exclusively functions they can reach through menus/dialog boxes).

Of course, the solution is to have a decent GUI for R, but this is a lot of
work, and I wonder if the intrinsic mechanism of GPL is not working against
such a development (leading to a very low pool of programmers actively
involved in the elaboration of such a GUI, in comparison to the very large
pool of competent developers working on R itself).

Do not misunderstand me: I don't give up with my GUI project, I am just
wondering if there is a general, ineluctable mechanism that leads to the
current R / R GUI situation as it stands,... and consequently to a "general
rule" that there are indeed most of the time "hidden costs" in
free
software, due to the larger time required to learn it. I am sure there are
counter-examples, however, my feeling is that, for Linux, Apache, etc... the
GUI (if there is one) is often a way back in comparison to the potentials in
the software, leading to a steep learning curve in order to use all these
features.

I would be interested by your impressions and ideas on this topic.

Best regards,

Philippe Grosjean  

..............................................<??}))><........
 ) ) ) ) )
( ( ( ( (    Prof. Philippe Grosjean
 ) ) ) ) )
( ( ( ( (    Numerical Ecology of Aquatic Systems
 ) ) ) ) )   Mons-Hainaut University, Pentagone
( ( ( ( (    Academie Universitaire Wallonie-Bruxelles
 ) ) ) ) )   6, av du Champ de Mars, 7000 Mons, Belgium  
( ( ( ( (       
 ) ) ) ) )   phone: + 32.65.37.34.97, fax: + 32.65.37.33.12
( ( ( ( (    email: Philippe.Grosjean at umh.ac.be
 ) ) ) ) )      
( ( ( ( (    web:   http://www.umh.ac.be/~econum
 ) ) ) ) )
..............................................................

Lorenz.Gygax@fat.admin.ch

2004-Nov-17 10:29 UTC

head link

[R] The hidden costs of GPL software?

> So, is this analysis correct: are there hidden costs for free 
> software like R in the time required to learn it? At least
> currently, for the people I know (biologists, ecologists,
> oceanographers, ...), this is perfectly true. This is even an
> insurmountable barrier for many of them I know, and they
> have given up (they come back to Statistica, Systat, or S-PLUS
> using exclusively functions they can reach through menus/dialog
> boxes).
I guess you are right, in that the steep initial learning curve could be
smoothed for beginners. On the other hand I do not see how a GUI for R could
cover more than the bare essentials because the available functionality is
so vast. We also have S-Plus at our research institution and even there, I
see, that people who do not know about the underlying code have difficulties
in using the GUI.

I personally believe that it is more a question how one is used to do
statistics. Click and drag is the norm. (And I guess it is usually also the
norm of how people/scientists use other Software.) In my eyes, using code
instead, means that one is able to repeat the steps of an evaluation easily
and to document at the same time what has been done. Very soon evaluations
(and data handling) can be done far more efficiently than with click and
drag. All these advantages outweigh the initial costs by several orders of
magnitude. Thus, in my opinion it is more a question of education such that
people might realize how they can work efficiently and cleanly. Perhaps one
could even say that such an approach is more scientific because, in
principal, it can be easily communicated and reproduced.

It is, of course, easy for me to make these statements, as in the meantime I
have been using S (S-Plus and R) for - gosh - over 10 years. But I see in
some projects that I supervise that people get started easily with a snippet
of code that I provide and the insight of the usefulness of such a work
approach is usually easily within reach.

Lorenz
- 
Lorenz Gygax, Dr. sc. nat.
Tel: +41 (0)52 368 33 84 / lorenz.gygax at fat.admin.ch      
Centre for proper housing of ruminants and pigs
Swiss Federal Veterinary Office

Jan P. Smit

2004-Nov-17 10:49 UTC

head link

[R] The hidden costs of GPL software?

Dear Phillippe,

Very interesting. The URL of the article is 
http://www.scientific-computing.com/scwsepoct04free_statistics.html.

Best regards,

Jan Smit


Philippe Grosjean wrote:> Hello,
> 
> In the latest 'Scientific Computing World' magazine (issue 78, p.
22), there
> is a review on free statistical software by Felix Grant ("doesn't
have to
> pay good money to obtain good statistics software"). As far as I know,
this
> is the first time that R is even mentioned in this magazine, given that it
> usually discuss commercial products.
> 
> In this article, the analysis of R is interesting. It is admitted that R is
> a great software with lots of potentials, but: "All in all, R was a
good
> lesson in the price that may have to be paid for free software: I spent
many
> hours relearning some quite basic things taken for granted in the
commercial
> package." Those basic things are releated with data import, obtention
of
> basic plots, etc... with a claim for a missing more intuitive GUI in order
> to smooth a little bit the learning curve.
> 
> There are several R GUI projects ongoing, but these are progressing very
> slowly. The main reason is, I believe, that a relatively low number of
> programmers working on R are interested by this field. Most people wanting
> such a GUI are basic user that do not (cannot) contribute... And if they
> eventually become more knowledgeable, they tend to have other interests.
> 
> So, is this analysis correct: are there hidden costs for free software like
> R in the time required to learn it? At least currently, for the people I
> know (biologists, ecologists, oceanographers, ...), this is perfectly true.
> This is even an insurmountable barrier for many of them I know, and they
> have given up (they come back to Statistica, Systat, or S-PLUS using
> exclusively functions they can reach through menus/dialog boxes).
> 
> Of course, the solution is to have a decent GUI for R, but this is a lot of
> work, and I wonder if the intrinsic mechanism of GPL is not working against
> such a development (leading to a very low pool of programmers actively
> involved in the elaboration of such a GUI, in comparison to the very large
> pool of competent developers working on R itself).
> 
> Do not misunderstand me: I don't give up with my GUI project, I am just
> wondering if there is a general, ineluctable mechanism that leads to the
> current R / R GUI situation as it stands,... and consequently to a
"general
> rule" that there are indeed most of the time "hidden costs"
in free
> software, due to the larger time required to learn it. I am sure there are
> counter-examples, however, my feeling is that, for Linux, Apache, etc...
the
> GUI (if there is one) is often a way back in comparison to the potentials
in
> the software, leading to a steep learning curve in order to use all these
> features.
> 
> I would be interested by your impressions and ideas on this topic.
> 
> Best regards,
> 
> Philippe Grosjean  
> 
> ..............................................<??}))><........
>  ) ) ) ) )
> ( ( ( ( (    Prof. Philippe Grosjean
>  ) ) ) ) )
> ( ( ( ( (    Numerical Ecology of Aquatic Systems
>  ) ) ) ) )   Mons-Hainaut University, Pentagone
> ( ( ( ( (    Academie Universitaire Wallonie-Bruxelles
>  ) ) ) ) )   6, av du Champ de Mars, 7000 Mons, Belgium  
> ( ( ( ( (       
>  ) ) ) ) )   phone: + 32.65.37.34.97, fax: + 32.65.37.33.12
> ( ( ( ( (    email: Philippe.Grosjean at umh.ac.be
>  ) ) ) ) )      
> ( ( ( ( (    web:   http://www.umh.ac.be/~econum
>  ) ) ) ) )
> ..............................................................
> 
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide!
http://www.R-project.org/posting-guide.html
>

(Ted Harding)

2004-Nov-17 12:34 UTC

head link

[R] The hidden costs of GPL software?

On 17-Nov-04 Philippe Grosjean wrote:> Hello,
> 
> In the latest 'Scientific Computing World' magazine
> (issue 78, p. 22), there is a review on free statistical
> software by Felix Grant ("doesn't have to pay good money
> to obtain good statistics software"). As far as I know,
> this is the first time that R is even mentioned in this
> magazine, given that it usually discuss commercial products.
Hi Philippe,
Thanks for a most interesting post on this question. Further
comments below. Felix Grant's article is excellent, and well
balanced.
> In this article, the analysis of R is interesting. It is
> admitted that R is a great software with lots of potentials,
> but: "All in all, R was a good lesson in the price that may
> have to be paid for free software: I spent many hours
> relearning some quite basic things taken for granted in the
> commercial package." Those basic things are releated with
> data import, obtention of basic plots, etc... with a claim
> for a missing more intuitive GUI in order to smooth a little
> bit the learning curve.
It would better represent the balanced view of the article
to further quote:

  "In fact, the whole file menu in R looks either elegantly
   uncluttered of frightenly obscure, depending on your point
   of view."

  "It [the effort of learning] is the price paid, just as the
   dollars or euros for a commercial package would be. For
   that price, I've learned a great deal -- and nor only
   about R. And I shall remember it when I next have to find
   a heavyweight solution for a big problem presented by a
   small charitable client with an invisible budget. It's a
   huge, awe-inspiring package -- easier to perceive as such
   because the power is not hidden beneath a cosmetic veneer."

This last remark is, in my view, particularly significant.
See below.
> There are several R GUI projects ongoing, but these are
> progressing very slowly. The main reason is, I believe,
> that a relatively low number of programmers working on R
> are interested by this field. Most people wanting such a
> GUI are basic user that do not (cannot) contribute... 
> And if they eventually become more knowledgeable, they
> tend to have other interests.
> 
> So, is this analysis correct: are there hidden costs for
> free software like R in the time required to learn it?
> At least currently, for the people I know (biologists,
> ecologists, oceanographers, ...), this is perfectly true.
> This is even an insurmountable barrier for many of them
> I know, and they have given up (they come back to Statistica,
> Systat, or S-PLUS using exclusively functions they can
> reach through menus/dialog boxes).
Non-GUI vs GUI is not intrinsically linked to Free Software
as such. There are well-known FS programs which are essentially
GUI-based -- as an easy example, consider all the FS Web
Browsers such as Netscape, Mozilla, ... . If you want the
graphics experiences offered by the Web, you're in a graphics
screen anyway, and so it may as well be programmed around
a GUI. Others, such as OpenOffice, have deliberately built
on a GUI approach in order to emulate The Other Thing.

There are a lot of FS programs which offer a GUI, usually
somewhat on the basic side, which nonetheless encapsulates
the entire functionality of the program and saves the user
the task of composing a possibly complex command-line or
even a script.

The comment "hidden beneath a cosmetic veneer" is, in my
view, somewhat directly linked to commercial software.
If you sell software, you want a big market. So you want
to include the people who will never learn how to work
software from a command line; and the sweeter the taste of
the eye candy, the more such people will feel enjoyment
in using the software. The fact that their usage is limited
to what has been pre-programmed into the menus is not going
to affect many such people, since typically their useage
is limited to a very small subset of what is in fact possible.
This in turn leads, of course, to the phenomenon of
"software-driven analysis", where people only do what the
GUI allows (or, more precisely, easily allows); and this
leads on in turn to a culture in which people tend to believe
that Statistics is what they can do with a particular
software package.

S-Plus does its best to compromise: as well as GUI access
to a pretty wide range of functions, there is the Command
Line Window where the user can explicitly type in commands.
(I dare say many R users, in S-Plus, may tend to work in
the latter since they are already used to it.) But, as always
in a GUI, one can tend to get lost in the ramifications.
Also, things like the big arrays of tiny icons you get when
you click on the "2D Plots" or "3D Plots" buttons in the
S-Plus toolbar can be trying on the eyes and time-consuming
to pick through.
> Of course, the solution is to have a decent GUI for R,
> but this is a lot of work, and I wonder if the intrinsic
> mechanism of GPL is not working against such a development
> (leading to a very low pool of programmers actively involved
> in the elaboration of such a GUI, in comparison to the very
> large pool of competent developers working on R itself).
> 
> Do not misunderstand me: I don't give up with my GUI project,
> I am just wondering if there is a general, ineluctable
> mechanism that leads to the current R / R GUI situation as
> it stands,... and consequently to a "general rule" that
> there are indeed most of the time "hidden costs" in free
> software, due to the larger time required to learn it.
> I am sure there are counter-examples, however, my feeling
> is that, for Linux, Apache, etc... the GUI (if there is one)
> is often a way back in comparison to the potentials in
> the software, leading to a steep learning curve in order to
> use all these features.
Often, I think, in the Free Software world, people get involved
because they want to produce something which achieves a task.
Once they have a program which does that, then their aim is
satisfied. The GUI, in many cases, would be additional work
which would add nothing to what the software can do in terms
of tasks to be achieved. So in such cases, yes, I would tend
to agree that there is an intrinsic mechanism that discourages
work on a GUI for its own sake. You can add to that the fact
that once a developer has got to the point of creating such
software, successful in the tasks, they may have got beyond
the point at which they can readily sympathise with users who
have not acquired such skills: they no longer perceive, from
their own experience, that there is a problem.

However, this leaves people like you, having colleagues who
"come back to Statistica, Systat, or S-PLUS using exclusively
functions they can reach through menus/dialog boxes." By this
experience, you are aware of the problem, and rightly feel
that they would be helped by having access to the sort of
GUI/Menu interface that they are used to using.

One genuine benefit that the GUI offers, especially to
beginners with a particular software package, is that the
resources of the software can perhaps more easily and rapidly
be explored through the GUI, rather than searching laboriously
through the documentation of functions, extra packages, and
so on. This means that they more readily come to perceive
what is available though of course this is limited to what
the GUI will show them. But a good "Help" window can break
that barrier.

Perhaps R itself is less helpful than it might be in this
respect. The R-help list bristles with queries of the form
"How can I do X?", which I think is evidence of a problem.
While some of these queries clearly originate from people
who have taken no trouble to explore readily accessible
information, many others can not be so easily dismissed.

If you know something about what you're after, once you
realise that a judiciously formulated "help.search" can
throw up a lot of possibilities you are well on your way.
So, for instance (as in a recent query about 2-D Fourier
transform for spatial data) 'help.search("fourier")' gives
relevant information.

This, though, still fails for information in packages which
you have not installed. Perhaps I'm about to reveal my own
culpable ignorance here, but I'm not aware of a "full R info"
package which would be installed as part of R-base, being
a database of info about R-base itself and also every current
additional package, such that a "help.search" would show
all resources -- including those not installed -- which
match a query (and flag the non-installed ones as such so
that the user knows what to install for a particular purpose).

Whether this needs to be supplemented by a GUI is a point
that could be discussed from several points of view.
Philippe's biological/oceanographic users no doubt would
be considerably helped, provided they can in due course
come to the point where they can start to work "beyond
the GUI" (if indeed they need to).

Personally, however, I find that GUI work is slower and
more error-prone than command-line work. Swanning the
mouse around the screen, visually idebtifying icons and
buttons, clicking on this and that in order to see whether
it's what you want, and so on, is much more time-consuming
than typiing in a command.
And God help you if you accidentally click on something
destructive!

I'll close with an immortal quotation (from Charles Curran,
of the UK Unix Users Group):

  "I can touch-type, but I can't touch-mouse"

Best wishes to all,
Ted.
> I would be interested by your impressions and ideas on this topic.
> 
> Best regards,
> 
> Philippe Grosjean  
> 
> ..............................................<??}))><........
                                                   /\
                                                 /   |
  .............................<??}))><........  :)    >=---
                                                 \   |
                                                   \/

Best wishes to all,
Ted.

--------------------------------------------------------------------
E-Mail: (Ted Harding) <Ted.Harding at nessie.mcc.ac.uk>
Fax-to-email: +44 (0)870 094 0861  [NB: New number!]
Date: 17-Nov-04                                       Time: 12:34:31
------------------------------ XFMail ------------------------------

Federico Calboli

2004-Nov-17 12:49 UTC

head link

[R] The hidden costs of GPL software?

Philippe Grosjean wrote:> I would be interested by your impressions and ideas on this topic.
I have found that "user friendly" packages make a lot of assumptions
and
take a lot of decisions for the user. This makes things easy, but you do
not really know what is going on, and I'd say this is a hidden cost of
"commercial" software. 

I wrote to the list in February asking how to reproduce some results
previously obtained with Statistica. It turned out that Statistica does
some data manipulation without telling the user, with poor documentation
and no options or choice. Do you trust results obtained this way? I
don't. 

So I'd argue that the lack of a GUI is a good thing, because it forces
the users to think a bit more about what they want to do, and gives more
control on what is going on.

Best,

Federico Calboli
-- 
Federico C. F. Calboli

Dipartimento di Biologia Evoluzionistica Sperimentale
Universit?? di Bologna
Via Selmi, 3
40126 Bologna - ITALY

Tel - +39 051 2094187
Fax - +39 051 2094286
f.calboli at ucl.ac.uk
fcalboli at alma.unibo.it

Mike Prager

2004-Nov-17 20:55 UTC

head link

[R] The hidden costs of GPL software?

This has been an interesting discussion. I make the following comment with 
hesitation, since I have neither the time nor the ability to implement it 
myself.

Using CLI software, an infrequent user has trouble remembering the known 
functions needed and trouble finding new ones (especially as that user gets 
older).  What might help is an added help facility more oriented towards 
tasks, rather than structured around functions or packages.

Such a help facility might have a tree structure.

Want help?  Are you looking for information on (1) data manipulation or (2) 
analysis?  If (1), do you want to to (3) import or export data, (4) 
transform data, (5) reshape data, or (6) select data?  If (2), do you want 
to (7) fit a model or (8) make a graph?  And so on....

Once appropriate function(s) are located, the user would be directed (by 
hyperlinks) to the existing help framework.

That could help the problem of knowing what you want to do, but not what it 
is called.  I think that "Introductory Statistics with R" is a step in
that
direction for the basics, as MASS is for more complex matters.  The 
question is whether such material can be incorporated into a help system 
that will allow users to find, more easily, what they need.  That largely 
depends, it seems to me, on a great deal of work by volunteers.

I agree also with the suggestion that a dedicated editor (or add-in) that 
could supply arguments for functions might be considerable help.

MHP


-- 
Michael Prager, Ph.D.
Population Dynamics Team, NMFS SE Fisheries Science Center
NOAA Center for Coastal Fisheries and Habitat Research
Beaufort, North Carolina  28516
http://shrimp.ccfhrb.noaa.gov/~mprager/

Andrew Criswell

2004-Nov-18 01:59 UTC

head link

[R] The hidden costs of GPL software?

Hi All,

GRETL, a Gnu Regression, Econometrics and Time-series Library is 
open-source, cross-platform, multi-language and fully GUI based. The 
website is http://gretl.sourceforge.net/ This is NOT a personal plug, 
simply posted to show what can be done.

Andrew

Richard A. O'Keefe

2004-Nov-18 04:38 UTC

head link

[R] The hidden costs of GPL software?

Background:  I'm a Computer Science lecturer, and I read the blue book cover
to cover before ever setting finger to keyboard with R.

Observation:  I really only use R for very simple things, but there's
practically *nothing* I've done with R since installing it could have
been done via menus.  I seem to need lots of little R functions, lots
of little try this transformation plot that, fiddle with the other...
I've had a student use it, I've introduced it to colleauges, and they
would not have benefited one iota from a GUI interface.

I'm glad that the effort put into R has gone into the things it has.

Michael Grant

2004-Nov-18 11:24 UTC

head link

[R] The hidden costs of GPL software?

Hmmmm, interesting thread and minds will not be
changed but regarding GUIs...I thought S (aka R) was a
PROGRAMMING LANGUAGE with a statistical and numerical
slant, and not a statistics application. ;O)  

Certainly there is an important place for GUIs but I
believe that it is very much overemphasized in modern
computer culture. My experience and bias--and I
started in the 1960's-- is that except for 'trivial'
uses, GUIs are a detriment to any reasonably complex
CREATIVE computational task. They are adequate for the
simple, common task. But even then, typing a command
or two is not overly taxing--- particularly when
compared to navigating layer upon layer of submenus as
is some times needed. If I need to, I will add a
little syntactical sugaring when coding and move on. 

GUIs encourage a passive approach to using computers
when solving problems. In addition, it is regretable
that a lot of people in the 'workplace' will carry out
incomplete and/or incorrect quantitative work because
of the real or perceived limitations of the particular
(GUI) apps they are using. There is no inclination to
go beyond the menu and even then many menu items
gather 'electronic dust'.

Finally, there are times for many of us when work
'goes home' at the end of the day. That just comes
with the territory. I (and most others) can not afford
the luxury of S-plus, Statistica, SPSS, etc. at home.
So in a sense there is a very real 'loss of
productivity' cost associated with using commercial
software. Now that does bring us around to supporting
R doesn't it? (Mea culpa. And I resolve to do better!)
What value does one put on the vitality of the R
community?

Best regards,
Michael Grant, Ph.D. 

* The requirements for creating packages are on
target,  and have the desired impact on both the
quality and breadth of R. 

--- Philippe Grosjean <phgrosjean at sciviews.org> wrote:
> Hello,
> 
> In the latest 'Scientific Computing World' magazine
> (issue 78, p. 22), there
> is a review on free statistical software by Felix
> Grant ...2.)

Tiago R Magalhaes

2004-Nov-18 16:39 UTC

head link

[R] Re: The hidden costs of GPL software?

My background:

I am a biologist coming to R via Bioconductor. I have no computer 
background in computer sciences and only basic undergraduate training 
level in statistics.

I have used R with great pleasure and great pains. The most difficult 
thing is to know what functions to use - sometimes I know that one 
function is most likely available, but there's really no easy way to 
get it (yes, even going to the archives and reading the help files). 
I feel that more examples in the help files would definitely be a 
good way to fully understand the potencial of the functions. I know 
how difficult this is to do and how much of a time sink it must be.

One thing I defeinitely think would be a great improvement is to have 
a beefed up Object Explorer as Splus does. I think it's the great 
advantage of Splus when compared to R is to have much easier access 
to what type of object, col names, classes and so on there are. And 
how much easier it is to change all these attributes in Splus. I 
think R 2.0 in Mac did a lot to improve this, but I think that for 
someone that very frequently needs to know whether the object created 
turned out to be a vector or a list, the easy access to objects is 
very, very, very important and would be a great improvement in the 
ease of use of R.

Arne.Muller@aventis.com

2004-Nov-18 17:50 UTC

head link

[R] Re: The hidden costs of GPL software?

[...]> I am a biologist coming to R via Bioconductor. I have no computer 
> background in computer sciences and only basic undergraduate training 
> level in statistics.
> 
> I have used R with great pleasure and great pains. The most difficult 
> thing is to know what functions to use - sometimes I know that one 
> function is most likely available, but there's really no easy way to 
> get it (yes, even going to the archives and reading the help files). 
> I feel that more examples in the help files would definitely be a 
> good way to fully understand the potencial of the functions. I know 
> how difficult this is to do and how much of a time sink it must be.
Yes, I' often have the same problem when it comes to programming in R (data
manipulation, formatting etc ...). When thinking about a solution, I often come
up with something slow and complicated. A positng to this list usually reveals a
very simple solution thanks to a function that I didn't find when exploring
help, help.search and the archives (and thanks to those who give me the hint
;-). However, I don't know how to improve this, i.e. how to implement a more
sophisticated help.search. Maybe the keywords in the help files or some kind of
free text mining would help - well, maybe this is a bit over the top.

On the other hand, when it comes to the statistics (I'm a not a
statistician) and it's minimal formatting of data etc , I think that
developing an understanding of the stats itself is the main probelm and a GUI
doesn't help very much in for this. Once the basic understanding is there
(which one needs anyway, even with a GUI), the rest is not too difficult. In
addition I usually need to script the calculations for many different datasets,
and again most GUIs are bad in repeating tasks systematically.

I've spent quite some time with learing R (and I haven't stoped yet ;-),
but it's devinitely worth it. As a scientists I appreciate it, and since it
is a tool that use often, I would not exchange the command-line for any GUI.

This list and the many books and manuals (mentioned in the other postings here)
do a pretty good job in teaching R!

	kind regards,

	Arne

[...]

Liaw, Andy

2004-Nov-18 19:50 UTC

head link

[R] The hidden costs of GPL software?

> From: David Forrest
> 
> On Wed, 17 Nov 2004, Mike Prager wrote:
> 
> ...
> > Using CLI software, an infrequent user has trouble 
> remembering the known
> > functions needed and trouble finding new ones (especially 
> as that user gets
> > older).  What might help is an added help facility more 
> oriented towards
> > tasks, rather than structured around functions or packages.
> ...
> 
> Another good (non-GUI) tool for the CLI is keyword 
> completion.  R in ESS
> does this, giving you lists of possible functions, variables 
> and objects,
> or feedback if there isn't any.  R's CLI completes, but only with
> filenames in the current directory.
That works only if R was compiled with readline.  Thus it doesn't work that
way on Windows, for example.  Completion still works on Windows under ESS
though.

Andy


 > Dave
> -- 
>  Dave Forrest
>  drf at vims.edu                                    (804)684-7900w
>  drf5n at maplepark.com                             (804)642-0662h
>                                    http://maplepark.com/~drf5n/
> 
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide! 
> http://www.R-project.org/posting-guide.html
> 
>

John Maindonald

2004-Nov-18 21:48 UTC

head link

[R] The hidden costs of GPL software?

The author of the article says nothing about the large
number of hours and weeks that he surely spent learning
S-plus!

There should be attention to the costs that arise from a wrong
or inappropriate analysis, perhaps because the software that
is in use makes it difficult to do anything better, perhaps
because of statistical skill limitations, often with these two
factors working together.  Analyses that misrepresent the
science, or designs and analyses that conspire together to
this end, have serious and costly implications for research.

I've refereed several papers recently, in broadly ecological
fields of endeavour, with seemingly quite reasonable data,
where the mix of author skill and abilities of the package was
clearly not up to the task in hand.  Relative to getting on top
of the statistical issues (for which they will probably end up
getting, as they need to, statistical help), the GUI/noGUI issue
will be a minor consideration, and hours or weeks spent
learning R will be at most a modest consideration.

John Maindonald             email: john.maindonald at anu.edu.au
phone : +61 2 (6125)3473    fax  : +61 2(6125)5549
Centre for Bioinformation Science, Room 1194,
John Dedman Mathematical Sciences Building (Building 27)
Australian National University, Canberra ACT 0200.


On 17 Nov 2004, at 10:27 PM, r-help-request at stat.math.ethz.ch wrote:
> From: "Philippe Grosjean" <phgrosjean at sciviews.org>
> Date: 17 November 2004 8:53:28 PM
> To: <r-help at stat.math.ethz.ch>, <r-sig-gui at
stat.math.ethz.ch>
> Cc: Subject:
>
> Hello,
>
> In the latest 'Scientific Computing World' magazine (issue 78, p.
22),
> there
> is a review on free statistical software by Felix Grant ("doesn't
have
> to
> pay good money to obtain good statistics software"). As far as I know,
> this
> is the first time that R is even mentioned in this magazine, given 
> that it
> usually discuss commercial products.
>
> In this article, the analysis of R is interesting. It is admitted that 
> R is
> a great software with lots of potentials, but: "All in all, R was a 
> good
> lesson in the price that may have to be paid for free software: I 
> spent many
> hours relearning some quite basic things taken for granted in the 
> commercial
> package." Those basic things are releated with data import, obtention 
> of
> basic plots, etc... with a claim for a missing more intuitive GUI in 
> order
> to smooth a little bit the learning curve.
>
> There are several R GUI projects ongoing, but these are progressing 
> very
> slowly. The main reason is, I believe, that a relatively low number of
> programmers working on R are interested by this field. Most people 
> wanting
> such a GUI are basic user that do not (cannot) contribute... And if 
> they
> eventually become more knowledgeable, they tend to have other 
> interests.
>
> So, is this analysis correct: are there hidden costs for free software 
> like
> R in the time required to learn it? At least currently, for the people 
> I
> know (biologists, ecologists, oceanographers, ...), this is perfectly 
> true.
> This is even an insurmountable barrier for many of them I know, and 
> they
> have given up (they come back to Statistica, Systat, or S-PLUS using
> exclusively functions they can reach through menus/dialog boxes).
>
> Of course, the solution is to have a decent GUI for R, but this is a 
> lot of
> work, and I wonder if the intrinsic mechanism of GPL is not working 
> against
> such a development (leading to a very low pool of programmers actively
> involved in the elaboration of such a GUI, in comparison to the very 
> large
> pool of competent developers working on R itself).
>
> Do not misunderstand me: I don't give up with my GUI project, I am just
> wondering if there is a general, ineluctable mechanism that leads to 
> the
> current R / R GUI situation as it stands,... and consequently to a 
> "general
> rule" that there are indeed most of the time "hidden costs"
in free
> software, due to the larger time required to learn it. I am sure there 
> are
> counter-examples, however, my feeling is that, for Linux, Apache, 
> etc... the
> GUI (if there is one) is often a way back in comparison to the 
> potentials in
> the software, leading to a steep learning curve in order to use all 
> these
> features.
>
> I would be interested by your impressions and ideas on this topic.
>
> Best regards,
>
> Philippe Grosjean

Bill Northcott

2004-Nov-19 09:22 UTC

head link

[R] The hidden costs of GPL software?

>       If the GPL were not so tight on this point, someone could
> commercialize a GUI for R without having to offer their source code
> under the GPL.
>There is nothing in GPL to stop a commercial GUI for R.  Have a look at 
what Apple do.  They have a complete commercial GUI and numerous 
applications built on a an almost completely GPL'ed operating system.  
There are loads of shareware GUIs which drive GPL utilities.  Most 
obviously there are plenty of commercial apps which run on GNU Linux.

Bill Northcott

Pfaff, Bernhard

2004-Nov-19 13:42 UTC

head link

[R] The hidden costs of GPL software?

Dear list member,

this thread as well as the first one started by Philippe about the
usefulness of a GUI is interesting and overwhelming alike. IMHO, it
wittnesses the greatness and superiority of R compared to other statistical
programming environments and programs: the core team and all people involved
with it. Everyday I am flabbergasted and amazed anew; learning new concepts,
programming tricks, statistical methods I was unfamiliar with and much much
more: kudos to all of you. Let me now toss in my two cents:

First cent: Comments about a GUI
Although I use Emacs/ESS and R batch mode mostly, a GUI is beneficial in
terms of teaching statistics and/or econometrics with R. This conclusion
draws upon experience nine years back, while I was giving, beside
econometric classes, computer labs at university. At that time we had only
commercial products at hand: RATS, GAUSS and EViews. From a students
perspective EViews (menu-driven) was the most convenient one. The
econometric method comprehension and the interpretation of its application
is of utmost importance. Hence, novices should concentrate on this to
familiarise themselves with the subject. Most of the students got scared and
distracted by learning a command driven programming language too, i.e. this
was too much to swallow at one time. With other words: do not challenge
novices to statistics/econometrics programmatically. A point mentioned by
Phillipe in an earlier email too. 
Now given Rcmdr: I like its flexibility and that everybody can tailormade
his own `version' by adding new menus and functionalities. So to speak, a
very decent ground work has been provided by John Fox and I appreciate it
alot. I can only speak for myself: I am currently writing an `urca' add-in
to Rcmdr, such that the package is more amenable to novice users in a
computer lab for example; that is: they do not have to worry about the
correct syntax or what can be achieved with which function. In order to do
so, two files are needed: one is an addendum to the menu's file and the
other one contains the R functions to be executed within Rcmdr. It is at the
leisure of the instructure to include these into Rcmdr. They can be shipped
in the package's /inst directory, for example. This seems to be a feasible
approach for other package maintainers working in different fields too. Or
would such an approach be to simple?

Second cent: help system
As voiced in earlier emails in this thread the R documentation, contributed
tutorials and the likes as well as the help facilities are indeed great. The
only snag, is a lack of an `easy to find' approach to be taken. Surely there
is help.search(), apropos, help.start() etc. etc. But what would be nice,
would be something similar to `texdoctk' for LaTeX document retrieval. That
is: categorise the manuals, package manuals, vignettes and other contributed
docs with respect to the catergory they belong to. Well, the snag is: who
does this labour intensive work? Hm, I am sceptic, but it might turn out
that this is not a feasible approach to be taken, but maybe my second
suggestion is: making greater use of \concepts and/or \keyword by providing
a file for download on CRAN that contains the \concepts entries & the
function & the package in which it is contained. One could then download
this file and execute a `zgrep' on it, as could be done likewise with a
contents file from an apt repository to find out which file is contained in
which rpm. The advantage would be the decentralisation of the work. It does
not take that long when each package maintainer utilises \concepts in his
.Rd files. Once, a package is contributed to CRAN, one could scan the
tarballs and extract the relevant information into the above mentioned file.
Another advantage would be, that users would find functions of packages that
are currently not in their search path, because the packages have not been
installed. And not a few questions on this list are answered by: `This
functionality is contained in package xyz'. Anyway, I will introduce
\concepts within the next release of `urca'.

Best Regards,
Bernhard   
> 
> "Philippe Grosjean" <phgrosjean at sciviews.org> writes:
> 
> > Peter,
> > 
> > You don't need the ActiveState Tcl distribution to add 
> extensions. If you
> > compile extensions yourself (and these extensions have a compatible
> > license), then you have no problems... (well, almost! You 
> must make sure
> > those extensions compile correctly on all supproted 
> platforms). This is
> > exactly what I do in the tcltk2 package.
> > Best,
> > 
> > Philippe Grosjean 
> 
> I know, it's just that it feels silly that we cannot build on the fine
> work of ActiveState.
> 
> -- 
>    O__  ---- Peter Dalgaard             Blegdamsvej 3  
>   c/ /'_ --- Dept. of Biostatistics     2200 Cph. N   
>  (*) \(*) -- University of Copenhagen   Denmark      Ph: 
> (+45) 35327918
> ~~~~~~~~~~ - (p.dalgaard at biostat.ku.dk)             FAX: 
> (+45) 35327907
> 
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html

--------------------------------------------------------------------------------
The information contained herein is confidential and is inte...{{dropped}}

Siddique, Amer

2004-Nov-19 15:18 UTC

head link

[R] Re: The hidden costs of GPL software?

my quick thought:
R is a programming language and shouldn't be wrapped up in a GUI to serve
the interests of those too complacent to learn to leverage its power. 

and to echo others: I feel an IDE approach with, say, a code editor and a
hyperlinked help system with a richer set of examples is sufficient. we
already have the former. cheers

Cliff Lunneborg

2004-Nov-19 21:59 UTC

head link

[R] The hidden costs of GPL software?

Could I voice my support for the sixth point raised by John Fox? Many
users would find such a development to be enormously useful.


"  (6) As has been pointed out, e.g., by Duncan Murdoch, solving the
function-locating problem is best done by a method or methods that
automatically accommodate the growing and changing set of contributed
packages on CRAN.  Why not, as previously has been proposed, replace the
current static (and, in my view, not very useful) set of keywords in R
documentation with the requirement that package authors supply their own
keywords for each documented object? I believe that this is the intent
of
the concept entries in Rd files, but their use certainly is not required
or
even actively encouraged. (They're just mentioned in passing in the
Writing
R Extensions manual.)"


**********************************************************
Cliff Lunneborg, Professor Emeritus, Statistics &
Psychology, University of Washington, Seattle
cliff at ms.washington.edu

Liaw, Andy

2004-Nov-23 21:31 UTC

head link

[R] The hidden costs of GPL software?

> From: Patrick Burns
> 
> I think John has exactly the right image -- index to a book --
> but I disagree with his conclusions.
> 
> I read somewhere that an index should not be done by the
> author.  It was probably written by someone who was bored
> of indexing, but the logic was precisely because indices should
> be about concepts.  The author of a package will have one
> concept for a function but not all of the concepts that come
> from various fields of study.  I suspect that no one outside of
> finance would think to index "sd" with "volatility" for
(a not very
> good) example.
> 
> There could be an index builder that accepts a search phrase and
> the function or package that is the successful answer to the search.
> If this were open, then R users could contribute to the index who
> don't feel qualified to submit code. It could also help diffuse the
> frustration of taking too long to find a function by allowing a way
> to insure that the exact same thing doesn't happen to others.
> 
> Amazon has a function that says those who bought "The Chicago
> Manual of Style" also bought Strunk and White.
Would that be the same function that suggested bunch of books on fashion
modeling when I look up Frank's book (`Regression Modeling Strategies')?
8-)

Andy
>  In the same way,
> the R index could provide a list of terms that overlap the given
> search term.  For example if we search for "goodness of fit",
then
> "hypothesis test" might be one of the related terms that pops up.
> 
> No, I'm not volunteering to build the system.
> 
> Patrick Burns
> 
> Burns Statistics
> patrick at burns-stat.com
> +44 (0)20 8525 0696
> http://www.burns-stat.com
> (home of S Poetry and "A Guide for the Unwilling S User")
> 
> John Fox wrote:
> 
> >Dear Duncan,
> >
> >I don't think that there is an automatic, nearly costless 
> way of providing
> >an effective solution to locating R resources. The problem 
> seems to me to be
> >analogous to indexing a book. There's an excellent 
> description of what that
> >process *should* look like in the Chicago Manual of Style, 
> and it's a lot of
> >work. In my experience, most book indexes are quite poor, 
> and automatically
> >generated indexes, while not useless, are even worse, since 
> one should index
> >concepts, not words. The ideal indexer is therefore the 
> author of the book.
> >
> >I guess that the question boils down to how important is it 
> to provide an
> >analogue of a good index to R? As I said in a previous 
> message, I believe
> >that the current search facilities work pretty well -- about 
> as well as one
> >could expect of an automatic approach. I don't believe that 
> there's an
> >effective centralized solution, so doing something more 
> ambitious than is
> >currently available implies farming out the process to 
> package authors. Of
> >course, there's no guarantee that all package authors will 
> be diligent
> >indexers. 
> >
> >Regards,
> > John
> >
> >--------------------------------
> >John Fox
> >Department of Sociology
> >McMaster University
> >Hamilton, Ontario
> >Canada L8S 4M4
> >905-525-9140x23604
> >http://socserv.mcmaster.ca/jfox 
> >-------------------------------- 
> >
> >  
> >
> >>-----Original Message-----
> >>From: r-help-bounces at stat.math.ethz.ch 
> >>[mailto:r-help-bounces at stat.math.ethz.ch] On Behalf Of 
> Duncan Murdoch
> >>Sent: Monday, November 22, 2004 8:55 AM
> >>To: Cliff Lunneborg
> >>Cc: r-help at stat.math.ethz.ch
> >>Subject: Re: [R] The hidden costs of GPL software?
> >>
> >>On Fri, 19 Nov 2004 13:59:23 -0800, "Cliff Lunneborg"
> >><cliff at ms.washington.edu> quoted John Fox:
> >>
> >>    
> >>
> >>>Why not, as previously has been proposed, replace the 
> current static 
> >>>(and, in my view, not very useful) set of keywords in R 
> >>>      
> >>>
> >>documentation 
> >>    
> >>
> >>>with the requirement that package authors supply their own 
> >>>      
> >>>
> >>keywords for 
> >>    
> >>
> >>>each documented object? I believe that this is the intent of
the
> >>>concept entries in Rd files, but their use certainly is not 
> >>>      
> >>>
> >>required or 
> >>    
> >>
> >>>even actively encouraged. (They're just mentioned in 
> passing in the 
> >>>Writing R Extensions manual.
> >>>      
> >>>
> >>That would not be easy and won't happen quickly.  There are
some
> >>problems:
> >>
> >> - The base packages mostly don't use  \concept. (E.g. base 
> >>has 365 man pages, only about 15 of them use it).  Adding it 
> >>to each file is a fairly time-consuming task.
> >>
> >>- Before we started, we'd need to agree as to what they are
for.
> >>Right now, I think they are mainly used when the name of a 
> >>concept doesn't match the name of the function that 
> >>implements it, e.g.
> >>"modulo", "remainder", "promise",
"argmin", "assertion".  The
> >>need for this usage is pretty rare.  If they were used for 
> >>everything, what would they contain?
> >>
> >> - Keywording in a useful way is hard.  There are spelling 
> >>issues (e.g. optimise versus optimize); our fuzzy matching 
> >>helps with those.
> >>But there are also multiple names for the same thing, and 
> >>multiple meanings for the same name.
> >>
> >>Duncan Murdoch
> >>
> >>______________________________________________
> >>R-help at stat.math.ethz.ch mailing list
> >>https://stat.ethz.ch/mailman/listinfo/r-help
> >>PLEASE do read the posting guide! 
> >>http://www.R-project.org/posting-guide.html
> >>    
> >>
> >
> >______________________________________________
> >R-help at stat.math.ethz.ch mailing list
> >https://stat.ethz.ch/mailman/listinfo/r-help
> >PLEASE do read the posting guide! 
> http://www.R-project.org/posting-guide.html
> >
> >
> >
> >  
> >
> 
> 
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide! 
> http://www.R-project.org/posting-guide.html
> 
>

Apparently Analagous Threads

Search for more reasonably related threads

R devel - Nov 2004 - The hidden costs of GPL software?

[R] The hidden costs of GPL software?

[R] The hidden costs of GPL software?

[R] The hidden costs of GPL software?

[R] The hidden costs of GPL software?

[R] The hidden costs of GPL software?

[R] The hidden costs of GPL software?

[R] The hidden costs of GPL software?

[R] The hidden costs of GPL software?

[R] The hidden costs of GPL software?

[R] Re: The hidden costs of GPL software?

[R] Re: The hidden costs of GPL software?

[R] The hidden costs of GPL software?

[R] The hidden costs of GPL software?

[R] The hidden costs of GPL software?

[R] The hidden costs of GPL software?

[R] Re: The hidden costs of GPL software?

[R] The hidden costs of GPL software?

[R] The hidden costs of GPL software?

Apparently Analagous Threads