Martin Maechler
2010-Jun-01 13:25 UTC
[R] R-help "spam" detection; please help the moderators
Dear readers of R-help as most of you will *not* be aware, R-help has continued to work the way it does, only thanks to a dozen of volunteers, see https://stat.ethz.ch/mailman/listinfo/r-help . The volunteers manually moderate e-mails that "look like spam" (and sometimes are and sometimes are not). While much more than 90% of the spam is filtered out long before a human sees it, with the increasing sophistication of spammers, manual intervention has deemed to be necessary and served the community very well. OTOH, in recent weeks, the amount of work for the volunteers has increased, mainly because an increasingly number of non-spam postings are erronously tagged as "possibly spam". We have discussed about this and done some analysis and found that most of these message that produce a considerable amount of extra work share two properties : 1) they are posted via Nabble {which *always* attaches a small pro-Nabble spam at the end of the message} 2) the e-mail address of the sender is from a freemail provider, quite often 'at gmail dot com', and often the part *before* the '@' (at-sign) ends with digits. We hereby ask those among you who use a freemail account to please no longer post via nabble. Thank you for your support of R-help, *the* "community mailing list" of the R project since even before that project existed "formally", namely since 1997-04-01, today 13 years and two months. Martin Maechler, ETH Zurich (and R-help creator and principal manager)
Hi all, I also couldn't help but notice that some of my messages are bounced for following reason: The message headers matched a filter rule I included the header of one of the messages below, but neither of these messages is sent trough Nabble, nor does any mail address has digits in it. I also never had that before. Did you change some of the rules somehow? Cheers Joris ----------------------- MIME-Version: 1.0 Received: by 10.140.173.9 with HTTP; Fri, 28 May 2010 05:32:32 -0700 (PDT) In-Reply-To: <AANLkTim9eTuY2EfynLoH2LYN7M133YTjeNcDJpkGPHJx@mail.gmail.com> References: <AANLkTikgC7V2ZbSYRWcWBUeeZm8D24qj0VqeB2z1NduD@mail.gmail.com> <AANLkTim9eTuY2EfynLoH2LYN7M133YTjeNcDJpkGPHJx@mail.gmail.com> Date: Fri, 28 May 2010 14:32:32 +0200 Delivered-To: jorismeys@gmail.com Message-ID: <AANLkTimg4IDyiVhe1ek9mk6_RybjcNuU4msvWRvtSGTS@mail.gmail.com> Subject: Re: [R] How to get values out of a string using regular expressions? From: Joris Meys <jorismeys@gmail.com> To: Gabor Grothendieck <ggrothendieck@gmail.com> Cc: R mailing list <r-help@r-project.org> Content-Type: multipart/alternative; boundary=000e0cd2295481515c0487a6b3be --000e0cd2295481515c0487a6b3be Content-Type: text/plain; charset=ISO-8859-1 On Tue, Jun 1, 2010 at 3:25 PM, Martin Maechler <maechler@stat.math.ethz.ch>wrote:> Dear readers of R-help > > as most of you will *not* be aware, R-help has continued to work the > way it does, only thanks to a dozen of volunteers, > see https://stat.ethz.ch/mailman/listinfo/r-help . > > The volunteers manually moderate e-mails that "look like spam" (and > sometimes are and sometimes are not). > While much more than 90% of the spam is filtered out long before > a human sees it, with the increasing sophistication of spammers, > manual intervention has deemed to be necessary and served the > community very well. > > OTOH, in recent weeks, the amount of work for the volunteers has > increased, mainly because an increasingly number of non-spam postings are > erronously tagged as "possibly spam". > We have discussed about this and done some analysis and found > that most of these message that produce a considerable amount of > extra work share two properties : > 1) they are posted via Nabble {which *always* attaches a small > pro-Nabble spam at the end of the message} > 2) the e-mail address of the sender is from a freemail > provider, quite often 'at gmail dot com', and often the part > *before* the '@' (at-sign) ends with digits. > > We hereby ask those among you who use a freemail account to > please no longer post via nabble. > > Thank you for your support of R-help, *the* "community mailing > list" of the R project since even before that project existed > "formally", namely since 1997-04-01, > today 13 years and two months. > > Martin Maechler, ETH Zurich > (and R-help creator and principal manager) > > ______________________________________________ > R-help@r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide > http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. >-- Joris Meys Statistical Consultant Ghent University Faculty of Bioscience Engineering Department of Applied mathematics, biometrics and process control Coupure Links 653 B-9000 Gent tel : +32 9 264 59 87 Joris.Meys@Ugent.be ------------------------------- Disclaimer : http://helpdesk.ugent.be/e-maildisclaimer.php [[alternative HTML version deleted]]
Joshua Wiley
2010-Jun-02 15:34 UTC
[R] R-help "spam" detection; please help the moderators
Hello Martin and Ted, First off thank you to you guys and all the volunteers for providing this wonderful service. I have two questions. 1) Do you know if it is a problem to respond to a post from nabble using a gmail account? 2) Would it be easier for you if people just used non free accounts? I don't particularly relish the idea, but if it helped it would be worth it. Thanks again, Josh On Tue, Jun 1, 2010 at 6:25 AM, Martin Maechler <maechler at stat.math.ethz.ch> wrote:> Dear readers of R-help > > as most of you will *not* be aware, R-help has continued to work the > way it does, only thanks to a dozen of volunteers, > see https://stat.ethz.ch/mailman/listinfo/r-help . > > The volunteers manually moderate e-mails that "look like spam" (and > sometimes are and sometimes are not). > While much more than 90% of the spam is filtered out long before > a human sees it, with the increasing sophistication of spammers, > manual intervention has deemed to be necessary and served the > community very well. > > OTOH, in recent weeks, the amount of work for the volunteers has > increased, mainly because an increasingly number of non-spam postings are > erronously tagged as "possibly spam". > We have discussed about this and done some analysis and found > that most of these message that produce a considerable amount of > extra work share two properties : > ?1) they are posted via Nabble ?{which *always* attaches a small > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? pro-Nabble spam at the end of the message} > ?2) the e-mail address of the sender is from a freemail > ? ?provider, quite often 'at gmail dot com', and often the part > ? ?*before* the '@' (at-sign) ends with digits. > > We hereby ask those among you who use a freemail account to > please no longer post via nabble. > > Thank you for your support of R-help, *the* "community mailing > list" of the R project since even before that project existed > "formally", namely since 1997-04-01, > today 13 years and two months. > > Martin Maechler, ETH Zurich > (and R-help creator and principal manager) > > ______________________________________________ > R-help at r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. >-- Joshua Wiley Senior in Psychology University of California, Riverside http://www.joshuawiley.com/
Reasonably Related Threads
- Warning message when items of Hmisc are masked by loading a package.
- Improved Data Aggregation and Summary Statistics in R
- Non-procedural access to columns of a matrix
- Best practices in developing package: From a single file
- `[` not recognized as a primitive in certain cases.