Bob Green
2008-Sep-20 06:24 UTC
[R] removing a word, the following space and the next word
>Hello,I am hoping for advice as to how I could remove all words immediately following the words 'Mr' or 'Mr.' in a csv file. For example, the following phrases are included in lines of text (along with other Mr) that could be anywhere in the file: "Mr Jones ate lunch" and "Mr Smith was tied". I want to remove the words Jones and Smith (etc) leaving the other text intact. Any suggestions are appreciated, regards Bob
Andrew Robinson
2008-Sep-20 06:42 UTC
[R] removing a word, the following space and the next word
Hi Bob, I recommend doing some background reading on regular expressions[1] and using gsub(). Cheers Andrew [1] http://en.wikipedia.org/wiki/Regular_expressions On Sat, Sep 20, 2008 at 04:24:36PM +1000, Bob Green wrote:> > >Hello, > > I am hoping for advice as to how I could remove all words immediately > following the words 'Mr' or 'Mr.' in a csv file. For example, the > following phrases are included in lines of text (along with other Mr) > that could be anywhere in the file: "Mr Jones ate lunch" and "Mr > Smith was tied". > > > I want to remove the words Jones and Smith (etc) leaving the other > text intact. > > Any suggestions are appreciated, > > regards > > > Bob > > ______________________________________________ > R-help at r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code.-- Andrew Robinson Department of Mathematics and Statistics Tel: +61-3-8344-6410 University of Melbourne, VIC 3010 Australia Fax: +61-3-8344-4599 http://www.ms.unimelb.edu.au/~andrewpr http://blogs.mbs.edu/fishing-in-the-bay/
jim holtman
2008-Sep-20 17:15 UTC
[R] removing a word, the following space and the next word
> x <- 'Mr Jones ate lunch and Mr Smith was tied' > gsub('(Mr\\.*)\\s+\\w+', "\\1 <file://0.0.0.1/> xxxx", x)[1] "Mr xxxx ate lunch and Mr xxxx was tied"> >On Sat, Sep 20, 2008 at 2:24 AM, Bob Green <bgreen@dyson.brisnet.org.au>wrote:> > Hello, >> > > I am hoping for advice as to how I could remove all words immediately > following the words 'Mr' or 'Mr.' in a csv file. For example, the following > phrases are included in lines of text (along with other Mr) that could be > anywhere in the file: "Mr Jones ate lunch" and "Mr Smith was tied". > > > I want to remove the words Jones and Smith (etc) leaving the other text > intact. > > Any suggestions are appreciated, > > regards > > > Bob > > ______________________________________________ > R-help@r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide > http://www.R-project.org/posting-guide.html<http://www.r-project.org/posting-guide.html> > and provide commented, minimal, self-contained, reproducible code. >-- Jim Holtman Cincinnati, OH +1 513 646 9390 What is the problem that you are trying to solve? [[alternative HTML version deleted]]