thr3ads.net - R help - [R] : how to select rows at random [Mar 2009]

If this information is useful, please help other people find it:
Share via:

Laura Rodriguez Murillo

2009-Mar-27 19:11 UTC

[R] : how to select rows at random

Hi dear list,

I have a list of around 2000 identifiers aranged in a dataframe in one
column and I would like to choose a random subset of these. I wonder
if somebody can tell me if I could do this with R...

Thank you so much!

Laura RM

jim holtman

2009-Mar-27 19:14 UTC

head link

[R] : how to select rows at random

?sample

On Fri, Mar 27, 2009 at 3:11 PM, Laura Rodriguez Murillo
<laura.lmurillo at gmail.com> wrote:> Hi dear list,
>
> I have a list of around 2000 identifiers aranged in a dataframe in one
> column and I would like to choose a random subset of these. I wonder
> if somebody can tell me if I could do this with R...
>
> Thank you so much!
>
> Laura RM
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>


-- 
Jim Holtman
Cincinnati, OH
+1 513 646 9390

What is the problem that you are trying to solve?

Gavin Simpson

2009-Mar-27 19:21 UTC

head link

[R] : how to select rows at random

On Fri, 2009-03-27 at 15:11 -0400, Laura Rodriguez Murillo
wrote:> Hi dear list,
> 
> I have a list of around 2000 identifiers aranged in a dataframe in one
> column and I would like to choose a random subset of these. I wonder
> if somebody can tell me if I could do this with R...
Not sure what you mean by identifiers, but to select a subset of the
2000 cells in that column, you could use sample(). See ?sample for
details, but here is an example.

## choose a random subset of 500 out of 2000 entries
## dummy data
dat <- data.frame(identifiers = sample(2000, 2000), X = rnorm(2000))
## set seed to make this the same on your PC as mine
## comment this if you want a different subset each time you run
set.seed(1234)
## random subset of 500
want <- sample(2000, 500)
## select out that subset
## head to show only first n of the selected
head(dat$identifiers[want])

Gives:
> head(dat$identifiers[want])[1] 1327  587  835  430 1422 1687

This assumes the identifiers are unique.

HTH

G
> 
> Thank you so much!
> 
> Laura RM
> 
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.-- 
%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%
 Dr. Gavin Simpson             [t] +44 (0)20 7679 0522
 ECRC, UCL Geography,          [f] +44 (0)20 7679 0565
 Pearson Building,             [e] gavin.simpsonATNOSPAMucl.ac.uk
 Gower Street, London          [w] http://www.ucl.ac.uk/~ucfagls/
 UK. WC1E 6BT.                 [w] http://www.freshwaters.org.uk
%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%

R help - Mar 2009 - : how to select rows at random

[R] : how to select rows at random

[R] : how to select rows at random

[R] : how to select rows at random