Patrick Zimmermann
2007-Jul-18 10:30 UTC
[R] how to combine presence only data sets to one presence/absence table
Problem: I have a Set of samples each with a list of observed species (presence only). Data is stored in a excel spreadsheet and the columns (spl) have different numbers of observations (spcs). Now I want to organize the data in a species by sample matrix with presence/absence style in R. data style (in excel): spl_A spl_B spl_C spcs1 spcs1 spcs2 spcs2 spcs3 spcs3 spcs4 spcs5 spcs5 desired style: spl_A spl_B spl_C spcs1 1 1 0 spcs2 1 0 1 spcs3 0 1 1 . . . How and in which form do I import the data to R? (read.table() seems not to be appropriate, as data is not organized as a table) How can I create the species by sample matrix? Thanks for any help, Patrick Zimmermann
Chuck Cleland
2007-Jul-18 11:03 UTC
[R] how to combine presence only data sets to one presence/absence table
Patrick Zimmermann wrote:> Problem: I have a Set of samples each with a list of observed species > (presence only). > Data is stored in a excel spreadsheet and the columns (spl) have > different numbers of observations (spcs). > Now I want to organize the data in a species by sample matrix with > presence/absence style in R. > > data style (in excel): > > spl_A spl_B spl_C > spcs1 spcs1 spcs2 > spcs2 spcs3 spcs3 > spcs4 spcs5 > spcs5 > > desired style: > > spl_A spl_B spl_C > spcs1 1 1 0 > spcs2 1 0 1 > spcs3 0 1 1 > . > . > . > > How and in which form do I import the data to R? > (read.table() seems not to be appropriate, as data is not organized as a table) > > How can I create the species by sample matrix?I'm not going to tackle how to read in the Excel data, but assuming you had several vectors of species names gather together in a list, you could construct a presence/absence data frame or matrix as follows: spl_A <- c("spcs1","spcs2","spcs4","spcs5") spl_B <- c("spcs1","spcs3") spl_C <- c("spcs2","spcs3","spcs5") mylist <- list(spl_A = spl_A, spl_B = spl_B, spl_C = spl_C) mymat <- sapply(mylist, function(x){as.numeric(sort(unique(unlist(mylist))) %in% x)}) rownames(mymat) <- sort(unique(unlist(mylist))) mymat spl_A spl_B spl_C spcs1 1 1 0 spcs2 1 0 1 spcs3 0 1 1 spcs4 1 0 0 spcs5 1 0 1> Thanks for any help, > Patrick Zimmermann > > ______________________________________________ > R-help at stat.math.ethz.ch mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code.-- Chuck Cleland, Ph.D. NDRI, Inc. 71 West 23rd Street, 8th floor New York, NY 10010 tel: (212) 845-4495 (Tu, Th) tel: (732) 512-0171 (M, W, F) fax: (917) 438-0894
Stephen Tucker
2007-Jul-18 12:52 UTC
[R] how to combine presence only data sets to one presence/absence table
I think you can still read as a table, just use argument fill=TRUE.
Reading from Excel in general: you can save data as 'csv' or
tab-delimited
file and then use read.csv or read.delim, respectively, or use one of the
packages listed in the following post (for some reason lines breaks are
messed up but hope you can extract the content):
http://tolstoy.newcastle.edu.au/R/e2/help/07/06/19925.html
## read in data
x <-
read.table(textConnection(
"spl_A spl_B spl_C
spcs1 spcs1 spcs2
spcs2 spcs3 spcs3
spcs4 spcs5
spcs5"
),fill=TRUE,header=TRUE,na.string="")
Then,
## 1. find unique
spcs <- sort(na.omit(unique(unlist(x))))
## 2. create matrix of zeros
mat <- matrix(0,ncol=ncol(x),nrow=length(spcs),
dimnames=list(spcs,names(x)))
## 3. assign zeros to matches
for( i in 1:ncol(mat) ) mat[match(x[,i],rownames(mat)),i] <- 1
Alternatively,
## find unique
spcs <- sort(na.omit(unique(unlist(x))))
## return the matrix you want (combine steps 2 and 3 from above)
sapply(x,function(.x,spcs)
"names<-"(ifelse(!is.na(match(spcs,.x)),1,0),spcs),spcs)
Hope this helps.
ST
--- Patrick Zimmermann <brassnotdead at googlemail.com> wrote:
> Problem: I have a Set of samples each with a list of observed species
> (presence only).
> Data is stored in a excel spreadsheet and the columns (spl) have
> different numbers of observations (spcs).
> Now I want to organize the data in a species by sample matrix with
> presence/absence style in R.
>
> data style (in excel):
>
> spl_A spl_B spl_C
> spcs1 spcs1 spcs2
> spcs2 spcs3 spcs3
> spcs4 spcs5
> spcs5
>
> desired style:
>
> spl_A spl_B spl_C
> spcs1 1 1 0
> spcs2 1 0 1
> spcs3 0 1 1
> .
> .
> .
>
> How and in which form do I import the data to R?
> (read.table() seems not to be appropriate, as data is not organized as a
> table)
>
> How can I create the species by sample matrix?
>
> Thanks for any help,
> Patrick Zimmermann
>
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>
antu
2011-Apr-21 05:06 UTC
[R] how to combine presence only data sets to one presence/absence table
What about the opposite of this, This has been very helpful for me, but at
the same time, I needed the opposite of this to..
ie, this to
spl_A spl_B spl_C
spcs1 1 1 0
spcs2 1 0 1
spcs3 0 1 1
this
spl_A spl_B spl_C
spcs1 spcs1 spcs2
spcs2 spcs3 spcs3
spcs4 spcs5
spcs5
Thank you
--
View this message in context:
http://r.789695.n4.nabble.com/how-to-combine-presence-only-data-sets-to-one-presence-absence-table-tp830140p3465121.html
Sent from the R help mailing list archive at Nabble.com.