thr3ads.net - similar to: "Extract subset of rows"

Displaying 20 results from an estimated 10000 matches similar to: "Extract subset of rows"

2010 Oct 15

Random assignment

Dear List, I am doing some simulation in R and need basic help! I have a list of animal families for which i know the number of species in each family. I am working under the assumption that a species has a 7.48% chance of being at risk. I want to simulate the number of species expected to be at risk under a random binomial distribution with 10,000 randomizations. I am relatively knew to

Randomly shuffle an array multiple times

2010 Oct 18

Randomly shuffle an array multiple times

Dear List, I have a table i have read into R: Name Yes/No John 0 Frank 1 Ann 0 James 1 Alex 1 etc - 800 different times. What i want to do is shuffle yes/no and randomly re-assign them to the name. I have used sample() and permute(), however there is no way to do this 1000 times. Furthermore, i want to copy the data into a excel spreadsheet in the same order as the data was input so i can

removing factor level represented by less than x rows

2005 Jul 08

removing factor level represented by less than x rows

In a number of different situations I'm trying to remove factor levels that are represented by less than a certain number of rows, e.g. if I had the dataset aa below and wanted to remove the species that are represented in less than 2 rows: data(iris) aa <- iris[1:101,] In this case, since I can see that the species virginica only has one row, I can write: table(aa$Species) setosa

Strange behavior using subset

2007 Aug 31

Strange behavior using subset

Dear R Gurus, Let's get the technical details out of the way first: Computer: 1.83 GHz MacBook R version 2.5.1 I have a data set that contains the following variables: site, species, total.vines. I need to partition the main data set by site, the further select only those species that occurred at each site. When I select by site (site.name<-subset(data frame,Site=="Site

Extract data

2011 Jan 06

Extract data

Dear List, I have a data frame called trait with roughly 800 species in, each species have 15 columns of information: Species 1 2 3 etc.. a t y h b f j u c r y u etc.. I then have another data frame called com with the composition of species in each region, there are 506 different communities: community species NA1102 a NA1102 c NA0402 b NA0402 c AT1302 a AT1302 b etc.. What

generating a data frame with a subset from another data frame

2007 Jul 08

generating a data frame with a subset from another data frame

R gurus, I have a data set that looks something like this: Site Species DBH #Vines G PLOC 45.9 4 G ACNE 23.3 1 G ACNE 12.0 0 G FRAM 35.9 5 G AEGL 11.2 2 N PLOC 77.3 12 N JUNI 78.6 7 N ACNE 18.9 1 N ACNE 15.7 3 N ACRU 35.5 4 H ACSA2 24.1 6 H ULAM 35.2 7 There are 730 individual trees (22 species) from four sites in the actual data set. I would like to create a second data frame that

using subset

2004 Dec 06

using subset

Hi, I have doubts in using subset command. I have a list of, lets say, 15 species and I want to make a subset with only 2 of them. I??m the command data2 <- subset(data1, species=="sp1"|species=="sp2") Nevertheless, when I ask for the summary (summary(data2)) the others species names still apearing. Also I tried data2<-data1(data1$species %in%

subset

2005 Feb 09

subset

Dear all, I am trying to extract rows from a data.frame based on the rowSums != 0. I want to preserve rownames in the first column in the subset. Does anyone know how to extract all species that don't have rowSums equal to zero? Here it is: # dataset x <- data.frame( species=c("sp.1","sp.2","sp.3","sp.4"), site1=c(2,3,0,0), site2=c(0,0,0,0),

fill a dataframe with zeros where the rows are a smaller subset of a larger dataframe (species by site)

2012 Apr 25

fill a dataframe with zeros where the rows are a smaller subset of a larger dataframe (species by site)

row <- c("a","b","c","d","e","f","g") #rows from larger data frame row.1 <- c("a","b","c","g") #rows of smaller data frame because d, e, and f don't contain any of the species, but the zeros are important x <- data.frame(sp1=rnorm(4), sp2=rnorm(4), sp3=rnorm(4),

passing arguments to subset from a function

2008 Dec 17

passing arguments to subset from a function

Hello R-helpers, I'm writing a long function in which I manipulate a certain number of datasets. I want the arguments of said function to allow me to adapt the way I do this. Among other things, I want my function to have an argument which I will pass on to subset() somewhere inside my function. Here is a quick and simplified example with the iris dataset. myfunction<-function(table,

selecting only specific rows in R

2010 Jun 05

selecting only specific rows in R

Hi, I have a data frame with columns as follows: tree species (independent variable) and several dependent variables (e.g. carbon, nitrogen, phosphorus). Each row represents one tree sample. Some tree samples are unique species in the data frame while other species were replicated (i.e. rows 1,2,3 may be identical for the "tree species" column but have different values for the

[test message] Can R replicate this data manipulation in SAS?

2011 Apr 20

[test message] Can R replicate this data manipulation in SAS?

Apologies for troubling the list, but it is a test that needs to be carried out. I am resending the message that I sent earlier on behalf of Paul Miller, but with a certain word used in the variables names of the SAS code changed to a different word. With thanks for your tolerance ... Ted. [*** PLEASE NOTE: I am sending this message on behalf of Paul Miller: Paul Miller <pjmiller_57 at

Adding a subset to a glm messes up factors?

2007 Dec 07

Adding a subset to a glm messes up factors?

Hi everyone, I have a problem with running a glm using a subset of my data. Whenever I choose a subset, in the summary the factors arent shown (as if the variable was a continuous variable). If I dont use subsets then all the factors are shown. I have copied the output from summary for both cases. Thanks for the help, Muri > model<-glm(log(cpue)~year,family=gaussian) Call: glm(formula =

Multiple subsets of data

2011 Jan 06

Multiple subsets of data

Extracting sums for individual factors in data frames

2007 Jul 02

Extracting sums for individual factors in data frames

I have a data frame with two columns, one of which is a factor (Species) and the other is numeric (BA, which stands for basal area). Here's a sample: Species BA ACSA 55.7632696 FRAM 122.9933524 ACSA 67.54424205 ACSA 89.22123136 ACSA 82.46680716 ACSA 22.46238747 ACSA 19.94911335 ACSA 20.42035225 ACSA 19.00663555 ACSA 21.67698931 ACSA 57.80530483 ACSA 30.31636911 Dead 43.98229715 Dead

0-only-rows in correspondence analysis

2003 Oct 22

0-only-rows in correspondence analysis

Hello, I am looking for a workaround of "species empty" plots in a correspondence analysis. I tried to do a community structure analysis with a ca via decorana(), ca() and CAIV(), but none of them allow 0 only-rows. I have species (abs./pres) in columns and sites in rows sp1 sp1 sp3 ... plot1 0 1 1 plot2 0 0 0 plot3 1 0 1 ..... but on some plots no

simple loop(?) analysing subsets

2010 Jul 18

simple loop(?) analysing subsets

Hi All, I have a large data set with many columns of data. One of these columns is a species identifier and the remainder are variables such as temperature or mass. Currently I am carrying out a single regression on subsets of the data set, e.g. separated data sets with only the data from one species at a time. I have been searching for a thread that will help me to understand how best to repeat

How to extract partial predictions, package mgcv

2009 Sep 14

How to extract partial predictions, package mgcv

Dear package mgcv users, I am using package mgcv to describe presence of a migratory bird species as a function of several variables, including year, day number (i.e. day-of-the-year), duration of survey, latitude and longitude. Thus, the "global model" is: global_model<-gam(present ~ as.factor(year) + s(dayno, k=5) + s(duration, k=5) + s(x, k=5) + s(y, k=5), family =

Calculating subsets "on the fly" with ddply

2010 Feb 03

Calculating subsets "on the fly" with ddply

Hi, [I sent this to the plyr mailing list (late) last night, but it seems to be lost in the moderation queue, so here's a shot to the broadeR community] Apologies in advance for being more verbose than necessary, but I'm not even sure how to ask this question in the context of plyr, so ... here goes. As meaningless as this might be to do with the `iris` data, the spirit of it is what

averaging two tables (rows with columns)

2012 May 10

averaging two tables (rows with columns)

Hi R user,I am struggling to figure out on how I can calculate the average from the two tables in R. Any one can help me? really your help would be grateful- I am spending so much time to figure it out. It should not be so hard, I think. I have very big data but I have created a hypothetical data for simplification. for example I have : table 1 table 1: species occurance data

similar to: Extract subset of rows