Hello R-users, The below is a snippet of my data: fid crop year value 5_1_1 SWHE 1995 171 5_1_1 SWHE 1997 696 5_1_1 BARL 1996 114 5_1_1 BARL 1997 344 5_2_2 SWHE 1995 120 5_2_2 SWHE 1996 511 5_2_2 BARL 1996 239 5_2_2 BARL 1997 349 Here, I want to create dummy variables with the names of the content of a column 'crop' in a way that the new variable 'SWHE' would receive a value of 1 if the column 'crop' contains 'SWHE' and 0 otherwise. So, I would have two new variables SWHE and BARL as below: fid crop year value SWHE BARL 5_1_1 SWHE 1995 171 1 0 5_1_1 SWHE 1997 696 1 0 5_1_1 BARL 1996 114 0 1 5_1_1 BARL 1997 344 0 1 5_2_2 SWHE 1995 120 1 0 5_2_2 SWHE 1996 511 1 0 5_2_2 BARL 1996 239 0 1 5_2_2 BARL 1997 349 0 1 Cheers, Shyam Nepal [[alternative HTML version deleted]]
Dummy variables are not needed in R. Bert Sent from my iPhone -- please excuse typos. On Apr 20, 2013, at 11:23 AM, shyam basnet <shyamabc2002 at yahoo.com> wrote:> Hello R-users, > > The below is a snippet of my data: > > > fid crop year value > 5_1_1 SWHE 1995 171 > 5_1_1 SWHE 1997 696 > 5_1_1 BARL 1996 114 > 5_1_1 BARL 1997 344 > 5_2_2 SWHE 1995 120 > 5_2_2 SWHE 1996 511 > 5_2_2 BARL 1996 239 > 5_2_2 BARL 1997 349 > > Here, I want to create dummy variables with the names of the content of a column 'crop' in a way that the new variable 'SWHE' would receive a value of 1 if the column 'crop' contains 'SWHE' and 0 otherwise. So, I would have two new variables SWHE and BARL as below: > > > fid crop year value SWHE BARL > 5_1_1 SWHE 1995 171 1 0 > 5_1_1 SWHE 1997 696 1 0 > 5_1_1 BARL 1996 114 0 1 > 5_1_1 BARL 1997 344 0 1 > 5_2_2 SWHE 1995 120 1 0 > 5_2_2 SWHE 1996 511 1 0 > 5_2_2 BARL 1996 239 0 1 > 5_2_2 BARL 1997 349 0 1 > > > Cheers, > Shyam > Nepal > [[alternative HTML version deleted]] > > ______________________________________________ > R-help at r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code.
Hello Shyam, if your data is stored in variable "dataset", for example, the following code will create the desired dummy-coded variables and attach them to the dataset: ###################### #init vars SWHE=BARL <- vector(length=nrow(dataset)) SWHE[]=BARL[] <- 0 #initialize dummy-coded vars with all 0s #fill in variables SWHE[grep("SWHE", dataset$crop)] <- 1 #grep returns the indices where a match is found, see ?grep BARL[grep("BARL", dataset$crop)] <- 1 #attach new dummy codes to dataset dataset$SWHE <- SWHE dataset$BARL <- BARL ###################### Hope this helps, Patrick 2013/4/20 shyam basnet <shyamabc2002@yahoo.com>> Hello R-users, > > The below is a snippet of my data: > > > fid crop year value > 5_1_1 SWHE 1995 171 > 5_1_1 SWHE 1997 696 > 5_1_1 BARL 1996 114 > 5_1_1 BARL 1997 344 > 5_2_2 SWHE 1995 120 > 5_2_2 SWHE 1996 511 > 5_2_2 BARL 1996 239 > 5_2_2 BARL 1997 349 > > Here, I want to create dummy variables with the names of the content of a > column 'crop' in a way that the new variable 'SWHE' would receive a value > of 1 if the column 'crop' contains 'SWHE' and 0 otherwise. So, I would have > two new variables SWHE and BARL as below: > > > fid crop year value SWHE BARL > 5_1_1 SWHE 1995 171 1 0 > 5_1_1 SWHE 1997 696 1 0 > 5_1_1 BARL 1996 114 0 1 > 5_1_1 BARL 1997 344 0 1 > 5_2_2 SWHE 1995 120 1 0 > 5_2_2 SWHE 1996 511 1 0 > 5_2_2 BARL 1996 239 0 1 > 5_2_2 BARL 1997 349 0 1 > > > Cheers, > Shyam > Nepal > [[alternative HTML version deleted]] > > > ______________________________________________ > R-help@r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide > http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. > >[[alternative HTML version deleted]]