Hi all, I started to ask this in the other post, but it is off topis...so here it is again. I have a data.frame (created with the helpof this mail list) that looks like this : 'data.frame': 22801 obs. of 15 variables: $ V1 : chr "HUMUS" "SLABO" "MALO" "SLABO" ... $ V2 : chr "IN" "GRANULIRAN" "PREPEREL" "VEZAN" ... $ V3 : chr "HUMUSNA" "PE©ÈEN" "MELJAST" ",KONGLOMERAT," ... $ V4 : chr "GLINA" "PROD" "PROD" "P0ROZEN," ... $ V5 : chr "Z" "DO" "DO" "S" ... $ V6 : chr "MALO" "r" "r" "PLASTMI" ... $ V7 : chr "PODA," "=" "=" "GFs," ... $ V8 : chr "LAHKO" "8Q" "60mm," "SIVORJAV" ... $ V9 : chr "GNETNA," "mm," "S" "" ... $ V10: chr "RJAVA" "S" "PRODNIKI," "" ... $ V11: chr "" "PRODNIKI" "MALO" "" ... $ V12: chr "" "DO" "PE©ÈEN" "" ... $ V13: chr "" "R" "S" "" ... $ V14: chr "" "=" "TANKIMI" "" ... Is it possible to count which word occours most often in each field (V1, V2, V3, ...) and which one is the second and so on. Ideally i would like to create a table for each field (V1, V2, V3, ...) with the prevailing word and the number of occurancies of that word in that field (column) . Hope that explains it ok... Thank you, m [[alternative HTML version deleted]]
Nevermind, i think summary() does this ... -----Original Message----- From: r-help-bounces at r-project.org [mailto:r-help-bounces at r-project.org] On Behalf Of Matev? Pavli? Sent: Tuesday, November 02, 2010 10:12 PM To: r-help at r-project.org Subject: [R] count different words in a field Hi all, I started to ask this in the other post, but it is off topis...so here it is again. I have a data.frame (created with the helpof this mail list) that looks like this : 'data.frame': 22801 obs. of 15 variables: $ V1 : chr "HUMUS" "SLABO" "MALO" "SLABO" ... $ V2 : chr "IN" "GRANULIRAN" "PREPEREL" "VEZAN" ... $ V3 : chr "HUMUSNA" "PE??EN" "MELJAST" ",KONGLOMERAT," ... $ V4 : chr "GLINA" "PROD" "PROD" "P0ROZEN," ... $ V5 : chr "Z" "DO" "DO" "S" ... $ V6 : chr "MALO" "r" "r" "PLASTMI" ... $ V7 : chr "PODA," "=" "=" "GFs," ... $ V8 : chr "LAHKO" "8Q" "60mm," "SIVORJAV" ... $ V9 : chr "GNETNA," "mm," "S" "" ... $ V10: chr "RJAVA" "S" "PRODNIKI," "" ... $ V11: chr "" "PRODNIKI" "MALO" "" ... $ V12: chr "" "DO" "PE??EN" "" ... $ V13: chr "" "R" "S" "" ... $ V14: chr "" "=" "TANKIMI" "" ... Is it possible to count which word occours most often in each field (V1, V2, V3, ...) and which one is the second and so on. Ideally i would like to create a table for each field (V1, V2, V3, ...) with the prevailing word and the number of occurancies of that word in that field (column) . Hope that explains it ok... Thank you, m [[alternative HTML version deleted]]
On Nov 2, 2010, at 5:11 PM, Matev? Pavli? wrote:> Hi all, > > > > I started to ask this in the other post, but it is off topis...so > here it is again. > > > > I have a data.frame (created with the helpof this mail list) that > looks like this : >? table > tbl <- table(c("HUMUS", "SLABO", "MALO", "SLABO")) > tbl[order(tbl)][1] HUMUS 1 Just make a function that does this to a vector and use lapply(dfrm, func) on the dataframe. -- David.> > > 'data.frame': 22801 obs. of 15 variables: > > $ V1 : chr "HUMUS" "SLABO" "MALO" "SLABO" ... > > $ V2 : chr "IN" "GRANULIRAN" "PREPEREL" "VEZAN" ... > > $ V3 : chr "HUMUSNA" "PE??EN" "MELJAST" ",KONGLOMERAT," ... > > $ V4 : chr "GLINA" "PROD" "PROD" "P0ROZEN," ... > > $ V5 : chr "Z" "DO" "DO" "S" ... > > $ V6 : chr "MALO" "r" "r" "PLASTMI" ... > > $ V7 : chr "PODA," "=" "=" "GFs," ... > > $ V8 : chr "LAHKO" "8Q" "60mm," "SIVORJAV" ... > > $ V9 : chr "GNETNA," "mm," "S" "" ... > > $ V10: chr "RJAVA" "S" "PRODNIKI," "" ... > > $ V11: chr "" "PRODNIKI" "MALO" "" ... > > $ V12: chr "" "DO" "PE??EN" "" ... > > $ V13: chr "" "R" "S" "" ... > > $ V14: chr "" "=" "TANKIMI" "" ... > > > > Is it possible to count which word occours most often in each field > (V1, V2, V3, ...) and which one is the second and so on. Ideally i > would like to create a table for each field (V1, V2, V3, ...) with > the prevailing word and the number of occurancies of that word in > that field (column) . > > > > Hope that explains it ok... > > > > Thank you, m > > > > > > > > > [[alternative HTML version deleted]] > > ______________________________________________ > R-help at r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code.David Winsemius, MD West Hartford, CT