Hi All,
I have a Data.frame that looks like that one below. I would like to do some text
mining on it to possibly find some patterns between Opis, ACklasifikacija and
Vodja. I looked over a tm package which loks promissing, more specifically
DocumentTermMatrix or TermDocumentMatrix. But I can not figure out how to change
my data from data.frame to Corpus or VCorpus.
Globina ACKlasifikacija
Opis GlobinaOd GlobinaDo Vodja
3671 8 GP SLABO GRADUIRAN PE©ÈEN
PROD DO r = 70 mm, PREVLADUJE DO r = 30 mm, GOST, SIV 0.30 4.05
Beljsak
3675 12 GP SLABO GRADUIRAN PE©ÈEN PROD DO r = 80mm,
PREVLADUJE DO r = 30mm, GOST, VLA®EN DO MOKER, SIV 0.40 7.50 Kovacic
3684 8 GP SLABO GRADUIRAN PE©ÈEN PROD DO r
= 70 mm, PREVLADUJE DO r = 30 mm, SREDNJE GOST, SIV 4.00 6.15 Beljsak
3689 10 GP SLABO GRADUIRAN PE©ÈEN PROD DO r = 80mm,
PREVLADUJE DO r = 30mm, GOST, VLA®EN DO MOKER, SIV 0.20 5.20 Kovacic
3695 10 GP SLABO GRADUIRAN PE©ÈEN PROD
DO r = 70mm, PREVLADUJE DO 30mm, GOST, VLA®EN, SIV 0.90 6.00 Kovacic
3699 10 GP SLABO GRADUIRAN PE©ÈEN PROD DO r =
90mm, PREVLADUJE DO r = 30mm, GOST, MOKER, SVETLORJAV 0.35 4.85
Kovacic
3706 10 GP SLABO GRADUIRAN PE©ÈEN PROD DO
r = 70mm, PREVLADUJE DO r = 30mM, GOST, VLA®EN, SIV 0.50 4.10 Kovacic
3713 10 GP SLABO GRADUIRAN PE©ÈEN PROD DO
r = 80mm, PREVLADUJE DO r = 30mm, GOST, VLA®EN, SIV 1.00 4.00 Kovacic
3739 32 GP SLABO GRADUIRAN, ZELO
PE©ÈEN PROD, MALO MELJAST, SREDNJE GOST, MOKER, SlV 15.40 16.00 Fasalek
3761 19 GP SLABO GRADUIRAN MELJAST
TER PE©ÈEN PROD, VLA®EN DO MOKER, PROD DO r = 50MM 7.10 11.00 Fasalek
3801 10 GP SLABO GRADUIRAN PE©ÈEN PROD DO r = 70 mm,
PREVLADUJE DO r = 30 mm, Z VEÈJIMI PRODNIKI, GOST, SIVO RJAV 0.60 4.50
Beljsak
Any help or ideas would be greatly appreciated,
m
[[alternative HTML version deleted]]
Hi All,
I have a Data.frame that looks like that one below. I would like to do some text
mining on it to possibly find some patterns between Opis, ACklasifikacija and
Vodja. I looked over a tm package which loks promissing, more specifically
DocumentTermMatrix or TermDocumentMatrix. But I can not figure out how to change
my data from data.frame to Corpus or VCorpus.
Globina ACKlasifikacija
Opis GlobinaOd GlobinaDo Vodja
3671 8 GP SLABO GRADUIRAN PE©ÈEN
PROD DO r = 70 mm, PREVLADUJE DO r = 30 mm, GOST, SIV 0.30 4.05
Beljsak
3675 12 GP SLABO GRADUIRAN PE©ÈEN PROD DO r = 80mm,
PREVLADUJE DO r = 30mm, GOST, VLA®EN DO MOKER, SIV 0.40 7.50 Kovacic
3684 8 GP SLABO GRADUIRAN PE©ÈEN PROD DO r
= 70 mm, PREVLADUJE DO r = 30 mm, SREDNJE GOST, SIV 4.00 6.15 Beljsak
3689 10 GP SLABO GRADUIRAN PE©ÈEN PROD DO r = 80mm,
PREVLADUJE DO r = 30mm, GOST, VLA®EN DO MOKER, SIV 0.20 5.20 Kovacic
3695 10 GP SLABO GRADUIRAN PE©ÈEN PROD
DO r = 70mm, PREVLADUJE DO 30mm, GOST, VLA®EN, SIV 0.90 6.00 Kovacic
3699 10 GP SLABO GRADUIRAN PE©ÈEN PROD DO r =
90mm, PREVLADUJE DO r = 30mm, GOST, MOKER, SVETLORJAV 0.35 4.85
Kovacic
3706 10 GP SLABO GRADUIRAN PE©ÈEN PROD DO
r = 70mm, PREVLADUJE DO r = 30mM, GOST, VLA®EN, SIV 0.50 4.10 Kovacic
3713 10 GP SLABO GRADUIRAN PE©ÈEN PROD DO
r = 80mm, PREVLADUJE DO r = 30mm, GOST, VLA®EN, SIV 1.00 4.00 Kovacic
3739 32 GP SLABO GRADUIRAN, ZELO
PE©ÈEN PROD, MALO MELJAST, SREDNJE GOST, MOKER, SlV 15.40 16.00 Fasalek
3761 19 GP SLABO GRADUIRAN MELJAST
TER PE©ÈEN PROD, VLA®EN DO MOKER, PROD DO r = 50MM 7.10 11.00 Fasalek
3801 10 GP SLABO GRADUIRAN PE©ÈEN PROD DO r = 70 mm,
PREVLADUJE DO r = 30 mm, Z VEÈJIMI PRODNIKI, GOST, SIVO RJAV 0.60 4.50
Beljsak
Any help or ideas would be greatly appreciated,
m
[[alternative HTML version deleted]]