Hi, I’m using R (TM package) for text mining and I’m having problems
filtering articles out of my data set by local meta data.
Here is the code:
*data <- ("C:/… /19970331")*
* *
* *
*rs <- ReutersSource(data , encoding = "UTF-8")*
*RC <- VCorpus(DirSource(data), readerControl = list(reader readRCV1asPlain,*
*
language = "en_US",*
*
load = TRUE),*
*
dbControl = list(useDb = TRUE,*
*
dbName = "texts.db",*
*
dbType = "DB1"))*
* *
* *
* *
*tm_index(RC, FUN = sFilter, doclevel = F, useMeta = T, "Topics ==
'MCAT'")
*
* *
* *
When I use sFilter, I can only filter fields in yellow, I want to filter
fields in red, what am I doing wrong?
Thanks, Andy
This is meta data that is attached to each article
Available meta data pairs are:
Author :
DateTimeStamp: 1997-03-31
Description :
Heading : USA: WHX begins tender offer for Dynamics Corp.
ID : 476871
Language : en_US
Origin : Reuters Corpus Volume 1
User-defined local meta data pairs are:
$Publisher
[1] "Reuters Holdings Plc"
$Topics
[1] "C18" "C181" "CCAT"
$Industries
[1] "I22100" "I34000"
$Countries
[1] "USA"
[[alternative HTML version deleted]]