Siddharth Arun
2013-Sep-04 12:05 UTC
[R-sig-Debian] Error in Parts of Speach Tagging using openNLP in Cloud environment
I have an Ubuntu Quantal 12.10 Server 64-bit instance. I am using openNLP
for POS Tagging of sentences.
I am using POS tagging using openNLP with “Parallel Lapply setup”. It is
running fine in RStudio environment. But in Ubuntu environment it is
showing the following error.
*Error in do.call(c, clusterApply(cl, x = splitList(X, length(cl)), fun lapply,
:*
* second argument must be a list*
Any suggestion for the problem I’m facing?
This is the code that I am using:
tagPOS <- function(x, ...) {
s <- as.String(x)
word_token_annotator <- Maxent_Word_Token_Annotator()
a2 <- Annotation(1L, "sentence", 1L, nchar(s))
a2 <- annotate(s, word_token_annotator, a2)
a3 <- annotate(s, PTA, a2)
a3w <- a3[a3$type == "word"]
POStags <- unlist(lapply(a3w$features, `[[`, "POS"))
POStagged <- paste(sprintf("%s/%s", s[a3w], POStags), collapse
= " ")
list(POStagged = POStagged, POStags = POStags)
}
cl <- makeCluster(mc <- getOption("cl.cores",
detectCores()/2))
clusterEvalQ(cl, {
library(openNLP)
library(NLP)
PTA <- Maxent_POS_Tag_Annotator()
})
This is the setup I am using:
· Created an cloud instance with "Ubuntu Quantal 12.10 Server 64-bit
instance"
· Installed LAMP server in the instance
· After which I installed R. By default R version was 2.15.0
· Upgraded the R version to R 3.0.1
--
Regards,
Siddharth Arun,
Contact No. - +91 8880065278
[[alternative HTML version deleted]]
