Siddharth Arun
2013-Sep-04  12:05 UTC
[R-sig-Debian] Error in Parts of Speach Tagging using openNLP in Cloud environment
I have an Ubuntu Quantal 12.10 Server 64-bit instance. I am using openNLP
for POS Tagging of sentences.
I am using POS tagging using openNLP with “Parallel Lapply setup”. It is
running fine in RStudio environment. But in Ubuntu environment it is
showing the following error.
*Error in do.call(c, clusterApply(cl, x = splitList(X, length(cl)), fun lapply, 
:*
*  second argument must be a list*
Any suggestion for the problem I’m facing?
This is the code that I am using:
tagPOS <-  function(x, ...) {
    s <- as.String(x)
    word_token_annotator <- Maxent_Word_Token_Annotator()
    a2 <- Annotation(1L, "sentence", 1L, nchar(s))
    a2 <- annotate(s, word_token_annotator, a2)
    a3 <- annotate(s, PTA, a2)
    a3w <- a3[a3$type == "word"]
    POStags <- unlist(lapply(a3w$features, `[[`, "POS"))
    POStagged <- paste(sprintf("%s/%s", s[a3w], POStags), collapse
= " ")
    list(POStagged = POStagged, POStags = POStags)
  }
  cl <- makeCluster(mc <- getOption("cl.cores",
detectCores()/2))
  clusterEvalQ(cl, {
    library(openNLP)
    library(NLP)
    PTA <- Maxent_POS_Tag_Annotator()
  })
This is the setup I am using:
· Created an cloud instance with "Ubuntu Quantal 12.10 Server 64-bit
instance"
· Installed LAMP server in the instance
· After which I installed R. By default R version was 2.15.0
· Upgraded the R version to R 3.0.1
-- 
Regards,
Siddharth Arun,
Contact No. - +91 8880065278
	[[alternative HTML version deleted]]
