2012 May 11
Using xpathApply or getNodeSet to get text between two distinct tags
...' and then return the text between 'English' and 'Translation' and mark it as English.
Does anyone have any suggestions? Yours truly,
Simon J. Kiss
#Necessary libraries
library(XML)
library(scrapeR)
#URL for links to 2012 transcripts
hansard<-c('http://www.parl.gc.ca/housechamberbusiness/ChamberSittings.aspx?View=H&Language=E&Mode=1&Parl=41&Ses=1')
#Scrape the page with the links
doc<-scrape(url=hansard, parse=TRUE, follow=TRUE)
#scrape() returns a list with one parsed document per URL; keep the first (and only) one
doc<-doc[[1]]
#Get the root node of the parsed document
doc<- xmlRoot(doc)
#Get no...