2012 May 11
Using xpathApply or getNodeSet to get text between two distinct tags
Hello:
The following code extracts the links to the daily transcripts of Canada's House of Commons. 'links' is a matrix of URLs (ncol=1), each of which points to one day's transcripts.
If you inspect the code for scrape(links[1]), you will find that an italic tag periodically appears inside a paragraph tag (<p some text><i>Translation</i></p>). At this point, the speaker is speaking French.
Then there are some <div> tags that contain some text, and then, after the speaker has returned to English, you get the same pattern as above, <p some text...