Displaying 1 result from an estimated 1 matches for "chapter_vii_operational_mod".
2018 May 23
0
Using R htmlParse() for manipulating URLs to access multiple pages
...te here the exact URL, anyway, the structure is as follows:
https://home.lala.com/bibi/blabla/chapter_i_organization/101_contracts/whatever/,DanaInfo=intranet.lala.com+
https://home.lala.com/bibi/blabla/chapter_i_organization/125_bills/,DanaInfo=intranet.lala.com+
https://home.lala.com/bibi/blabla/chapter_vii_operational_modalities/701_wonderwall_18_oasis/701_wonderwall_18_oasis/
and so forth. Of course, I don't want to scrape the single URLs one by one. Hence, I am considering the base URL for parsing and to start from there onward.
baseurl <- htmlParse( "https://home.lala.com/bibi/blabla/",...