Hi Josh, Thanks for helping me with my second topic of the day. I am scraping from the fmpcloud website and trying to get data from their survivorship free data (so they say). The result is supposed to be in json format, but I can't figure out how to get any of R's json interpreters to parse the result. The programming in R to get the initial data is html <- read_html("https://fmpcloud.io/api/v4/historical-price-full/ABI/2005-01-04?apikey=1af241eb87c43cfa7e124419cb180d7e") Any (additional) guidance would be appreciated. --John Sparks ________________________________ From: Joshua Ulrich <josh.m.ulrich at gmail.com> Sent: Sunday, January 5, 2025 2:55 PM To: Sparks, John <jspark4 at uic.edu> Cc: r-help at r-project.org <r-help at r-project.org> Subject: Re: [R] Help Parsing String? From HTML CAUTION: External Sender On Sun, Jan 5, 2025 at 2:03?PM Sparks, John via R-help <r-help at r-project.org> wrote:> > Hi, > > I am scraping some data from the internet and I got what I want but in a big long string (sort of) and can't figure out a way to parse it. >What site are you scraping? There may be an easier way to get the data on it.> What I have gotten to is > > > mystring > {xml_nodeset (1)} > [1] <p>{\n "symbol": "ABI",\n "open": 21.04,\n "high": 21.05,\n "low": 20.06,\n "close": 20.2,\n "volume": 938700,\n "from": "2005-01-04"\n}</p> > > But I can't find a way to isolate the individual elements such as symbol, open, etc. > > I'll bet there is someone out there with a lot more experience at html parsing than me who can see a way to solve this in minutes. > > Any guidance would be appreciated. > > --John Sparks > > [[alternative HTML version deleted]] > > ______________________________________________ > R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see > https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fstat.ethz.ch%2Fmailman%2Flistinfo%2Fr-help&data=05%7C02%7Cjspark4%40uic.edu%7C7bd4229e6e74413d678c08dd2dcb5cff%7Ce202cd477a564baa99e3e3b71a7c77dd%7C0%7C0%7C638717073670167325%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=vpJ%2Bva%2BI%2Fnm1bBXakuB34TSW%2BV832R4sOgyYCAq6iIA%3D&reserved=0<https://stat.ethz.ch/mailman/listinfo/r-help> > PLEASE do read the posting guide https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.r-project.org%2Fposting-guide.html&data=05%7C02%7Cjspark4%40uic.edu%7C7bd4229e6e74413d678c08dd2dcb5cff%7Ce202cd477a564baa99e3e3b71a7c77dd%7C0%7C0%7C638717073670183537%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=HDxFV4GG6w%2Fut71t3IHHEz%2FTw3kiKYZeuxZvvbGCG7A%3D&reserved=0<https://www.r-project.org/posting-guide.html> > and provide commented, minimal, self-contained, reproducible code.-- Joshua Ulrich | about.me/joshuaulrich FOSS Trading | https://nam04.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.fosstrading.com%2F&data=05%7C02%7Cjspark4%40uic.edu%7C7bd4229e6e74413d678c08dd2dcb5cff%7Ce202cd477a564baa99e3e3b71a7c77dd%7C0%7C0%7C638717073670195224%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=RSnWCt4%2BS04spT8n1DgklRukVCYHDdoNUW%2BGMtA4iAQ%3D&reserved=0<http://www.fosstrading.com/> This email originated from outside the University of Illinois System. Use caution when replying, clicking links, or opening attachments. DO NOT reply to any requests asking you to reply from a personal account or SMS. [[alternative HTML version deleted]]
On Sun, Jan 5, 2025 at 5:41?PM Sparks, John <jspark4 at uic.edu> wrote:> > Hi Josh, > > Thanks for helping me with my second topic of the day. > > I am scraping from the fmpcloud website and trying to get data from their survivorship free data (so they say). > > The result is supposed to be in json format, but I can't figure out how to get any of R's json interpreters to parse the result. > > The programming in R to get the initial data is > > html <- read_html("https://fmpcloud.io/api/v4/historical-price-full/ABI/2005-01-04?apikey=1af241eb87c43cfa7e124419cb180d7e") > > Any (additional) guidance would be appreciated. >That's a REST API endpoint. It responds with JSON, not HTML. So you just need x <- jsonlite::parse_json("https://fmpcloud.io/api/v4/historical-price-full/ABI/2005-01-04?apikey=1af241eb87c43cfa7e124419cb180d7e")> --John Sparks > ________________________________ > From: Joshua Ulrich <josh.m.ulrich at gmail.com> > Sent: Sunday, January 5, 2025 2:55 PM > To: Sparks, John <jspark4 at uic.edu> > Cc: r-help at r-project.org <r-help at r-project.org> > Subject: Re: [R] Help Parsing String? From HTML > > CAUTION: External Sender > > On Sun, Jan 5, 2025 at 2:03?PM Sparks, John via R-help > <r-help at r-project.org> wrote: > > > > Hi, > > > > I am scraping some data from the internet and I got what I want but in a big long string (sort of) and can't figure out a way to parse it. > > > What site are you scraping? There may be an easier way to get the data on it. > > > What I have gotten to is > > > > > mystring > > {xml_nodeset (1)} > > [1] <p>{\n "symbol": "ABI",\n "open": 21.04,\n "high": 21.05,\n "low": 20.06,\n "close": 20.2,\n "volume": 938700,\n "from": "2005-01-04"\n}</p> > > > > But I can't find a way to isolate the individual elements such as symbol, open, etc. > > > > I'll bet there is someone out there with a lot more experience at html parsing than me who can see a way to solve this in minutes. > > > > Any guidance would be appreciated. > > > > --John Sparks > > > > [[alternative HTML version deleted]] > > > > ______________________________________________ > > R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see > > https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fstat.ethz.ch%2Fmailman%2Flistinfo%2Fr-help&data=05%7C02%7Cjspark4%40uic.edu%7C7bd4229e6e74413d678c08dd2dcb5cff%7Ce202cd477a564baa99e3e3b71a7c77dd%7C0%7C0%7C638717073670167325%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=vpJ%2Bva%2BI%2Fnm1bBXakuB34TSW%2BV832R4sOgyYCAq6iIA%3D&reserved=0 > > PLEASE do read the posting guide https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.r-project.org%2Fposting-guide.html&data=05%7C02%7Cjspark4%40uic.edu%7C7bd4229e6e74413d678c08dd2dcb5cff%7Ce202cd477a564baa99e3e3b71a7c77dd%7C0%7C0%7C638717073670183537%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=HDxFV4GG6w%2Fut71t3IHHEz%2FTw3kiKYZeuxZvvbGCG7A%3D&reserved=0 > > and provide commented, minimal, self-contained, reproducible code. > > > > -- > Joshua Ulrich | about.me/joshuaulrich > > This email originated from outside the University of Illinois System. Use caution when replying, clicking links, or opening attachments. DO NOT reply to any requests asking you to reply from a personal account or SMS.