Hi Josh,
Thanks for helping me with my second topic of the day.
I am scraping from the fmpcloud website and trying to get data from their
survivorship-free data set (so they say).
The result is supposed to be in JSON format, but I can't figure out how to
get any of R's JSON parsers to parse the result.
The R code I use to get the initial data is
library(rvest)  # assumed: read_html() is available via rvest (re-exported from xml2)
html <- read_html("https://fmpcloud.io/api/v4/historical-price-full/ABI/2005-01-04?apikey=1af241eb87c43cfa7e124419cb180d7e")
Any (additional) guidance would be appreciated.
--John Sparks
________________________________
From: Joshua Ulrich <josh.m.ulrich at gmail.com>
Sent: Sunday, January 5, 2025 2:55 PM
To: Sparks, John <jspark4 at uic.edu>
Cc: r-help at r-project.org <r-help at r-project.org>
Subject: Re: [R] Help Parsing String? From HTML
On Sun, Jan 5, 2025 at 2:03 PM Sparks, John via R-help
<r-help at r-project.org> wrote:
>
> Hi,
>
> I am scraping some data from the internet and I got what I want but in a
> big long string (sort of) and can't figure out a way to parse it.
>
What site are you scraping? There may be an easier way to get the data on it.
> What I have gotten to is
>
> > mystring
> {xml_nodeset (1)}
> [1] <p>{\n "symbol": "ABI",\n "open": 21.04,\n "high": 21.05,\n "low": 20.06,\n "close": 20.2,\n "volume": 938700,\n "from": "2005-01-04"\n}</p>
>
> But I can't find a way to isolate the individual elements such as
> symbol, open, etc.
>
> I'll bet there is someone out there with a lot more experience at html
> parsing than me who can see a way to solve this in minutes.
>
> Any guidance would be appreciated.
>
> --John Sparks
>
> ______________________________________________
> R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide https://www.r-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
--
Joshua Ulrich | about.me/joshuaulrich
FOSS Trading | http://www.fosstrading.com/
On Sun, Jan 5, 2025 at 5:41 PM Sparks, John <jspark4 at uic.edu> wrote:
>
> Hi Josh,
>
> Thanks for helping me with my second topic of the day.
>
> I am scraping from the fmpcloud website and trying to get data from their
> survivorship free data (so they say).
>
> The result is supposed to be in json format, but I can't figure out how to
> get any of R's json interpreters to parse the result.
>
> The programming in R to get the initial data is
>
> html <- read_html("https://fmpcloud.io/api/v4/historical-price-full/ABI/2005-01-04?apikey=1af241eb87c43cfa7e124419cb180d7e")
>
> Any (additional) guidance would be appreciated.
>
That's a REST API endpoint. It responds with JSON, not HTML, so you don't
need read_html() at all. Note that jsonlite::parse_json() takes literal JSON
text or a connection, not a bare URL string, so wrap the URL in url():

x <- jsonlite::parse_json(url("https://fmpcloud.io/api/v4/historical-price-full/ABI/2005-01-04?apikey=1af241eb87c43cfa7e124419cb180d7e"))
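
For anyone following along without an API key, here is a minimal sketch of the parsing step on its own. It assumes the jsonlite package is installed and uses the JSON text exactly as it was printed earlier in this thread. The names json_text, bar, bar_df and fetch_bar are made up for illustration, and fetch_bar() is a hypothetical helper that is defined but not called here. If you already have the node set from read_html(), rvest::html_text() should give you the same text to pass to fromJSON().

```r
library(jsonlite)

# JSON text copied from the response quoted earlier in this thread
json_text <- '{
  "symbol": "ABI",
  "open": 21.04,
  "high": 21.05,
  "low": 20.06,
  "close": 20.2,
  "volume": 938700,
  "from": "2005-01-04"
}'

# fromJSON() turns a JSON object into a named list
bar <- fromJSON(json_text)
bar$symbol
bar$close

# Optionally reshape into a one-row data frame and convert the date column
bar_df <- as.data.frame(bar, stringsAsFactors = FALSE)
bar_df$from <- as.Date(bar_df$from)

# Hypothetical helper (not from the thread): fromJSON() also accepts a URL,
# so the endpoint can be read directly. Supply your own apikey.
fetch_bar <- function(symbol, date, apikey) {
  endpoint <- sprintf(
    "https://fmpcloud.io/api/v4/historical-price-full/%s/%s?apikey=%s",
    symbol, date, apikey
  )
  fromJSON(endpoint)
}
```

Once the result is a list or data frame, the individual elements (symbol, open, etc.) are ordinary R values, e.g. bar$open or bar_df[["volume"]].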