Displaying 3 results from an estimated 3 matches for "read_ndjson".
2017 Sep 03
2
readLines() segfaults on large file & question on how to work around
Jeroen:
Thank you for pointing me to ndjson, which I had not heard of and is
exactly my case.
My experience:
jsonlite::stream_in - segfaults
ndjson::stream_in - my fault, I am running Ubuntu 14.04 and it is too old
so it won't compile the package
corpus::read_ndjson - works!!! Of course it does a different simplification
than jsonlite::fromJSON, so I have to change some code, but it works
beautifully at least in simple tests. The memory-map option may be of
use in the future.
Another correspondent said that strings in R can only be 2^31-1 long,...
2017 Sep 04
0
readLines() segfaults on large file & question on how to work around
...> Thank you for pointing me to ndjson, which I had not heard of and is
> exactly my case.
>
> My experience:
> jsonlite::stream_in - segfaults
> ndjson::stream_in - my fault, I am running Ubuntu 14.04 and it is too old
> so it won't compile the package
> corpus::read_ndjson - works!!! Of course it does a different simplification
> than jsonlite::fromJSON, so I have to change some code, but it works
> beautifully at least in simple tests. The memory-map option may be of
> use in the future.
>
> Another correspondent said that strings i...
2017 Sep 02
5
readLines() segfaults on large file & question on how to work around
Hi:
I have a 2.1GB JSON file. Typically I use readLines() and
jsonlite:fromJSON() to extract data from a JSON file.
When I try and read in this file using readLines() R segfaults.
I believe the two salient issues with this file are
1). Its size
2). It is a single line (no line breaks)
I can reproduce this issue as follows
#Generate a big file with no line breaks
# In R
>