Displaying 20 results from an estimated 1000 matches similar to: "Trouble reading a UTF-16LE file"
2024 Feb 28
1
Trouble reading a UTF-16LE file
Try this:
> x <- file("C:\\Users\\Jim\\Downloads\\PV2-ch2 - R_Help.ANA",+ encoding = "UTF-16")> y <- readLines(x)> head(y)[1] "1\t36,74\t0" "2\t269,02\t-44" "1\t326,62\t29" "2\t354,52\t24"
[5] "8\t390,75\t1838" "2\t395,11\t-1053">
>
Thanks
Jim Holtman
*Data Munger Guru*
2024 Feb 28
0
Trouble reading a UTF-16LE file
Dear R-help,
I am having trouble reading a UTF-16LE formatted file. The issue appears to be a byte order mark at the beginning of the file. I have tried readLines(file, encoding='utf-16LE') but that got me
[1]"\xff\xfe1" "" "" "" "" ""
This is a tab delimited text file, despite the ANA extension. It is created by a custom
2024 Feb 28
1
Trouble reading a UTF-16LE file
The earlier post had an attached text file that did not go through.
I hope this link works. I tested it with a coworker, but that is no guarantee.
https://uflorida-my.sharepoint.com/:u:/g/personal/tebert_ufl_edu/EXf5u_CtTwJCrhdfTBIPr7wBefZHx4P_suj4wAWb8i8HFA?e=iQawhh
Regards,
Tim
2010 Aug 16
1
[PATCH] Install VirtIO storage and network drivers in Windows
Currently when converting a Windows guest we do a minimum installation of the
viostor driver, configure the RHEV guest agent and leave RHEV to properly
install viostor and all remaining drivers. This works well if RHEV is properly
configured and the installation is not interrupted on first boot.
However, if the target of the conversion is not RHEV, RHEV is not properly
configured, or the first
2016 Feb 16
2
iconv to UTF-16 encoding produces error due to embedded nulls (write.table with fileEncoding param)
If I execute the code from the "?write.table" examples section
x <- data.frame(a = I("a \" quote"), b = pi)
# (ommited code)
write.csv(x, file = "foo.csv", fileEncoding = "UTF-16LE")
the resulting CSV file has a size of 6 bytes which is too short
(truncated):
""",3
The problem seems to be the iconv function:
2019 Aug 04
1
iconv: embedded nulls when converting to UTF-16
R-devel community:
I have encountered some unexpected behavior using iconv, which may be the source of errors I am getting when connecting to a UTF-16 -encoded SQL Server database. A simple example is below.
When researching this problem, I found r-devel reports of the same problem in threads from June 2010 and February, 2016, and that bug #16738 was posted to Bugzilla as a result. However, I
2007 Oct 27
2
UTF-8 BOM
I'v encountered what seems to be a bug in PHP Markdown v1.0.1k. Given
a UTF-8 with a BOM, the first line is not parsed.
A brief search revealed a bug report for a similar issue in python-
markdown:
> I'm using the markdown.py command to generate HTML from Markdown
> documentation and I've found that if a utf-8 file starts with a BOM
> a paragraph is generated
2005 Mar 16
1
minor iconvlist() bug in r-devel
This is on r-devel from 2005-03-15. iconvlist() uses (at least some of
the time)
icfile <- system.file("iconvlist", package = "utils")
which looks like
"""
The following list contain all the coded character sets known. This
does not necessarily mean that all combinations of these names can be
used for the FROM and TO command line parameters. One coded
2016 Feb 23
4
iconv to UTF-16 encoding produces error due to embedded nulls (write.table with fileEncoding param)
>>>>> nospam at altfeld-im de <nospam at altfeld-im.de>
>>>>> on Mon, 22 Feb 2016 18:45:59 +0100 writes:
> Dear R developers
> I think I have found a bug that can be reproduced with two lines of code
> and I am very thankful to get your first assessment or feed-back on my
> report.
> If this is the wrong mailing list or I
2016 Feb 23
1
iconv to UTF-16 encoding produces error due to embedded nulls (write.table with fileEncoding param)
Excellent analysis, thank you both for the quick reply!
Is there anything I can do to get the bug fixed in the next version of R
(e. g. filing a bug report at https://bugs.r-project.org/bugzilla3/)?
On Tue, 2016-02-23 at 14:06 +0200, Mikko Korpela wrote:
> On 23.02.2016 11:37, Martin Maechler wrote:
> >>>>>> nospam at altfeld-im de <nospam at altfeld-im.de>
>
2010 Feb 16
7
YAML, UTF-8, TextMate, Notepad
This is not a question but a report on the difficulties I had and the
solution I found with respect to UTF-8, YAML::load, and Ruby/Rails.
Comments are appreciated.
- - -
I had been struggling for two days to get UTF-8 working in my Rails app.
I had/have a localization file, lib\locale\de.yml, that had iso-8859-1
encoding. I could not get that to display properly.
Marnen, quite correctly,
2005 Oct 19
1
Rails + Eclipse + Utf-8
I''m developing an international website that uses AJAX. In order to make the
international characters appear correctly, I have set my default char set to
UTF-8.
The problem seems to be that Eclipse even though has the setting to use
UTF-8 for encoding, doesn''t save it in the correct format. If I open up the
.rhtml file under Notepad++ it encodes the file correctly and accents
2016 Feb 24
2
iconv to UTF-16 encoding produces error due to embedded nulls (write.table with fileEncoding param)
On 23/02/2016 7:06 AM, Mikko Korpela wrote:
> On 23.02.2016 11:37, Martin Maechler wrote:
>>>>>>> nospam at altfeld-im de <nospam at altfeld-im.de>
>>>>>>> on Mon, 22 Feb 2016 18:45:59 +0100 writes:
>>
>> > Dear R developers
>> > I think I have found a bug that can be reproduced with two lines of code
2024 Sep 08
1
Reading a txt file from internet
On 2024-09-07 7:37 p.m., Jeff Newmiller wrote:
> I tried it on R 4.4.1 on Linux Mint 21.3 just before I posted it, and I just tried it on R 3.4.2 on Ubuntu 16.04 and R 4.3.2 on Windows 11 just now and it works on all of them.
>
> I don't have a big-endian machine to test on, but the Unicode spec says to honor the BOM and if there isn't one to assume that it is big-endian data.
2014 Dec 09
4
UTF8 markdown vignette
A few things to clarify:
1. You do not necessarily have to keep the \usepackage{} line if you
use %\VignetteEncoding{UTF-8}, because Pandoc will use UTF-8 anyway in
its LaTeX template.
2. Perhaps the vignette engine in R has done something clever to
convert utf8 to UTF-8, but I'd recommend %\VignetteEncoding{UTF-8}
instead of %\VignetteEncoding{utf8} to make sure it is a valid
encoding name,
2024 Sep 07
1
Reading a txt file from internet
On 2024-09-07 4:52 p.m., Jeff Newmiller via R-help wrote:
> When you specify LE in the encoding type, you are logically telling the decoder that you know the two-byte pairs are in little-endian order... which could override whatever the byte-order-mark was indicating. If the BOM indicated big-endian then the file decoding would break. If there is a BOM, don't override it unless you have to
2024 Sep 07
1
Reading a txt file from internet
I tried it on R 4.4.1 on Linux Mint 21.3 just before I posted it, and I just tried it on R 3.4.2 on Ubuntu 16.04 and R 4.3.2 on Windows 11 just now and it works on all of them.
I don't have a big-endian machine to test on, but the Unicode spec says to honor the BOM and if there isn't one to assume that it is big-endian data. But in this case there is a BOM so your machine has a buggy
2024 Sep 07
1
Reading a txt file from internet
When you specify LE in the encoding type, you are logically telling the decoder that you know the two-byte pairs are in little-endian order... which could override whatever the byte-order-mark was indicating. If the BOM indicated big-endian then the file decoding would break. If there is a BOM, don't override it unless you have to (e.g. for a wrong BOM)... leave off the LE unless you really
2024 Sep 07
1
Reading a txt file from internet
On Sun, 08 Sep 2024, Christofer Bogaso writes:
> Hi,
>
> I am trying to the data from
> https://online.stat.psu.edu/onlinecourses/sites/stat501/files/ch15/employee.txt
> without any success. Below is the error I am getting:
>
>> read.delim('https://online.stat.psu.edu/onlinecourses/sites/stat501/files/ch15/employee.txt')
>
> Error in make.names(col.names,
2017 Feb 18
11
[PATCH 0/8] Miscellaneous cleanups to Windows registry code.
A very miscellaneous set of cleanups to how we handle the Windows
registry in virt-v2v, firstboot, and inspection code. This should all
be straightforward non-controversial refactoring. Some highlights:
- Add a new mllib Registry module containing various utility
functions that are currently scattered all around.
- Only compute the software/system hive paths once during inspection,
and