IAGO GINÉ VÁZQUEZ
2019-Sep-13 09:01 UTC
[Rd] Printing chinese characters (UTF-8) on R 3.5.2 -windows 10
I have a chinese character on a data frame, but the output of printing it is its UTF-8 code. Concretely, the character is ? and the code is U+6703. Following the code I arrive to the instruction> base::format.default("?")which prints [1] "<U+6703>" I do not know which is the extent of this behaviour either if it follows on most recent versions of R. Is it expected? Thank you! Iago [[alternative HTML version deleted]]
Tomas Kalibera
2019-Sep-13 09:24 UTC
[Rd] Printing chinese characters (UTF-8) on R 3.5.2 -windows 10
On 9/13/19 11:01 AM, IAGO GIN? V?ZQUEZ wrote:> I have a chinese character on a data frame, but the output of printing it is its UTF-8 code. Concretely, the character is ? and the code is U+6703. Following the code I arrive to the instruction > >> base::format.default("?") > which prints > > [1] "<U+6703>" > > I do not know which is the extent of this behaviour either if it follows on most recent versions of R. > > Is it expected?If you are running this on Windows in an encoding where the character cannot be represented (e.g. non-Chinese locale), then yes, this is expected behavior. On Unix systems where R can run in UTF-8 encoding (Linux, macOS), the character will be formatted/displayed properly. Best Tomas> > Thank you! > > Iago > > [[alternative HTML version deleted]] > > ______________________________________________ > R-devel at r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-devel
IAGO GINÉ VÁZQUEZ
2019-Sep-13 09:37 UTC
[Rd] Printing chinese characters (UTF-8) on R 3.5.2 -windows 10
But if I type> "?"the output is [1] "?" so seemingly it can be represented. Or, am I wrong? Best Iago ________________________________ De: Tomas Kalibera <tomas.kalibera at gmail.com> Enviat el: divendres, 13 de setembre de 2019 11:24 Per a: IAGO GIN? V?ZQUEZ <i.gine at pssjd.org>; r-devel at r-project.org <r-devel at r-project.org> Tema: Re: [Rd] Printing chinese characters (UTF-8) on R 3.5.2 -windows 10 On 9/13/19 11:01 AM, IAGO GIN? V?ZQUEZ wrote:> I have a chinese character on a data frame, but the output of printing it is its UTF-8 code. Concretely, the character is ? and the code is U+6703. Following the code I arrive to the instruction > >> base::format.default("?") > which prints > > [1] "<U+6703>" > > I do not know which is the extent of this behaviour either if it follows on most recent versions of R. > > Is it expected?If you are running this on Windows in an encoding where the character cannot be represented (e.g. non-Chinese locale), then yes, this is expected behavior. On Unix systems where R can run in UTF-8 encoding (Linux, macOS), the character will be formatted/displayed properly. Best Tomas> > Thank you! > > Iago > > [[alternative HTML version deleted]] > > ______________________________________________ > R-devel at r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-devel[[alternative HTML version deleted]]