thr3ads.net - llvm dev - [LLVMdev] Quick question about Unicode support [Nov 2013]

If this information is useful, please help other people find it:
Share via:

Mikael Lyngvig

2013-Nov-08 12:35 UTC

[LLVMdev] Quick question about Unicode support

Hi,

I have decided to go the path of generating LLVM IR source from my
bootstrap compiler so that I can code it in C# instead of C++ (it is going
to be thrown away anyway).  As my language is going to support Unicode, I'm
wondering about whether or not LLVM is capable of reading UTF-8 encoded
source files?  I'm not thinking about string literals - they are easy to
handle by outputing them as a raw, albeit encoded byte sequence, but rather
Unicode-enabled program identifiers.

Do I need to keep my LLVM IR source file as a strict ASCII file or can the
reader handle UTF8?

Thanks in advance,
Mikael Lyngvig
-- Frogs dug channels on Earth millions of years before Man looked to Mars.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20131108/a3da7be9/attachment.html>

Eric Christopher

2013-Nov-12 21:41 UTC

head link

[LLVMdev] Quick question about Unicode support

On Fri, Nov 8, 2013 at 4:35 AM, Mikael Lyngvig <mikael at lyngvig.org>
wrote:> Hi,
>
> I have decided to go the path of generating LLVM IR source from my
bootstrap
> compiler so that I can code it in C# instead of C++ (it is going to be
> thrown away anyway).  As my language is going to support Unicode, I'm
> wondering about whether or not LLVM is capable of reading UTF-8 encoded
> source files?  I'm not thinking about string literals - they are easy
to
> handle by outputing them as a raw, albeit encoded byte sequence, but rather
> Unicode-enabled program identifiers.
>
> Do I need to keep my LLVM IR source file as a strict ASCII file or can the
> reader handle UTF8?
>
Since no one has responded and I'm not sure I'll wager a guess of
"probably the former" since as far as I can tell all of the strings
from other languages end up being encoded as escape sequences.

I'm sure if I'm wrong someone will correct me. :)

-eric
> Thanks in advance,
> Mikael Lyngvig
> -- Frogs dug channels on Earth millions of years before Man looked to Mars.
>
>
> _______________________________________________
> LLVM Developers mailing list
> LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>

Maybe Matching Threads

Search for more possibly parallel threads

llvm dev - Nov 2013 - [LLVMdev] Quick question about Unicode support

[LLVMdev] Quick question about Unicode support

[LLVMdev] Quick question about Unicode support

Maybe Matching Threads