On Fri, Nov 8, 2013 at 4:35 AM, Mikael Lyngvig <mikael at lyngvig.org>
wrote:> Hi,
>
> I have decided to go the path of generating LLVM IR source from my
bootstrap
> compiler so that I can code it in C# instead of C++ (it is going to be
> thrown away anyway). As my language is going to support Unicode, I'm
> wondering about whether or not LLVM is capable of reading UTF-8 encoded
> source files? I'm not thinking about string literals - they are easy
to
> handle by outputing them as a raw, albeit encoded byte sequence, but rather
> Unicode-enabled program identifiers.
>
> Do I need to keep my LLVM IR source file as a strict ASCII file or can the
> reader handle UTF8?
>
Since no one has responded and I'm not sure I'll wager a guess of
"probably the former" since as far as I can tell all of the strings
from other languages end up being encoded as escape sequences.
I'm sure if I'm wrong someone will correct me. :)
-eric
> Thanks in advance,
> Mikael Lyngvig
> -- Frogs dug channels on Earth millions of years before Man looked to Mars.
>
>
> _______________________________________________
> LLVM Developers mailing list
> LLVMdev at cs.uiuc.edu http://llvm.cs.uiuc.edu
> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>