thr3ads.net - llvm dev - [LLVMdev] Multimedia IO instructions & partial backend implementations for simple CPUs [Nov 2013]

If this information is useful, please help other people find it:
Share via:

Rob Stewart

2013-Nov-02 18:48 UTC

[LLVMdev] Multimedia IO instructions & partial backend implementations for simple CPUs

Hi, I'm getting started with LLVM, with the intention of writing a DSL
that compiles to LLVM IR, to target a new CPU instruction set. I have
two questions:

1. Multimedia IO in LLVM

In the paper "The LLVM Instruction Set and Compilation Strategy" [1],
there is talk about a possible multimedia instruction set in a future
LLVM release:

"Note that LLVM is a virtual instruction set: it does not deﬁne
runtime and operating system functions such as I/O, memory management
(in particular, garbage collection), signals, and many others. Some
issues that do belong in the instruction set and may be added in the
future include support for multimedia instructions (e.g., packed
operations),
predicated instructions, and explicit parallelism."

My question is a very basic one: how do I use LLVM IR code to do file
IO? As a simple example, take a file with 3 tab-separated integers.
How would I read this file in to an LLVM vector <3 x i32> ? Now take a
more complicated example: decoding a file to an LLVM vector
representation. To use the multimedia instruction set described above
(though I guess not implemented yet), how do I: A) read a greyscale
3x3 pixel image file, B) translate this to an LLVM vector <9 x i8> ?


2. Implementing an LLVM backend subset.

I have not yet delved deeply in to the requirements of implementing
LLVM backends to target new hardware instruction sets. Given a
constrained hardware implementation, is it acceptable to implement a
backend subset, and throwing "Not Supported" exceptions where ever
appropriate. To give a concrete example, take a simple CPU
architecture that does not support floating point arithmetic, but does
support simple integer arithmetic. If my LLVM frontend produces only
i32 values, then `llc -march=my_new_arch` doesn't complain. If,
however, my frontend produces float values, then `llc
-marc=my_new_arch` would complain with the "Type 'float' Not
Supported
by architecture my_new_arch".

Is this the philosophy of LLVM backend implementation? Must the new
target architecture support all LLVM instructions, or am I able to
develop an LLVM backend for an architecture that lacks features like
floating point arithmetic? Any related resources would be appreciated.

[1] - http://goo.gl/HA5AXU

--
Rob

Sean Silva

2013-Nov-03 05:44 UTC

head link

[LLVMdev] Multimedia IO instructions & partial backend implementations for simple CPUs

On Sat, Nov 2, 2013 at 2:48 PM, Rob Stewart <robstewart57 at gmail.com>
wrote:
> Hi, I'm getting started with LLVM, with the intention of writing a DSL
> that compiles to LLVM IR, to target a new CPU instruction set. I have
> two questions:
>
> 1. Multimedia IO in LLVM
>
> In the paper "The LLVM Instruction Set and Compilation Strategy"
[1],
> there is talk about a possible multimedia instruction set in a future
> LLVM release:
>
> "Note that LLVM is a virtual instruction set: it does not deﬁne
> runtime and operating system functions such as I/O, memory management
> (in particular, garbage collection), signals, and many others. Some
> issues that do belong in the instruction set and may be added in the
> future include support for multimedia instructions (e.g., packed
> operations),
> predicated instructions, and explicit parallelism."
>
We have vectors. No predicated instructions in the LLVM IR (only in the MI
layer). I'm not sure what you mean by "explicit parallelism" (do
you mean
something like Cilk?) but I don't think LLVM supports that (those things
usually have a nontrivial runtime component).

>
> My question is a very basic one: how do I use LLVM IR code to do file
> IO? As a simple example, take a file with 3 tab-separated integers.
> How would I read this file in to an LLVM vector <3 x i32> ? Now take
a
> more complicated example: decoding a file to an LLVM vector
> representation. To use the multimedia instruction set described above
> (though I guess not implemented yet), how do I: A) read a greyscale
> 3x3 pixel image file, B) translate this to an LLVM vector <9 x i8> ?
>
LLVM doesn't provide a runtime or "VM". You basically do these
things the
same way that you do them in C. Yes, this unfortunately requires knowing
your target platform's system libraries and how to link to them and such;
LLVM doesn't paper over this.

>
>
> 2. Implementing an LLVM backend subset.
>
> I have not yet delved deeply in to the requirements of implementing
> LLVM backends to target new hardware instruction sets. Given a
> constrained hardware implementation, is it acceptable to implement a
> backend subset, and throwing "Not Supported" exceptions where
ever
> appropriate. To give a concrete example, take a simple CPU
> architecture that does not support floating point arithmetic, but does
> support simple integer arithmetic. If my LLVM frontend produces only
> i32 values, then `llc -march=my_new_arch` doesn't complain. If,
> however, my frontend produces float values, then `llc
> -marc=my_new_arch` would complain with the "Type 'float' Not
Supported
> by architecture my_new_arch".
>
> Is this the philosophy of LLVM backend implementation? Must the new
> target architecture support all LLVM instructions, or am I able to
> develop an LLVM backend for an architecture that lacks features like
> floating point arithmetic? Any related resources would be appreciated.
>
If you don't want to merge this with trunk, then you can do whatever you
want; typically when starting out you will only implement a subset of
instructions (you can't implement them all simultaneously, now can you?).
The state of backend documentation is pretty dire. I brain dumped basically
all the backend docs I could think of in <
http://thread.gmane.org/gmane.comp.compilers.llvm.devel/65898>. That thread
also has some other good pointers for a person interested in writing a
backend.

Getting a backend intro trunk generally speaking is a large long-term
commitment. I'm not sure what the exact requirements are for getting a
backend into trunk (I don't think anybody really does at a deep level,
other than "some critical subset of experienced LLVM developers think it
looks good enough to be committed"), but the biggest hurdle (generally)
with getting a backend into LLVM trunk is that it requires a demonstrated
ability to work with the community and a clear investment in maintaining
the backend from now into the foreseeable future (to put it bluntly, this
generally will mean at least 1 or 2 people with a paycheck tied to
maintaining the backend and contributing to the community, and these being
people that have a history of submitting good patches and that are familiar
with the community expectations). (The reason for these requirements is
purely experience with other backends, which have been a maintainability
nightmare without active developers responsible for maintaining them and
compensating for the increase in general codebase complexity (e.g. each new
backend makes refactoring the target-independent backend parts more
difficult)).

-- Sean Silva

>
> [1] - http://goo.gl/HA5AXU
>
> --
> Rob
>
> _______________________________________________
> LLVM Developers mailing list
> LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20131103/8538b99f/attachment.html>

Pekka Jääskeläinen

2013-Nov-03 09:08 UTC

head link

[LLVMdev] Multimedia IO instructions & partial backend implementations for simple CPUs

On 11/03/2013 07:44 AM, Sean Silva wrote:> We have vectors. No predicated instructions in the LLVM IR (only in the MI
> layer). I'm not sure what you mean by "explicit parallelism"
(do you mean
> something like Cilk?) but I don't think LLVM supports that (those
things usually
> have a nontrivial runtime component).
There's some support for (usually explicit from the language at hand) data
parallel loops; the parallel loop metadata:

http://llvm.org/docs/LangRef.html#llvm-mem-parallel-loop-access-metadata

#pragma simd and even cilk_for could use that.

-- 
--Pekka

Rob Stewart

2013-Nov-05 14:39 UTC

head link

[LLVMdev] Multimedia IO instructions & partial backend implementations for simple CPUs

On 3 November 2013 05:44, Sean Silva <chisophugis at gmail.com> wrote:
> LLVM doesn't provide a runtime or "VM". You basically do
these things the
> same way that you do them in C. Yes, this unfortunately requires knowing
> your target platform's system libraries and how to link to them and
such;
> LLVM doesn't paper over this.
OK. So to be specific, I am using a Haskell language binding to LLVM,
not C. With my multimedia IO examples, am I correct in thinking I have
a few options:
1. Do IO in my host language, and parse bytestrings in to LLVM data
structures, e.g. vectors. Then, pass these data structures to LLVM
generated code for JIT compilation.
2. Write IO functions in C, and compile with -emit-llvm . Then in my
LLVM code generation, I read this external function from a bitcode
file generated by clang. Here, there is no IO in my host language.
3. Call libc functions within LLVM to parse bytestrings directly in to
structs or vectors. If libc embedded in LLVM even possible?

Is there an LLVM cookbook for interaction with IO runtime systems?
> The state of backend documentation is pretty dire. I brain dumped basically
> all the backend docs I could think of in
> <http://thread.gmane.org/gmane.comp.compilers.llvm.devel/65898>. That
thread
> also has some other good pointers for a person interested in writing a
> backend.
That's a great resource, thanks.

One thing I'd really appreciate is a cookbook on LLVM data structures.
I have read the language reference http://llvm.org/docs/LangRef.html ,
and understand the expressivity of aggregate types. What I do not yet
have a good feeling for is when to use them. To give a concrete
example, I'd like to parse a greyscale image in to an LLVM data
structure. At each {x,y} point, there is an Int8 value between 0 and
255. Take a small 4x3 image. I could feed my pixels in to a flat Int8
vector of length 12. I could also feed it in to an array of length 4,
of Int8 arrays of length 3.

Now take 2 simple functions: one does greyscale brightening, the other
does a sobel filter. The first needs only to know the value of 1 pixel
at a time, i.e. to increase its value. For this, the vector option
would be fine, and I assume (naively) that I'd enjoy SIMD performance
over this vector, executing `add x` to each element? However, the
Sobel filter needs information not only about the value of a pixel,
but also the values of its surrounding pixels. In this case, the 2D
array would be more suitable, as the shape of the image is known.
Would I lose SIMD vectorisation, probably? Or as a third option, would
I use a struct with a triple of three elements: a vector, and two Int8
values indicating the X and Y lengths of the image?

What I'm after is a cookbook for LLVM data structures, and how to
apply them. E.g, when to use structs, when to use aggregated types,
and how to hold on to SIMD vectorisation when ever is possible.

--
Rob

Reasonably Related Threads

Search for more possibly parallel threads

llvm dev - Nov 2013 - [LLVMdev] Multimedia IO instructions & partial backend implementations for simple CPUs

[LLVMdev] Multimedia IO instructions & partial backend implementations for simple CPUs

[LLVMdev] Multimedia IO instructions & partial backend implementations for simple CPUs

[LLVMdev] Multimedia IO instructions & partial backend implementations for simple CPUs

[LLVMdev] Multimedia IO instructions & partial backend implementations for simple CPUs

Reasonably Related Threads