thr3ads.net - llvm dev - [LLVMdev] Creating an LLVM backend for a very small stack machine [Feb 2009]

If this information is useful, please help other people find it:
Share via:

Wesley J. Landaker

2009-Feb-22 23:25 UTC

[LLVMdev] Creating an LLVM backend for a very small stack machine

Hi folks,

I am interesting in creating an LLVM backend for a very small stack machine. 

Before I start in on the actual implementation, I'd like to make sure that 
I'm on the right track. Any comments, suggestions, warnings, tips, etc 
would be greatly appreciated.

Background
----------

There are a number of small as other embedded microprocessors that are often 
used in FPGAs such as Xilinx's Picoblaze, Lattice's Mico8, or Bernd 
Paysan's b16-small.

I am developping a very small stack machine primarily for use inside FPGAs 
and ASICs. This serves roughly the same niche as the above processors, 
however from my perpsective, my design has a number of advantages. For 
example, mine has a much smaller minimal configuration, has higher code 
density, is much more parameterizable, fully supports indirect branches, is 
FPGA-architecture independent, etc.

The main goal, however, is to not just have a shiny new architecture, but to 
have an entire optimized supporting toolchain. I am hoping to build on LLVM 
as infrastructure. I don't think I need to sell you all on the reasons why 
this would be handy.

I am already pretty "familiar" with LLVM, in that I have successfully
made
my own front-end DSL for another project by basically following the 
Kalidescope tutorial. I have a good grasp of the LLVM assembly language, 
and have a lot of notes (just a sanity-check paper excercize) about how I 
would lower just about every instruction into my architecture if I were 
doing it by hand. I've read through all the LLVM documentation available 
online, and (superficially) looked at a lot of the other backend targets. I 
have written custom assemblers and peephole optimizers outside of LLVM in 
the past. I am familiar academically with stack-machine specific "register 
allocation" optimizations. I'm basically ready to get started on an
LLVM
backend for my processor.

My main concerns are that in my searching online, I've gleaned a number of 
things that urge me to caution. I don't know how much of this is
"true",
but it's my impression after spending a lot of time googling:

  * A Picoblaze backend was apparently attempted both for LLVM and GCC, but 
apparently didn't go anywhere on either due to missing support for register 
indirect jumps, and possibly for other reasons. Of course my processor does 
support this, but ... 

  * There were a number of threads in various GCC lists and other places 
implying that, at least for GCC, targeting a stack machine would be very 
very difficult because of it's backend assumptions. Of course LLVM is not 
GCC, but TableGen seems register-biased ...

  * There seems to be a (to me) strange negative vibe whenever someone 
brings up targeting a compiler to a very small microprocessor, usually 
including an argument that it's not worth it. Of course, I'm the one
doing
the work, and I already have decided it's worth it ...

Anyway, hopefully that gives you an idea of where I'm coming from. Before I 
get too knee-deep into writing a backend, I have a few questions that I 
hope someone can help me with.

Questions
---------

  * Has anyone else out there targeted (or tried to target) a stack machine 
before? Was it successfull? What problems did you have? 

  * What parts of the LLVM backend code generator infrastructure would be 
usable for targeting a stack machine? e.g. Is it even possible to use 
TableGen to target a stack machine?

  * When/where/how do things like big integer (iXXXXX), phi nodes, llvm.* 
instrincs get lowered; e.g. does my target have to do that, or is it done 
generically?

Ultimtely, I'm wondering if targeting a stack machine with the current LLVM 
infrastructure is going to be somewhat straightforward even if it's not 
totally optimal (desirable), or if it's going to be so problematic that
I'd
be better off implementing an entire new code-generator myself 
(undesirable).

Any other comments or discussion is welcome.

All of my work (hardware design and all software) will be publicly and 
freely available.

-- 
Wesley J. Landaker <wjl at icecavern.net> <xmpp:wjl at
icecavern.net>
OpenPGP FP: 4135 2A3B 4726 ACC5 9094  0097 F0A9 8A4C 4CD6 E3D2
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: This is a digitally signed message part.
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20090222/4db71cb1/attachment.sig>

Eli Friedman

2009-Feb-23 00:06 UTC

head link

[LLVMdev] Creating an LLVM backend for a very small stack machine

On Sun, Feb 22, 2009 at 3:25 PM, Wesley J. Landaker <wjl at icecavern.net>
wrote:>  * Has anyone else out there targeted (or tried to target) a stack machine
> before? Was it successfull? What problems did you have?
Haven't done that, and I don't think there are any existing backends
like this.  It should be feasible, though; the backend code is pretty
flexible.
>  * What parts of the LLVM backend code generator infrastructure would be
> usable for targeting a stack machine? e.g. Is it even possible to use
> TableGen to target a stack machine?
You should be able to use existing LLVM backend code and TableGen at
least through instruction selection; I'm not sure whether you'll want
register allocation or not, but it should be easy to choose either
way.  The whole thing is quite flexible; see
LLVMTargetMachine::addCommonCodeGenPasses in
lib/CodeGen/LLVMTargetMachine.cpp for a high-level overview of how
CodeGen works.  It might also be useful to look at LLVM handles x87
floating-point; the relevant code is in
lib/Target/X86/X86FloatingPoint.cpp.
>  * When/where/how do things like big integer (iXXXXX), phi nodes, llvm.*
> instrincs get lowered; e.g. does my target have to do that, or is it done
> generically?
Aribitrary-width integers, vectors, llvm.*, etc. are lowered
generically by the Legalize infrastructure; the backend just has to
say what it can and can't support.  See
lib/Target/X86/X86ISelLowering.cpp for an example.  I don't know the
details of PHI nodes, but that's also taken care of by instruction
selection.

-Eli

Wesley J. Landaker

2009-Feb-23 01:43 UTC

head link

[LLVMdev] Creating an LLVM backend for a very small stack machine

On Sunday 22 February 2009 17:06:06 Eli Friedman wrote:> On Sun, Feb 22, 2009 at 3:25 PM, Wesley J. Landaker <wjl at
icecavern.net>
wrote:> >  * Has anyone else out there targeted (or tried to target) a stack
> > machine before? Was it successfull? What problems did you have?
>
> Haven't done that, and I don't think there are any existing
backends
> like this.  It should be feasible, though; the backend code is pretty
> flexible.
At the very least, there isn't anything in the LLVM instruction set that I 
think I would have any trouble lowering to the architecture ... but so far 
that's just on paper. ;)

I would love to see a Kalescope-like tutorial that goes step-by-step through 
making a backend. At the very least, I'll be documenting my adventure, so 
maybe once I know what I'm doing I can turn it into a tutorial.
> You should be able to use existing LLVM backend code and TableGen at
> least through instruction selection; I'm not sure whether you'll
want
> register allocation or not, but it should be easy to choose either
> way.  The whole thing is quite flexible; see
> LLVMTargetMachine::addCommonCodeGenPasses in
> lib/CodeGen/LLVMTargetMachine.cpp for a high-level overview of how
> CodeGen works.  It might also be useful to look at LLVM handles x87
> floating-point; the relevant code is in
> lib/Target/X86/X86FloatingPoint.cpp.
Thank you for the references, I'll have a look at those.

I've read quite a few papers on stack-machine-specific "register
allocation"
algorithms, but at least for the first pass, I want to make this as 
straightforward as possible. 

One thing I was considering was pretending I had multiple registers, and 
then when they actually get emitted I would thunk in code to do stack 
manipulations, hoping that my architecture specific peephole optimizer 
would be able to clean it up (or not, for the first cut).
> Aribitrary-width integers, vectors, llvm.*, etc. are lowered
> generically by the Legalize infrastructure; the backend just has to
> say what it can and can't support.  See
> lib/Target/X86/X86ISelLowering.cpp for an example.  I don't know the
> details of PHI nodes, but that's also taken care of by instruction
> selection.
Okay, I obviously need to learn more about the infrastructure here, but this 
at least sounds promising. I was worried that if I didn't have a register 
architecture that I'd have to reinvent the wheel in more places than it 
soudns like I will have to.

I'm sure I will be back with more questions once I seriously try starting a 
target.

-- 
Wesley J. Landaker <wjl at icecavern.net> <xmpp:wjl at
icecavern.net>
OpenPGP FP: 4135 2A3B 4726 ACC5 9094  0097 F0A9 8A4C 4CD6 E3D2
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: This is a digitally signed message part.
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20090222/051ef2c6/attachment.sig>

Mark Shannon

2009-Feb-23 10:23 UTC

head link

[LLVMdev] Creating an LLVM backend for a very small stack machine

Hi Wesley,

I've done quite a lot of work on register allocation for stack machines.
You might want to look at my papers:
http://www.dcs.gla.ac.uk/~marks/euroforth.pdf
http://www.dcs.gla.ac.uk/~marks/thesis.pdf

Good luck,
Mark.


-----Original Message-----
From: llvmdev-bounces at cs.uiuc.edu on behalf of Wesley J. Landaker
Sent: Sun 22/02/2009 23:25
To: llvmdev at cs.uiuc.edu
Subject: [LLVMdev] Creating an LLVM backend for a very small stack
machine[MESSAGE NOT SCANNED]
 
Hi folks,

I am interesting in creating an LLVM backend for a very small stack machine. 

Before I start in on the actual implementation, I'd like to make sure that 
I'm on the right track. Any comments, suggestions, warnings, tips, etc 
would be greatly appreciated.

Background
----------

There are a number of small as other embedded microprocessors that are often 
used in FPGAs such as Xilinx's Picoblaze, Lattice's Mico8, or Bernd 
Paysan's b16-small.

I am developping a very small stack machine primarily for use inside FPGAs 
and ASICs. This serves roughly the same niche as the above processors, 
however from my perpsective, my design has a number of advantages. For 
example, mine has a much smaller minimal configuration, has higher code 
density, is much more parameterizable, fully supports indirect branches, is 
FPGA-architecture independent, etc.

The main goal, however, is to not just have a shiny new architecture, but to 
have an entire optimized supporting toolchain. I am hoping to build on LLVM 
as infrastructure. I don't think I need to sell you all on the reasons why 
this would be handy.

I am already pretty "familiar" with LLVM, in that I have successfully
made
my own front-end DSL for another project by basically following the 
Kalidescope tutorial. I have a good grasp of the LLVM assembly language, 
and have a lot of notes (just a sanity-check paper excercize) about how I 
would lower just about every instruction into my architecture if I were 
doing it by hand. I've read through all the LLVM documentation available 
online, and (superficially) looked at a lot of the other backend targets. I 
have written custom assemblers and peephole optimizers outside of LLVM in 
the past. I am familiar academically with stack-machine specific "register 
allocation" optimizations. I'm basically ready to get started on an
LLVM
backend for my processor.

My main concerns are that in my searching online, I've gleaned a number of 
things that urge me to caution. I don't know how much of this is
"true",
but it's my impression after spending a lot of time googling:

  * A Picoblaze backend was apparently attempted both for LLVM and GCC, but 
apparently didn't go anywhere on either due to missing support for register 
indirect jumps, and possibly for other reasons. Of course my processor does 
support this, but ... 

  * There were a number of threads in various GCC lists and other places 
implying that, at least for GCC, targeting a stack machine would be very 
very difficult because of it's backend assumptions. Of course LLVM is not 
GCC, but TableGen seems register-biased ...

  * There seems to be a (to me) strange negative vibe whenever someone 
brings up targeting a compiler to a very small microprocessor, usually 
including an argument that it's not worth it. Of course, I'm the one
doing
the work, and I already have decided it's worth it ...

Anyway, hopefully that gives you an idea of where I'm coming from. Before I 
get too knee-deep into writing a backend, I have a few questions that I 
hope someone can help me with.

Questions
---------

  * Has anyone else out there targeted (or tried to target) a stack machine 
before? Was it successfull? What problems did you have? 

  * What parts of the LLVM backend code generator infrastructure would be 
usable for targeting a stack machine? e.g. Is it even possible to use 
TableGen to target a stack machine?

  * When/where/how do things like big integer (iXXXXX), phi nodes, llvm.* 
instrincs get lowered; e.g. does my target have to do that, or is it done 
generically?

Ultimtely, I'm wondering if targeting a stack machine with the current LLVM 
infrastructure is going to be somewhat straightforward even if it's not 
totally optimal (desirable), or if it's going to be so problematic that
I'd
be better off implementing an entire new code-generator myself 
(undesirable).

Any other comments or discussion is welcome.

All of my work (hardware design and all software) will be publicly and 
freely available.

-- 
Wesley J. Landaker <wjl at icecavern.net> <xmpp:wjl at
icecavern.net>
OpenPGP FP: 4135 2A3B 4726 ACC5 9094  0097 F0A9 8A4C 4CD6 E3D2

-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 5354 bytes
Desc: not available
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20090223/81f5f759/attachment.bin>

Wesley J. Landaker

2009-Feb-23 16:00 UTC

head link

[LLVMdev] Creating an LLVM backend for a very small stack machine

On Monday 23 February 2009 03:23:59 Mark Shannon wrote:> I've done quite a lot of work on register allocation for stack
machines.
> You might want to look at my papers:
> http://www.dcs.gla.ac.uk/~marks/euroforth.pdf
> http://www.dcs.gla.ac.uk/~marks/thesis.pdf
Hi Mark,

I've read your papers, and in fact they were part of the data that convinced
me that I really could go with a stack machine for the work I'm doing.
I'm
planning on implementing register allocation based partially on your paper.

Actually, I was wondering if any of your lcc implementation work was 
publicly available. 

-- 
Wesley J. Landaker <wjl at icecavern.net> <xmpp:wjl at
icecavern.net>
OpenPGP FP: 4135 2A3B 4726 ACC5 9094  0097 F0A9 8A4C 4CD6 E3D2
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: This is a digitally signed message part.
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20090223/5428d77f/attachment.sig>

Apparently Analagous Threads

Search for more reasonably related threads

llvm dev - Feb 2009 - [LLVMdev] Creating an LLVM backend for a very small stack machine

[LLVMdev] Creating an LLVM backend for a very small stack machine

[LLVMdev] Creating an LLVM backend for a very small stack machine

[LLVMdev] Creating an LLVM backend for a very small stack machine

[LLVMdev] Creating an LLVM backend for a very small stack machine

[LLVMdev] Creating an LLVM backend for a very small stack machine

Apparently Analagous Threads