Andrew Friedley
2010-Apr-02 19:34 UTC
[LLVMdev] Problem returning aggregate struct (complex number) by value
I'm trying to return an aggregate structure (a complex number, really) containing two 32-bit floats or two 64-bit doubles, by value. LLVM 2.6 compiled as 64-bit on OSX 10.6/x86-64. The IR I generate for a simple test case looks like this (float case): %0 = type { float, float } define %0 @test600() { entry: ret %0 { float 4.200000e+01, float 3.500000e+01 } } Running that through llc, the x86-64 assembly looks like this (abbreviated): movss LCPI72_0(%rip), %xmm0 movss LCPI72_1(%rip), %xmm1 ret Now, if I write a C function that does the same thing: struct complex_float { float real; float imag; }; static struct complex_float foo() { struct complex_float x = {42.0, 35.0}; return x; } The assembly code looks like this (compiled with GCC and disassembled by GDB): 0x00000001000010ac <foo+0>: push %rbp 0x00000001000010ad <foo+1>: mov %rsp,%rbp 0x00000001000010b0 <foo+4>: mov $0x42280000,%eax 0x00000001000010b5 <foo+9>: mov %eax,-0x10(%rbp) 0x00000001000010b8 <foo+12>: mov $0x420c0000,%eax 0x00000001000010bd <foo+17>: mov %eax,-0xc(%rbp) 0x00000001000010c0 <foo+20>: mov -0x10(%rbp),%rax 0x00000001000010c4 <foo+24>: movd %rax,%xmm0 0x00000001000010c9 <foo+29>: leaveq 0x00000001000010ca <foo+30>: retq Here the two values are returned 'packed' in xmm0, while LLVM returns them separately in xmm0 and xmm1. This is causing problems when calling the function generated by LLVM from C (or my real use case, via libFFI). What happens is that I get the 'real' value in the struct back with the right value, but the 'imag' value is wrong -- always comes back as zero. What makes this more confusing is that if you switch the IR and C examples to use double instead of float, everything works -- the C disassembly appears to use xmm1 for the second value just like LLVM. Furthermore, I can do the same experiment with 32- and 64-bit int as my struct element types. Behavior mirrors that of float and double -- 32-bit does not work right, while 64-bit does. Am I doing something wrong here, or is this an issue in LLVM? Is there something I can do to work around this issue and get the proper behavior? Thanks, Andrew
Duncan Sands
2010-Apr-02 20:03 UTC
[LLVMdev] Problem returning aggregate struct (complex number) by value
Hi Andrew,> %0 = type { float, float } > > define %0 @test600() { > entry: > ret %0 { float 4.200000e+01, float 3.500000e+01 } > } > > > Running that through llc, the x86-64 assembly looks like this (abbreviated):unfortunately the x86-64 ABI says that complex numbers are *not* passed the same as a struct containing two floats. Your LLVM IR is not ABI conformant and thus will not interact properly with ABI conformant code, like that produced by llvm-gcc. Ciao, Duncan.
Andrew Friedley
2010-Apr-02 20:12 UTC
[LLVMdev] Problem returning aggregate struct (complex number) by value
Duncan Sands wrote:> Hi Andrew, > >> %0 = type { float, float } >> >> define %0 @test600() { >> entry: >> ret %0 { float 4.200000e+01, float 3.500000e+01 } >> } >> >> >> Running that through llc, the x86-64 assembly looks like this (abbreviated): > > unfortunately the x86-64 ABI says that complex numbers are *not* passed the > same as a struct containing two floats. Your LLVM IR is not ABI conformant > and thus will not interact properly with ABI conformant code, like that produced > by llvm-gcc.Thanks for the response. I'm aware of that, but dont think I quite understand why that affects me here -- I'm using structs in all cases (although as an abstraction for a complex number), not actual C99 complex. Does LLVM just assume that I really mean C99 complex when I'm working with a struct of two floats? Even so this doesn't answer my question, of what can I do to make this work? :) Andrew
Possibly Parallel Threads
- [LLVMdev] Problem returning aggregate struct (complex number) by value
- [LLVMdev] Problem returning aggregate struct (complex number) by value
- [LLVMdev] Loop unrolling opportunity in SPEC's libquantum with profile info
- [LLVMdev] Using llc to compile a .ll into a .s for ARM
- Identify c-style pointer to array