Fraser Cormack via llvm-dev
2020-Dec-16 16:25 UTC
[llvm-dev] Understanding an issue with indirect split CCVals (RISC-V)
Hi,
I have found a bug when passing fixed-length vectors to/from functions in
RISC-V. I'm a bit rusty on the CC support in general (let alone on those for
RISC-V) so I was hoping for some pointers on how to fix it.
Take this program:
declare i32 @foo(<4 x i8>)
define i32 @bar(<4 x i8> %v) {
%r = call i32 @foo(<4 x i8> %v)
ret i32 %r
}
And compile with for RV64:
$> llc -mtriple riscv64 bug.ll -o - -stop-after finalize-isel
stack:
- { id: 0, name: '', type: default, offset: 0, size: 4, alignment: 4,
stack-id: default, callee-saved-register: '',
callee-saved-restored: true,
debug-info-variable: '', debug-info-expression: '',
debug-info-location: '' }
SD killed %4, %stack.0, 24 :: (store 8 into %stack.0)
SD killed %3, %stack.0, 16 :: (store 8 into %stack.0)
SD killed %2, %stack.0, 8 :: (store 8 into %stack.0)
SD killed %1, %stack.0, 0 :: (store 8 into %stack.0)
%5:gpr = ADDI %stack.0, 0
$x10 = COPY %5
PseudoCALL target-flags(riscv-plt) @foo, csr_ilp32_lp64, implicit-def dead
$x1, implicit $x10, implicit-def $x2, implicit-def $x10
You'll notice that the v4i8 parameter has been split into 4 8-byte registers
and passed indirectly through the stack as a 32-byte object, but the temporary
stack location has been created according to v4i8, whose store size and
alignment are both 4 bytes. This clobbers the stack and will produce misaligned
stores.
Should we be storing these with four SDs but to a 32-byte sized and 8-byte
aligned location, or should they be truncating SB stores? I couldn't find
the answer in the RISC-V ABI docs.
Regardless of what we should be doing, I couldn't see that LLVM would
currently handle either particularly well. For truncating stores, I couldn't
see a way of getting at what would be the "IntermediateVT" (e.g. i8)
through ISD::InputArg and ISD::OutputArg in the backend. For correcting the size
& alignment of the stack location, I'm not convinced the issue is only
in the RISC-V backend; in SelectionDAGISel::LowerArguments it looks as though
it's possible for PartBase to get out of sync when NumValues != 1 &&
VT.getStoreSize() != NumRegs * RegisterVT.getStoreSize(). Is that documented?
Any pointers on where to take this would be appreciated.
Thanks,
Fraser Cormack
Senior Software Engineer, Compilers
Codeplay Software Ltd
Level C, Argyle House
3 Lady Lawson St
Edinburgh EH3 9DR
Tel: +44 (0)131 466 0503
Website: http://www.codeplay.com
Twitter: https://twitter.com/codeplaysoft
This email and any attachments may contain confidential and /or privileged
information and is for use by the addressee only. If you are not the intended
recipient, please notify Codeplay Software Ltd immediately and delete the
message from your computer. You may not copy or forward it, or use or disclose
its contents to any other person. Any views or other information in this message
which do not relate to our business are not authorized by Codeplay software Ltd,
nor does this message form part of any contract unless so stated.
As internet communications are capable of data corruption Codeplay Software Ltd
does not accept any responsibility for any changes made to this message after it
was sent. Please note that Codeplay Software Ltd does not accept any liability
or responsibility for viruses and it is your responsibility to scan any
attachments.
Company registered in England and Wales, number: 04567874
Registered office: Regent House, 316 Beulah Hill, London, United Kingdom, SE19
3HF
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20201216/83c0829c/attachment.html>