Displaying 6 results from an estimated 6 matches for "bb0_3".
Did you mean:
lbb0_3
2010 Dec 14
2
[LLVMdev] Branch delay slots broken.
...bit the same bug, which is not surprising
since the code is very similar. If I compile code with this snippit:
while (n--)
*s++ = (char) c;
I get this (for the Microblaze):
swi r19, r1, 0
add r3, r0, r0
cmp r3, r3, r7
beqid r3, ($BB0_3)
brid ($BB0_1)
add r19, r1, r0
add r3, r5, r0
$BB0_2:
addi r4, r3, 1
addi r7, r7, -1
add r8, r0, r0
sbi r6, r3, 0
cmp r8, r8, r7
bneid r8, ($BB0_2)
brid ($...
2014 May 10
6
[LLVMdev] Replacing Platform Specific IR Codes with Generic Implementation and Introducing Macro Facilities
On 10 May 2014, at 13:53, Tim Northover <t.p.northover at gmail.com> wrote:
> It doesn't make sense for everything though, particularly if you want
> target-specific IR to simply not exist. What would you map ARM's
> "ldrex" to on x86?
This isn't a great example. Having load-linked / store-conditional in the IR would make a number of transforms related to
2017 Jul 01
2
KNL Assembly Code for Matrix Multiplication
...# %.preheader26
>>>>> # =>This Loop Header: Depth=1
>>>>> # Child Loop BB0_2 Depth 2
>>>>> # Child Loop BB0_3 Depth
>>>>> 3
>>>>> # Child Loop BB0_5 Depth
>>>>> 3
>>>>> xor r11d, r11d
>>>>> .p2align 4, 0x90
>>>>> .LBB0_2: # %.preheader
>&g...
2017 Dec 19
4
A code layout related side-effect introduced by rL318299
...-------------
~/workarea/llvm-r318298/dbuild/bin/opt -loop-rotate -S < b.ll
|~/workarea/llvm-r318298/dbuild/bin/llc
.cfi_startproc
# BB#0: # %entry
pushq %rax
.cfi_def_cfa_offset 16
movl $i, %eax
cmpq %rax, %rsi
ja .LBB0_5
# BB#1:
movl $i, %eax
.p2align 4, 0x90
.LBB0_3: # %while.body
# =>This Inner Loop Header: Depth=1
movq (%rdi), %rcx
movq %rcx, (%rsi)
movq 8(%rdi), %rcx
movq %rcx, (%rsi)
addq $6, %rsi
cmpq %rdx, %rsi
jae .LBB0_4
# BB#2: # %while.cond...
2016 Oct 12
4
[test-suite] making polybench/symm succeed with "-Ofast" and "-ffp-contract=on"
On Wed, Oct 12, 2016 at 10:53 AM, Hal Finkel <hfinkel at anl.gov> wrote:
> I don't think that Clang/LLVM uses it by default on x86_64. If you're using -Ofast, however, that would explain it. I recommend looking at -O3 vs -O0 and make sure those are the same. -Ofast enables -ffast-math, which can legitimately cause differences.
>
The following tests pass at "-O3" and
2017 Dec 19
2
A code layout related side-effect introduced by rL318299
...ild/bin/llc
>>
>> .cfi_startproc
>> # BB#0: # %entry
>> pushq %rax
>> .cfi_def_cfa_offset 16
>> movl $i, %eax
>> cmpq %rax, %rsi
>> ja .LBB0_5
>> # BB#1:
>> movl $i, %eax
>> .p2align 4, 0x90
>> .LBB0_3: # %while.body
>> # =>This Inner Loop Header:
>> Depth=1
>> movq (%rdi), %rcx
>> movq %rcx, (%rsi)
>> movq 8(%rdi), %rcx
>> movq %rcx, (%rsi)
>> addq $6, %rsi
>> cmpq %rdx, %rsi...