Displaying 4 results from an estimated 4 matches for "mod_input".
2013 Aug 09
2
[LLVMdev] [RFC] Poor code generation for paired load
...e the poor code generation for x86-64.
opt -sroa current_input.ll -S -o - | llc -O3 -o -
You will see 2 vmovd and 1 shrq that can be avoided as illustrated with the next command.
Here is nicer code, produced by modifying the input so that SROA generates friendlier code for this case.
opt -sroa mod_input.ll -S -o - | llc -O3 -o -
Basically, the difference between the two inputs is that memcpy has not been expanded in mod_input.ll (instcombine normally replaces it). Thus, SROA inserts its own loads to get rid of the memcpy instead of extracting the values from the 64-bit loads.
** Advice Required *...
2013 Aug 12
2
[LLVMdev] [RFC] Poor code generation for paired load
...- | llc -O3 -o -
>>
>> You will see 2 vmovd and 1 shrq that can be avoided as illustrated with the
>> next command.
>>
>> Here is nicer code produced by modifying the input so that SROA generates
>> friendlier code for this case.
>>
>> opt -sroa mod_input.ll -S -o - | llc -O3 -o -
>>
>> Basically the difference between both inputs is that memcpy has not been
>> expanded in mod_input.ll (instcombine normally replaces it). Thus, SROA
>> inserts its own loads to get rid of the memcpy instead of extracting the
>> values fr...
2013 Aug 10
0
[LLVMdev] [RFC] Poor code generation for paired load
...> opt -sroa current_input.ll -S -o - | llc -O3 -o -
>
> You will see 2 vmovd and 1 shrq that can be avoided as illustrated with the
> next command.
>
> Here is nicer code produced by modifying the input so that SROA generates
> friendlier code for this case.
>
> opt -sroa mod_input.ll -S -o - | llc -O3 -o -
>
> Basically the difference between both inputs is that memcpy has not been
> expanded in mod_input.ll (instcombine normally replaces it). Thus, SROA
> inserts its own loads to get rid of the memcpy instead of extracting the
> values from the 64-bit loads....
2013 Aug 12
0
[LLVMdev] [RFC] Poor code generation for paired load
...> opt -sroa current_input.ll -S -o - | llc -O3 -o -
>
> You will see 2 vmovd and 1 shrq that can be avoided as illustrated with the
> next command.
>
> Here is nicer code produced by modifying the input so that SROA generates
> friendlier code for this case.
>
> opt -sroa mod_input.ll -S -o - | llc -O3 -o -
>
> Basically the difference between both inputs is that memcpy has not been
> expanded in mod_input.ll (instcombine normally replaces it). Thus, SROA
> inserts its own loads to get rid of the memcpy instead of extracting the
> values from the 64-bit loads....