Displaying 3 results from an estimated 3 matches for "vxorpd".
Did you mean:
vxorps
2014 Sep 19
4
[LLVMdev] Please benchmark new x86 vector shuffle lowering, planning to make it the default very soon!
...reproducible.
3. When zero extending 2 packed 32-bit integers, we should try to
emit a vpmovzxdq
Example:
vmovq 20(%rbx), %xmm0
vpshufd $80, %xmm0, %xmm0 # %xmm0 = %xmm0[0,0,1,1]
Before:
vpmovzxdq 20(%rbx), %xmm0
4. We no longer emit a simpler 'vmovq' in the following case:
vxorpd %xmm4, %xmm4, %xmm4
vblendpd $2, %xmm4, %xmm2, %xmm4 # %xmm4 = %xmm2[0],%xmm4[1]
Before, we used to generate:
vmovq %xmm2, %xmm4
Before, the vmovq implicitly zero-extended to 128 bits the quadword in
%xmm2. Now we always do this with a vxorpd+vblendps.
As I said, I will try to create small...
2014 Mar 26
3
[LLVMdev] [cfe-dev] computing a conservatively rounded square of a double
...0000e+00
> .text
> .globl _Z21inspect_singleton_sqrd
> .align 16, 0x90
> .type _Z21inspect_singleton_sqrd, at function
> _Z21inspect_singleton_sqrd: # @_Z21inspect_singleton_sqrd
> .cfi_startproc
> # BB#0:
> vmulsd %xmm0, %xmm0, %xmm1
> vxorpd .LCPI1_0(%rip), %xmm1, %xmm0
> ret
> .Ltmp1:
> .size _Z21inspect_singleton_sqrd, .Ltmp1-_Z21inspect_singleton_sqrd
> .cfi_endproc
>
> I realize this is unsupported behavior, but it would be nice to still
> be able to use clang to do numerical computation. Is there a...
2014 Sep 10
13
[LLVMdev] Please benchmark new x86 vector shuffle lowering, planning to make it the default very soon!
On Tue, Sep 9, 2014 at 11:39 PM, Chandler Carruth <chandlerc at google.com> wrote:
> Awesome, thanks for all the information!
>
> See below:
>
> On Tue, Sep 9, 2014 at 6:13 AM, Andrea Di Biagio <andrea.dibiagio at gmail.com>
> wrote:
>>
>> You have already mentioned how the new shuffle lowering is missing
>> some features; for example, you explicitly