search for: packuswb

Displaying 8 results from an estimated 8 matches for "packuswb".

2009 Jun 09
3
[LLVMdev] [Patch] Fix SSE2 packing intrinsics return type
Hi all, Please consider committing the attached patch. I believe the SSE2 packsswb, packssdw and packuswb intrinsics have an incorrect return type. Thanks, Nicolas -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20090609/85ed0b5e/attachment.html> -------------- next part -------------- A non-text attach...
2009 Jun 10
0
[LLVMdev] [Patch] Fix SSE2 packing intrinsics return type
On Tue, Jun 9, 2009 at 2:58 PM, Nicolas Capens<nicolas at capens.net> wrote: > Please consider committing the attached patch. I believe the SSE2 packsswb, > packssdw and packuswb intrinsics have an incorrect return type. If we really wanted to do this, an AutoUpgrade patch would be necessary for backwards-compatibility. I'm not sure it's worth bothering. -Eli
2009 Jun 10
1
[LLVMdev] [Patch] Fix SSE2 packing intrinsics return type
On Jun 9, 2009, at 5:56 PM, Eli Friedman wrote: > On Tue, Jun 9, 2009 at 2:58 PM, Nicolas Capens<nicolas at capens.net> > wrote: >> Please consider committing the attached patch. I believe the SSE2 >> packsswb, >> packssdw and packuswb intrinsics have an incorrect return type. > > If we really wanted to do this, an AutoUpgrade patch would be > necessary for backwards-compatibility. I'm not sure it's worth > bothering. Did they work before with llvm-gcc? If not, then don't worry about backward compatib...
2005 Aug 17
2
MMX loop filter for theora-exp
...ddw %%mm7,%%mm5\n" /* (_pix[_ystride]+f); */ +"movq %%mm1,%%mm2\n" +"punpcklbw %%mm0,%%mm1\n" +"punpckhbw %%mm0,%%mm2\n" //[ystride*2] +"psubw %%mm6,%%mm1\n" /* (_pix[_ystride*2]-f); */ +"psubw %%mm7,%%mm2\n" /* (_pix[_ystride*2]-f); */ +"packuswb %%mm2,%%mm1\n" +"packuswb %%mm5,%%mm4\n" +"movq %%mm1,(%0,%1,2)\n" /* _pix[_ystride*2]= */ +"movq %%mm4,(%0,%1)\n" /* _pix[_ystride]= */ +"emms\n" +: +: "r" (_pix), "r" (_ystride), "r" (_bv) +: "esi", "edi&qu...
2005 Mar 23
0
[PATCH]
...put values */ +" decl %%ecx \n\t" /* dec counter */ +" paddsw %%mm3, %%mm1 \n\t" /* add+128 and saturate to 16bit */ +" lea 0x10(%1), %1 \n\t" /*_residuo+16 */ +" paddsw %%mm2, %%mm0 \n\t" /* add+128 and saturate to 16bit */ +" packuswb %%mm1, %%mm0 \n\t" /* pack saturate with next(high) four values */ +" movq %%mm0, (%0) \n\t" /* writeback */ +" lea (%0,%2), %0 \n\t" /*_dst+_dst_ystride */ +" jns 1b \n\t" /* loop */ + : "+r" (_dst) + : "r" (_residue),...
2009 Jun 10
1
[LLVMdev] [Patch] Fix SSE2 packing intrinsics return type
...ni 2009 2:57 To: LLVM Developers Mailing List Subject: Re: [LLVMdev] [Patch] Fix SSE2 packing intrinsics return type On Tue, Jun 9, 2009 at 2:58 PM, Nicolas Capens<nicolas at capens.net> wrote: > Please consider committing the attached patch. I believe the SSE2 packsswb, > packssdw and packuswb intrinsics have an incorrect return type. If we really wanted to do this, an AutoUpgrade patch would be necessary for backwards-compatibility. I'm not sure it's worth bothering. -Eli _______________________________________________ LLVM Developers mailing list LLVMdev at cs.uiuc.edu...
2005 Mar 23
3
[PATCH] promised MMX patches rc1
...put values */ +" decl %%ecx \n\t" /* dec counter */ +" paddsw %%mm3, %%mm1 \n\t" /* add+128 and saturate to 16bit */ +" lea 0x10(%1), %1 \n\t" /*_residuo+16 */ +" paddsw %%mm2, %%mm0 \n\t" /* add+128 and saturate to 16bit */ +" packuswb %%mm1, %%mm0 \n\t" /* pack saturate with next(high) four values */ +" movq %%mm0, (%0) \n\t" /* writeback */ +" lea (%0,%2), %0 \n\t" /*_dst+_dst_ystride */ +" jns 1b \n\t" /* loop */ + : "+r" (_dst) + : "r" (_residue),...
2004 Aug 24
5
MMX/mmxext optimisations
quite some speed improvement indeed. attached the updated patch to apply to svn/trunk. j -------------- next part -------------- A non-text attachment was scrubbed... Name: theora-mmx.patch.gz Type: application/x-gzip Size: 8648 bytes Desc: not available Url : http://lists.xiph.org/pipermail/theora-dev/attachments/20040824/5a5f2731/theora-mmx.patch-0001.bin