Displaying 7 results from an estimated 7 matches for "_mm_empty".
2020 Aug 31
2
Proposal to remove MMX support.
...ne asm can still have MMX
> operands/results/clobbers, and can still put the processor in MMX mode. If
> code is mixing MMX intrinsics and inline asm, there could be a significant
> penalty to moving values across register files. And it’s not clear what we
> want to do with _mm_empty(): under full emulation, it should be a no-op,
> but if there’s MMX asm, we need to actually clear the register file.
>
> Moving data between the register files in order to call an inline asm is
not a correctness issue, however, just a potential performance issue. The
compiler will inse...
2020 Aug 30
3
Proposal to remove MMX support.
I recently diagnosed a bug in someone else's software, which turned out to
be due to incorrect MMX intrinsics usage: if you use any of the x86
intrinsics that accept or return __m64 values, then you, the *programmer* are
required to call _mm_empty() before using any x87 floating point
instructions or leaving the function. I was aware that this was required at
the assembly-level, but not that the compiler forced users to deal with
this when using intrinsics.
This is a real nasty footgun -- if you get this wrong, your program
doesn't cras...
2008 Nov 26
0
[ANNOUNCE] pixman release 0.13.2 now available
...e current ARM code to ARM SIMD
Alan Coopersmith (1):
Define force_inline for compilers other than gcc & MS Visual C
Benjamin Otte (3):
force alignment of arg pointer in potential SSE users
update .gitignore
unswitch red and blue
David M?ller (1):
Move _mm_empty() to correct place
David Woodhouse (1):
Less fragile Linux altivec detection
Jeff Muizelaar (5):
Add support for ARMv6 SIMD fastpaths.
Fix inner branch code.
Cleanup inline definitions
[arm-simd] Fix typo found by ???
Unify scanline buffer types
Ke...
2005 Apr 19
0
mmx optimization
...p);
sign = _m_paddw (sign, cmp);
cmp = _m_pmaddwd (diff, sign);
sad = _m_paddd (sad, cmp);
return _m_to_int (sad) + _m_to_int (_m_psrlqi (sad, 32));
}
but it isn't faster. Does anyone of you have got a hint to make it faster?
I've got another question: why don't you call _mm_empty when you use
intrinsic asm?
Thank you and excuse me for the OT.
--
Ottavio Campana
Telecommunication Engineer
Lab. Immagini
Dept. of Information Engineering
University of Padova
Via Gradenigo 6/B
35131 Padova
Italy
2008 Jun 09
0
[ANNOUNCE] pixman release 0.11.2 now available
...Move PIXMAN_EXPORT into pixman-region.c
Move all the PIXMAN_EXPORT into .c files
Some formatting fixing
Add pixman_region32_copy_from_region16
Use 32 bit regions internally
Add pixman_image_set_clip_region32
Call _mm_empty() at the end of fbComposeSetupSSE
Fix bug in pixman_compute_composite_region()
Export pixman_compute_composite_region32() and use it in
walk_region
Add infinite loop test
Fix uses of short variables in pixman-region.c...
2015 Aug 01
0
[ANNOUNCE] pixman 0.33.2 release candidate now available
...to Mpx_per_sec()
lowlevel-blt-bench: add CSV output mode
lowlevel-blt-bench: add option to skip memcpy measurement
Siarhei Siamashka (3):
configure.ac: Check if the compiler supports GCC vector extensions
test: Fix OpenMP clauses for the tolerance-test
mmx: Fix _mm_empty problems for over_8888_8888/over_8888_n_8888
Søren Sandmann (14):
Post-release version bump to 0.33.1
demos/Makefile.am: Move EXTRA_DIST outside "if HAVE_GTK"
test/utils.c: Make the stack unaligned only on 32 bit Windows
general: Support component alpha for...
2016 Jan 31
0
[ANNOUNCE] pixman major release 0.34.0 now available
...utils.[ch]: add fence_get_page_size()
implementation: add PIXMAN_DISABLE=wholeops
armv6: enable over_n_8888
Siarhei Siamashka (5):
configure.ac: Check if the compiler supports GCC vector extensions
test: Fix OpenMP clauses for the tolerance-test
mmx: Fix _mm_empty problems for over_8888_8888/over_8888_n_8888
pixman-general: Fix stack related pointer arithmetic overflow
vmx: implement fast path vmx_composite_over_n_8888
Søren Sandmann (24):
Post-release version bump to 0.33.1
demos/Makefile.am: Move EXTRA_DIST outside "if HA...