thr3ads.net - flac dev - [flac-dev] AVX2 / 3DNow. [Sep 2014]

If this information is useful, please help other people find it:
Share via:

lvqcl

2014-Sep-30 18:57 UTC

[flac-dev] AVX2 / 3DNow.

It is relatively easy to convert some SSE2/3/4 code into AVX2: just
use AVX2 intrinsics instead of SSE and the logic of the functions.
Unfortunately my CPU doesn't have AVX2. But today I managed to briefly
test AVX2 code on i5 Haswell CPU. Unfortunately I wasn't able to run
full test suite on Haswell, but it seems that the new code works correctly.
The results of a quick performance test are:

16-bit WAV encoding: ~20% speed increase
24-bit WAV encoding: ~40% speed increase

The speed increase isn't impressive for 16-bit input...
and this code requires Haswell. But it's still some
speed improvement that will cost another increase of
the size of executable files (by 20-30 kB).

What do you think?


Also the new code requires AVX CPU/OS support detection code to be added
to cpu.c I'd like to simplify it slightly further before this. For example,
by removing 3DNow code because it's hardly relevant these days.

Erik de Castro Lopo

2014-Oct-01 20:30 UTC

head link

[flac-dev] AVX2 / 3DNow.

lvqcl wrote:
> Also the new code requires AVX CPU/OS support detection code to be added
> to cpu.c I'd like to simplify it slightly further before this. For
example,
> by removing 3DNow code because it's hardly relevant these days.
>
> What do you think?
I'd be willing to accept a clean set of patches that support this. I
even have a machine that seems to support AVX2.

Cheers,
Erik
-- 
----------------------------------------------------------------------
Erik de Castro Lopo
http://www.mega-nerd.com/

Reasonably Related Threads

Search for more maybe matching threads

flac dev - Sep 2014 - AVX2 / 3DNow.

[flac-dev] AVX2 / 3DNow.

[flac-dev] AVX2 / 3DNow.

Reasonably Related Threads