thr3ads.net - flac dev - [flac-dev] Two questions about RG in flac [Jun 2014]

If this information is useful, please help other people find it:
Share via:

lvqcl

2014-Jun-03 14:45 UTC

[flac-dev] Two questions about RG in flac

1) to the author of test/test_replaygain.sh
There are 2 identical lines in this file: line 137 and next.
Is it intended or just a copy/paste error?


2) to ALL:
I attached a small program. Compile and run it.
* Does it work correctly when compiled with -O3 -msse2 options?
* If yes, does it work correctly when compiled with -O3 -funroll-loops -msse2
options?
   ( and what is the version of your GCC? )
-------------- next part --------------
A non-text attachment was scrubbed...
Name: test.zip
Type: application/zip
Size: 580 bytes
Desc: not available
Url :
http://lists.xiph.org/pipermail/flac-dev/attachments/20140603/87f6fcd1/attachment.zip

Robert Kausch

2014-Jun-03 15:34 UTC

head link

[flac-dev] Two questions about RG in flac

Am 03.06.2014 16:45, schrieb lvqcl:> 2) to ALL:
> I attached a small program. Compile and run it.
> * Does it work correctly when compiled with -O3 -msse2 options?
> * If yes, does it work correctly when compiled with -O3 -funroll-loops 
> -msse2 options?
>   ( and what is the version of your GCC? )Tested various versions of TDM-GCC on Windows.

32 bit executable produced with TDM-GCC 4.8.1 fails as soon as -O3 and 
SSE2 come together. SSE2 is enabled by -O3 here, so compiling with -O3 
is sufficient to trigger the bug. Compiling with -O3 -mno-sse2 produces 
a correctly working executable just as -O2 -msse2 does. -funroll-loops 
does not make any difference.

Same with TDM-GCC 4.4.1, 4.5.0, 4.6.1 and 4.7.1; only difference is that 
-O3 does not include SSE2 there, so it has to be enabled manually with 
-msse2 to trigger the problem.

TDM-GCC 4.3.2 produces a correctly working executable even with -O3 -msse2.

64 bit executables produced with any of the tested GCC versions work 
correctly in all cases.

Robert Kausch

2014-Jun-03 17:19 UTC

head link

[flac-dev] Two questions about RG in flac

Am 03.06.2014 16:45, schrieb lvqcl:> 2) to ALL:
> I attached a small program. Compile and run it.
> * Does it work correctly when compiled with -O3 -msse2 options?
> * If yes, does it work correctly when compiled with -O3 -funroll-loops 
> -msse2 options?
>   ( and what is the version of your GCC? )I further reduced the testcase (attached).

The bug only occurs if N >= 64; presumably the second loop is only SSE2 
optimized if that's the case.

The problem seems to be that sum is interpreted as a 64 bit value if 
SSE2 was used in the loop (the lower 32 bits of the result give the 
expected value). If sum is evaluated another time before or after (!) 
the printf, the problem goes away. For example, changing the last line 
to "return sum + 1;" lets the problem disappear.

I confirmed the bug with GCC 4.6.3 on Ubuntu. As on Windows, only 32 bit 
code generation is affected.

You should file a bug report with the GCC team.
-------------- next part --------------
#include <stdio.h>

#define N 64 /* problem is triggered only if N >= 64 */
unsigned A[N];

int main()
{
	unsigned i, sum = 0; /* both sum and A[] need to be unsigned for the bug to
happen */

	for (i = 0; i < N; i++) A[i] = 1;
	for (i = 0; i < N; i++) sum += A[i];

	printf("Sum = %f (should be equal to %i)\n", (float) sum, N);

	return 0;
}

Ozkan Sezer

2014-Jun-03 19:50 UTC

head link

[flac-dev] Two questions about RG in flac

On 6/3/14, Robert Kausch <robert.kausch at freac.org>
wrote:> Am 03.06.2014 16:45, schrieb lvqcl:
>> 2) to ALL:
>> I attached a small program. Compile and run it.
>> * Does it work correctly when compiled with -O3 -msse2 options?
>> * If yes, does it work correctly when compiled with -O3 -funroll-loops
>> -msse2 options?
>>   ( and what is the version of your GCC? )
> I further reduced the testcase (attached).
>
> The bug only occurs if N >= 64; presumably the second loop is only SSE2
> optimized if that's the case.
>
> The problem seems to be that sum is interpreted as a 64 bit value if
> SSE2 was used in the loop (the lower 32 bits of the result give the
> expected value). If sum is evaluated another time before or after (!)
> the printf, the problem goes away. For example, changing the last line
> to "return sum + 1;" lets the problem disappear.
>
> I confirmed the bug with GCC 4.6.3 on Ubuntu. As on Windows, only 32 bit
> code generation is affected.
>
> You should file a bug report with the GCC team.
>
With gcc-3,3,6, 3,4,6, 4.3.0 and gcc-4.9.1 (svn r210839) the output is
normal:
Sum = 64.000000 (should be equal to 64)

With gcc-4.8.3 (release version) it's broken:
Sum = 206158430272.000000 (should be equal to 64)

With clang-3.4.1 (compiled with gcc-4.8.3) the output is normal again.

This is on i686-linux (fedora9, glibc-2.8, kernel-2.6.27.35)

lvqcl

2014-Jun-06 14:57 UTC

head link

[flac-dev] Two questions about RG in flac

Robert Kausch wrote:
> The problem seems to be that sum is interpreted as a 64 bit value if
> SSE2 was used in the loop (the lower 32 bits of the result give the
> expected value). If sum is evaluated another time before or after (!)
> the printf, the problem goes away. For example, changing the last line
> to "return sum + 1;" lets the problem disappear.
>
> I confirmed the bug with GCC 4.6.3 on Ubuntu. As on Windows, only 32 bit
> code generation is affected.
Thank you for testing.
> You should file a bug report with the GCC team.
Done: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=61423

Seemingly Similar Threads

Search for more possibly parallel threads

flac dev - Jun 2014 - Two questions about RG in flac

[flac-dev] Two questions about RG in flac

[flac-dev] Two questions about RG in flac

[flac-dev] Two questions about RG in flac

[flac-dev] Two questions about RG in flac

[flac-dev] Two questions about RG in flac

Seemingly Similar Threads