This patch adds new SSE code: it's simpler but not faster so this new version is disabled by default. Maybe it will be faster on newer CPUs though... -------------- next part -------------- A non-text attachment was scrubbed... Name: predictor_intrin.patch Type: application/octet-stream Size: 5165 bytes Desc: not available Url : http://lists.xiph.org/pipermail/flac-dev/attachments/20140920/4ae37c9c/attachment-0001.obj
lvqcl wrote:> This patch adds new SSE code: it's simpler but not faster so > this new version is disabled by default. Maybe it will be faster > on newer CPUs though...I'm a little sceptical about adding code that by default is not being compiled. Applied anyway. Cheers, Erik -- ---------------------------------------------------------------------- Erik de Castro Lopo http://www.mega-nerd.com/
Erik de Castro Lopo wrote:>> This patch adds new SSE code: it's simpler but not faster so >> this new version is disabled by default. Maybe it will be faster >> on newer CPUs though... > > I'm a little sceptical about adding code that by default is not > being compiled.FLAC__fixed_compute_residual() has the following code: #if 1 /* OPT: may be faster with some compilers on some systems */ residual[i] = data[i] - (data[i-1] << 1) + data[i-2]; #else residual[i] = data[i] - 2*data[i-1] + data[i-2]; #endif So I thought that adding a different algorithm to calculate err1 variable won't hurt. ...I just noticed that I forgot to write something explanatory after "[PATCH 2/4]" in the subject of the original message.