thr3ads.net - flac dev - [flac-dev] Commonly getting FLAC__STREAM_ENCODER_VERIFY_MISMATCH_IN_AUDIO

If this information is useful, please help other people find it:
Share via:

Johnny Rosenberg

2013-Feb-09 11:13 UTC

[flac-dev] Commonly getting FLAC__STREAM_ENCODER_VERIFY_MISMATCH_IN_AUDIO_DATA on valid audio

2013/2/8 Erik de Castro Lopo <mle+la at
mega-nerd.com>:> Collin wrote:
>
>> It turns out it was an error on my part; sorry for the trouble. I
misunderstood
>> the format libflac was expecting my samples in. I found the wording in
the
>> documentation very confusing.
>>
>> My audio file has 2 channels and 16-bit samples, so I have to pull out
each
>> individual sample, push it into a FLAC__int32, and fix endian-ness
before I can
>> pass it to libflac.
>>
>> The documentation made me think that libflac just wanted me to shove my
2-byte
>> sample into a FLAC__int32. I wasn't aware that it wanted me to
extend the sign,
>> as well.
>
> At some stage it might make sense for us to make the FLAC API a little
> more user/programmer friendly. Something to match libsndfile's ease of
> use.
That would be great!
I was actually starting my small ?project? with libsndfile, but it was
done in a way that I didn't agree with, and since I only work with
FLAC files (mostly 24 bits) I decided to try to go with libflac
instead, which seems to let me do things the way I want more than
libsndfile.

For example, when loading an audiofile with libsndfile, all the
samples are converted to floating point numbers, which is what I want
anyway, but the conversion is done by dividing the integers from the
file with pow(2,bps-1), but when converting back, they multiply with
pow(2,bps-1)-1, so if you just read and then write, you end up with a
slightly lower volume (not that you can hear any difference if you do
it only a few times, but still, it isn't right, AND it is very easy to
do it right, so why don't they?)?


Johnny Rosenberg, the beginner
>
> Erik
> --
> ----------------------------------------------------------------------
> Erik de Castro Lopo
> http://www.mega-nerd.com/
> _______________________________________________
> flac-dev mailing list
> flac-dev at xiph.org
> http://lists.xiph.org/mailman/listinfo/flac-dev

Brian Willoughby

2013-Feb-09 12:28 UTC

head link

[flac-dev] Commonly getting FLAC__STREAM_ENCODER_VERIFY_MISMATCH_IN_AUDIO_DATA on valid audio

2013/2/8 Erik de Castro Lopo <mle+la at
mega-nerd.com>:> Collin wrote:
>
>> It turns out it was an error on my part; sorry for the trouble. I  
>> misunderstood
>> the format libflac was expecting my samples in. I found the  
>> wording in the
>> documentation very confusing.
>>
>> My audio file has 2 channels and 16-bit samples, so I have to pull  
>> out each
>> individual sample, push it into a FLAC__int32, and fix endian-ness  
>> before I can
>> pass it to libflac.
>>
>> The documentation made me think that libflac just wanted me to  
>> shove my 2-byte
>> sample into a FLAC__int32. I wasn't aware that it wanted me to  
>> extend the sign,
>> as well.
>
> At some stage it might make sense for us to make the FLAC API a little
> more user/programmer friendly.
I seem to recall that the FLAC library requires fixed point numbers  
that are right-justified in a 32-bit fixed-point 'long int', whereas  
the standard everywhere else seems to be left-justified. Left- 
justified makes more sense because it allows 16-bit, 20-bit, 24-bit,  
32-bit, and other depths to be more easily converted.

It's been a while since I needed to know this level of detail,  
because I wrote my own ObjC library on top of the C FLAC library  
since I didn't want to use the C++ FLAC library, and once I solved  
the justification of the bits I never had to think about it again.

However, for new programs being written to use the FLAC libraries, I  
totally agree that there should be a friendlier interface (assuming I  
haven't remembered all of this wrong).

Note that the old interface should be maintained for compatibility  
with existing software. The new, friendlier interface can simply use  
a slightly different (member) function name to access the same  
objects. There's no reason both can't be supported, or even float as  
well.

> Something to match libsndfile's ease of
> use.
I do not use libsndfile, so I wouldn't make this comparison.

On Feb 9, 2013, at 03:13, Johnny Rosenberg wrote:> That would be great!
> I was actually starting my small ?project? with libsndfile, but it was
> done in a way that I didn't agree with, and since I only work with
> FLAC files (mostly 24 bits) I decided to try to go with libflac
> instead, which seems to let me do things the way I want more than
> libsndfile.
>
> For example, when loading an audiofile with libsndfile, all the
> samples are converted to floating point numbers, which is what I want
> anyway, but the conversion is done by dividing the integers from the
> file with pow(2,bps-1), but when converting back, they multiply with
> pow(2,bps-1)-1, so if you just read and then write, you end up with a
> slightly lower volume (not that you can hear any difference if you do
> it only a few times, but still, it isn't right, AND it is very easy to
> do it right, so why don't they?)?
>
>
> Johnny Rosenberg, the beginner

For a beginner, you sure have the math right. Lossless conversion  
should always be handled via a factor with only a single significant  
digit, otherwise you're actually performing a sum. Multiplying by a  
factor with a single (binary) significant digit is equivalent to  
multiplying by 1. pow(2, N) is a valid conversion factor. pow(2, N)-1  
is not. Basically, converting a 16-bit sample using pow(2, 15)-1 is  
the same as summing 15 lossless conversions that happen to be one off  
from the proper lossless conversion.

The 'why' is that libsndfile hopes to solve a problem that doesn't  
always exist. Namely, that scientific synthesis of data in floating  
point often ranges from +1 to -1, but the fixed point output format  
of most sound files does not allow +1 (even though it does allow -1).  
This is due to the nature of twos-complement fixed-point numbers. I  
have no idea why libsndfile handles integer input to integer output  
in this lossy way, but that was a choice made by libsndfile that  
seems beyond the possibility of correcting. It would have seemed much  
simpler to require floating point data synthesis to apply the pow(2,  
N)-1 scaling only as needed, rather than apply it to all audio files  
(most of which come from fixed-point A/D convertors that don't need  
any scaling). In other words, libsndfile is really only correctly  
applicable to a very esoteric set of data which does not include most  
audio.

Regardless of libsndfile's architecture and design, libFLAC should by  
no means duplicate the incorrect conversion. There especially should  
not be an asymmetrical set of conversion factors.

All of Apple's CoreAudio tools use float without lossy conversion.  
The only requirement there is that synthesized data should avoid +1.0  
sample values unless clipping is acceptable or scaling is added  
before conversion.

Brian Willoughby
Sound Consulting

Declan Kelly

2013-Feb-09 15:27 UTC

head link

[flac-dev] Commonly getting FLAC__STREAM_ENCODER_VERIFY_MISMATCH_IN_AUDIO_DATA on valid audio

On Sat, Feb 09, 2013 at 04:28:57AM -0800, brianw at sounds.wa.com wrote:
> Regardless of libsndfile's architecture and design, libFLAC should by  
> no means duplicate the incorrect conversion. There especially should  
> not be an asymmetrical set of conversion factors.
+1 to that!

-- 
-Dec.
---
   (no microsoft products were used to create this message)

Erik de Castro Lopo

2013-Feb-09 20:50 UTC

head link

[flac-dev] Commonly getting FLAC__STREAM_ENCODER_VERIFY_MISMATCH_IN_AUDIO_DATA on valid audio

Johnny Rosenberg wrote:
> For example, when loading an audiofile with libsndfile, all the
> samples are converted to floating point numbers,
With libsndfile, converting to float is optional. You can just as
easily read int or short.
> which is what I want
> anyway, but the conversion is done by dividing the integers from the
> file with pow(2,bps-1), but when converting back, they multiply with
> pow(2,bps-1)-1, so if you just read and then write, you end up with a
> slightly lower volume (not that you can hear any difference if you do
> it only a few times, but still, it isn't right, AND it is very easy to
> do it right, so why don't they?)?
Firstly, you do realise that I am the main author and maintainer of
libsndfile, don't you?

Secondly, the scaling can be switched off don't you? See:

    http://www.mega-nerd.com/libsndfile/command.html#SFC_SET_NORM_DOUBLE

Finally, there are about 700 different way so convert between int
and float. I chose the one that I thought had provided the best
trade off. I stand by that decision.

Erik
-- 
----------------------------------------------------------------------
Erik de Castro Lopo
http://www.mega-nerd.com/

Johnny Rosenberg

2013-Feb-09 21:44 UTC

head link

[flac-dev] Commonly getting FLAC__STREAM_ENCODER_VERIFY_MISMATCH_IN_AUDIO_DATA on valid audio

2013/2/9 Erik de Castro Lopo <mle+la at
mega-nerd.com>:> Johnny Rosenberg wrote:
>
>> For example, when loading an audiofile with libsndfile, all the
>> samples are converted to floating point numbers,
>
> With libsndfile, converting to float is optional. You can just as
> easily read int or short.
>
>> which is what I want
>> anyway, but the conversion is done by dividing the integers from the
>> file with pow(2,bps-1), but when converting back, they multiply with
>> pow(2,bps-1)-1, so if you just read and then write, you end up with a
>> slightly lower volume (not that you can hear any difference if you do
>> it only a few times, but still, it isn't right, AND it is very easy
to
>> do it right, so why don't they?)?
>
> Firstly, you do realise that I am the main author and maintainer of
> libsndfile, don't you?
Yes I do, now that you mentioned it. I'm not sure why that matters in
a FLAC-dev mailing list, though.
I didn't know there was a maintainer anyway these days, since I didn't
get a reply when emailing about this a couple of years ago (or maybe
it was last year, I don't remember ? I sent it from one of my other
email addresses, since I use this one for mailing lists only).
>
> Secondly, the scaling can be switched off don't you? See:
>
>     http://www.mega-nerd.com/libsndfile/command.html#SFC_SET_NORM_DOUBLE
Thanks, I don't think I saw that page before, for some strange reason.
The sentence ?setting normalisation to SF_FALSE means that no scaling
will take place?, does this mean, for instance for a 24-bit file, that
the values will be doubles in the interval -8388608 to +8388607?
That information should probably be added to question 10 in your FAQ, I suppose.

For my purpose, not ?normalising? at all seems to be a good enough
idea, I think. I will calculate the levels from dB anyway, so I just
need to know how many bits per sample there is, which libsndfile
easily lets me know.
But I will try to do my thing with libflac first. If I fail making my
code clean, simple and readable (and working?), I might give
libsndfile another try.
>
> Finally, there are about 700 different way so convert between int
> and float. I chose the one that I thought had provided the best
> trade off. I stand by that decision.
Well, the good thing is that this SFC_SET_NORM_DOUBLE thing allows me
to do it the 701th way? Maybe I should convert to % or ??


Anyway, if I'm allowed to ask a libsndfile question here, is there a
special reason why you read sound files into one-dimensional arrays
instead of two-dimensional ones (such as AudioData[Channel][Sample])?
If so, what is that reason?


Thanks.

Kind regards

Johnny Rosenberg
????????????

>
> Erik
> --
> ----------------------------------------------------------------------
> Erik de Castro Lopo
> http://www.mega-nerd.com/
> _______________________________________________
> flac-dev mailing list
> flac-dev at xiph.org
> http://lists.xiph.org/mailman/listinfo/flac-dev

Brian Willoughby

2013-Feb-10 00:34 UTC

head link

[flac-dev] Commonly getting FLAC__STREAM_ENCODER_VERIFY_MISMATCH_IN_AUDIO_DATA on valid audio

Whatever enhancements occur for libFLAC, we certainly should not  
create a situation where some flag gives different results for one  
set of users versus another. Normalization of data should be separate  
from conversion of data. libFLAC should only handle conversion of  
data. If people want a method for normalizing their audio data, then  
they should use a DAW or at least a library like libsndfile.

To avoid bloat, libFLAC should merely handle conversion between FLAC  
compressed and uncompressed samples. Any potential problems where  
synthetic float waveform samples might clip during conversion to FLAC  
should be handled by a separate tool. In other words, the file should  
be processed before it is compressed.

There are industry standards for conversion whereby only exact powers  
of two are used. Anything else doesn't belong in libFLAC. Namely, pow 
(2,bps-1)-1 should not be used or allowed anywhere in libFLAC.

Brian Willoughby
Sound Consulting

On Feb 9, 2013, at 12:50, Erik de Castro Lopo wrote:> Secondly, the scaling can be switched off don't you? See:
>
>     http://www.mega-nerd.com/libsndfile/ 
> command.html#SFC_SET_NORM_DOUBLE

Reasonably Related Threads

Search for more maybe matching threads

flac dev - Feb 2013 - Commonly getting FLAC__STREAM_ENCODER_VERIFY_MISMATCH_IN_AUDIO_DATA on valid audio

[flac-dev] Commonly getting FLAC__STREAM_ENCODER_VERIFY_MISMATCH_IN_AUDIO_DATA on valid audio

[flac-dev] Commonly getting FLAC__STREAM_ENCODER_VERIFY_MISMATCH_IN_AUDIO_DATA on valid audio

[flac-dev] Commonly getting FLAC__STREAM_ENCODER_VERIFY_MISMATCH_IN_AUDIO_DATA on valid audio

[flac-dev] Commonly getting FLAC__STREAM_ENCODER_VERIFY_MISMATCH_IN_AUDIO_DATA on valid audio

[flac-dev] Commonly getting FLAC__STREAM_ENCODER_VERIFY_MISMATCH_IN_AUDIO_DATA on valid audio

[flac-dev] Commonly getting FLAC__STREAM_ENCODER_VERIFY_MISMATCH_IN_AUDIO_DATA on valid audio

Reasonably Related Threads