thr3ads.net - Vorbis dev - [vorbis-dev] my vorbis comments [Sep 2000]

If this information is useful, please help other people find it:
Share via:

sbj@mit.edu

2000-Sep-13 07:51 UTC

[vorbis-dev] my vorbis comments

Ok I just finished writing a Vorbis encoder/decoder, and here are some
of my comments (I'm sure some of these are well known, but here they
are anyway - sorry this is long):

1. vorbis_info:
  a. Accessing the pre-computed info structs through pointers to
     static data is messy IMO.  I should be able to vorbis_info_init
     (or similar) an info struct to be exactly the same as info_A, for
     instance.  That way, I could vorbis_clear and free it at the end
     just like I do for vorbis_info's which come from a vorbis file
  b. A function which compares two vorbis_info structs, to see if they
     are equivalent (i.e. would produce the same output for a given
     input stream) This would be useful say if I am copying an input
     vorbis file (whose info is not info_[A-E]) to an output vorbis
     file with a specific info, I could determine then whether it is
     necessary to actually re-encode, or whether I could just copy
     packets.
  c. A way to cleanly write a file using the same info struct I pulled
     from an input vorbis file.  vorbiscomment.c, of course, has this
     same problem, and so I had to borrow the hack from there.
  d. People say the info_[A-E] has approximate bitrate X, but those
     structs don't have the bitrate_* fields set, what's up with that?
  e. The other stuff I'm _sure_ you folks are already working on, for
     example:
    i. A more general way to choose an info struct
2. vorbisfile:
  Write capability.
3. ogg_page:
  A way to compute page length.  In order to compute the page length,
  I've been adding header_len and body_len - which is currently
  correct - but it's not part of the spec as far as I can tell.
  ogg_sync_pageseek would tell me the page length, but if I should use
  that, then who needs ogg_sync_pageout?
4. analysis/synthesis:
  a. I wouldn't mind if vorbis didn't use separated channel buffers,
     but since I don't understand the compression details, they might
     be really important.
  b. I also wouldn't mind if there were vorbis_read_header
     /_write_header functions which handled the first 3 packets for
     me.
5. chained streams:
  a. The fact that vorbis allows different sampling-rate sounds to be
     chained together makes my head want to explode.

Finally, this last note could be a bug in my code, I have not finished
looking into it.  But I thought I'd mention it.  Feel free to ignore
from here on.

  When I read the first three packets (the headers) of a vorbis file,
  I expect that when I am finished reading the last packet, the offset
  into the stream (as computed by adding the page sizes together)
  should be the offset of the first sound page.  I don't know if
  that's spec, but it's what vorbisfile does, so I took it as gospel.
  However, when _my_ program does it, I end up needing 1 more page
  than I should for that last packet.  That is, I have to _pagein the
  first sound page before I can _packetout the last header packet.

  The only differences I have seen so far between vorbisfile and my
  program which are relevant to this problem is that vorbisfile uses
  ogg_sync_pageseek to get those pages, while I am using
  ogg_sync_pageout.

                               jamie

--- >8 ----
List archives:  http://www.xiph.org/archives/
Ogg project homepage: http://www.xiph.org/ogg/
To unsubscribe from this list, send a message to
'vorbis-dev-request@xiph.org'
containing only the word 'unsubscribe' in the body.  No subject is
needed.
Unsubscribe messages sent to the list will be ignored/filtered.

Michael Smith

2000-Sep-13 18:52 UTC

head link

[vorbis-dev] my vorbis comments

At 11:51 PM 9/13/00 +0900, you wrote:>
>Ok I just finished writing a Vorbis encoder/decoder, and here are some
>of my comments (I'm sure some of these are well known, but here they
>are anyway - sorry this is long):
>
>1. vorbis_info:
>  a. Accessing the pre-computed info structs through pointers to
>     static data is messy IMO.  I should be able to vorbis_info_init
>     (or similar) an info struct to be exactly the same as info_A, for
>     instance.  That way, I could vorbis_clear and free it at the end
>     just like I do for vorbis_info's which come from a vorbis file
There's going to be a bunch of API additions related to mode selection -
the current method is basically just a temporary hack. This applies to a
number of your other encode-side comments below.
>  b. A function which compares two vorbis_info structs, to see if they
>     are equivalent (i.e. would produce the same output for a given
>     input stream) This would be useful say if I am copying an input
>     vorbis file (whose info is not info_[A-E]) to an output vorbis
>     file with a specific info, I could determine then whether it is
>     necessary to actually re-encode, or whether I could just copy
>     packets.
Is this really useful? If you're doing anything which potentially involves
re-encoding, the chance of having an identical output info struct is almost
zero. Also, if you're encoding, the output info struct contains more
information than the one you've just read from a file - so even if the
common elements are the same, it's possible that the result will be quite
different.
>  c. A way to cleanly write a file using the same info struct I pulled
>     from an input vorbis file.  vorbiscomment.c, of course, has this
>     same problem, and so I had to borrow the hack from there.
Yes. This basically needs the header reading/writing functions to be split
up (or at least that's the straightforward way). Vorbiscomment would
ideally not be using ANY vorbis_* functions (just the ogg functions) except
the comment functions.
>  d. People say the info_[A-E] has approximate bitrate X, but those
>     structs don't have the bitrate_* fields set, what's up with
that?
The bitrate_* fields are for CBR (and constrained VBR). So they aren't used
yet (you'll see that the library doesn't use them anywhere, except to
write
and read them). The approximate bitrates are a function of the
psychoacoustic sets and the codebooks used, at the moment. There's no
simple way to figure out what they are just from looking at them, which is
why the headers say what they are.
>  e. The other stuff I'm _sure_ you folks are already working on, for
>     example:
>    i. A more general way to choose an info struct
Yes, as mentioned above.
>2. vorbisfile:
>  Write capability.
I think there's a encode-side equivalent to vorbisfile intended (but not
incorporated into vorbisfile itself).
>3. ogg_page:
>  A way to compute page length.  In order to compute the page length,
>  I've been adding header_len and body_len - which is currently
>  correct - but it's not part of the spec as far as I can tell.
>  ogg_sync_pageseek would tell me the page length, but if I should use
>  that, then who needs ogg_sync_pageout?
You don't, really. ogg_sync_pageout() is easier to use if you lose sync at
some point (it internally calls ogg_sync_pageseek() anyway.)
Perhaps ogg_sync_pageout() should return total page size instead of 1 in
case of success?
>4. analysis/synthesis:
>  a. I wouldn't mind if vorbis didn't use separated channel buffers,
>     but since I don't understand the compression details, they might
>     be really important.
Not entirely sure what you mean, here. Can you clarify? If you're talking
about how vorbis uses seperate buffers for each channel, rather than a
single buffer with the data interleaved, then there's a good reason for it
- it makes the internals a lot nicer. Would be very hard to change.
>  b. I also wouldn't mind if there were vorbis_read_header
>     /_write_header functions which handled the first 3 packets for
>     me.
I agree, the header bits of the api need changes.
>5. chained streams:
>  a. The fact that vorbis allows different sampling-rate sounds to be
>     chained together makes my head want to explode.
You should just treat chained streams as a set of completely independent
streams (which they are). So the different sample-rates shouldn't be a
problem.  I believe mp3 allows sample rate changes mid-stream (though I may
be wrong here, and I don't know of anything that correctly supports this) -
so merely having chained streams that can be completely different isn't
nearly as bad...
>
>Finally, this last note could be a bug in my code, I have not finished
>looking into it.  But I thought I'd mention it.  Feel free to ignore
>from here on.
>
>  When I read the first three packets (the headers) of a vorbis file,
>  I expect that when I am finished reading the last packet, the offset
>  into the stream (as computed by adding the page sizes together)
>  should be the offset of the first sound page.  I don't know if
>  that's spec, but it's what vorbisfile does, so I took it as
gospel.
>  However, when _my_ program does it, I end up needing 1 more page
>  than I should for that last packet.  That is, I have to _pagein the
>  first sound page before I can _packetout the last header packet.
I'm not sure I understand what you've been doing here, but it sounds
like
you're expecting the current page to end after the last header (such that
the audio starts on a new page). This indeed should be the case (after some
discussion a week or so back). 

However, if you based your encoder off any of the sample code more than a
week ago, then it wouldn't be doing this (flushing pages after the
headers). You should make this change, but you might also want to deal
correctly with files which don't do this.

Michael

--- >8 ----
List archives:  http://www.xiph.org/archives/
Ogg project homepage: http://www.xiph.org/ogg/
To unsubscribe from this list, send a message to
'vorbis-dev-request@xiph.org'
containing only the word 'unsubscribe' in the body.  No subject is
needed.
Unsubscribe messages sent to the list will be ignored/filtered.

Apparently Analagous Threads

Search for more possibly parallel threads

Vorbis dev - Sep 2000 - my vorbis comments

[vorbis-dev] my vorbis comments

[vorbis-dev] my vorbis comments

Apparently Analagous Threads