Displaying 6 results from an estimated 6 matches for "sonorant".
2007 Jun 07
2
VAD Questions
...the source itself) that
explains how the latest VAD algorithm works?
- Is it possible to obtain the VAD status of a Speex stream
asynchronously? The current API seems to imply that some kind of
polling is required to determine the voice/non-voice status.
- Does the VAD algorithm implement syllabic/sonorant rate detection,
as has been implemented many times in analog circuitry, and is
described in this (and other) papers?
http://people.csail.mit.edu/jrg/2005/IS05_schutte.pdf
- Over what time period is VAD done? Is it done on a frame by frame
basis or over some longer period?
Thank you,
--
Larry Gada...
2007 Jun 08
2
VAD Questions
...cates whether the encoded data contains
speech or not? The API has a "get VAD status", but it seems like that
might only indicate whether VAD is currently enabled. Perhaps the VAD
status is contained somewhere in the data frames?
>
> > - Does the VAD algorithm implement syllabic/sonorant rate detection,
> > as has been implemented many times in analog circuitry, and is
> > described in this (and other) papers?
> > http://people.csail.mit.edu/jrg/2005/IS05_schutte.pdf
>
> As far as I understand, the paper you reference above isn't applicable
> to the p...
2007 Jun 07
0
VAD Questions
...sly? The current API seems to imply that some kind of
> polling is required to determine the voice/non-voice status.
Don't understand your question. Also which VAD are you talking about?
The one in the encoder or the one in the preprocessor?
> - Does the VAD algorithm implement syllabic/sonorant rate detection,
> as has been implemented many times in analog circuitry, and is
> described in this (and other) papers?
> http://people.csail.mit.edu/jrg/2005/IS05_schutte.pdf
As far as I understand, the paper you reference above isn't applicable
to the problem here. Basically, we ha...
2007 Jun 08
0
VAD Questions
...return value of either speex_encode() or speex_preprocess_run().
> Okay. What I was trying to determine was whether or not the speech
> detection was done with something more sophisticated than frame
> energy. As you said above, I'll have to look at the sources. For many
> systems, sonorant energy rate detection is used to detect voice, even
> under very poor SNR conditions.
I *do* use more than the frame energy. I use the pitch and (IIRC) one of
two other things. However, it's still *very* hard to do any sort of good
detection based only on 20 ms. Give me 1 second of latency...
2007 Jun 08
2
VAD Questions
...or speex_preprocess_run().
OK. Thanks.
>
> > Okay. What I was trying to determine was whether or not the speech
> > detection was done with something more sophisticated than frame
> > energy. As you said above, I'll have to look at the sources. For many
> > systems, sonorant energy rate detection is used to detect voice, even
> > under very poor SNR conditions.
>
> I *do* use more than the frame energy. I use the pitch and (IIRC) one of
> two other things. However, it's still *very* hard to do any sort of good
> detection based only on 20 ms. Give...
2004 Oct 10
1
Fw: Souscription - Newsletter Ogg Vorbis en français !
----- Original Message -----
From: Claude de Limelette de Belgique
To: tarkin-dev-request@lists.xiph.org
Cc: vorbis-dev-request@lists.xiph.org
Sent: Thursday, October 07, 2004 9:15 AM
Subject: Fw: Souscription - Newsletter Ogg Vorbis en fran?ais !
----- Original Message -----
From: Claude de Limelette de Belgique
To: tremor-request@xiph.org
Sent: Thursday, October 07, 2004 9:13 AM