similar to: VAD levels and offsets.

Displaying 20 results from an estimated 20000 matches similar to: "VAD levels and offsets."

2005 Jun 27
1
VAD encoder sensitivity?
Hi, When using the encoder function speex_encode_int(st, in_short, &bits) with VBR disabled and VAD enabled, the function always returns 1. Also, when DTX is enabled along with VAD, the function returns 1 most of the time, even during silence. Is there a way to change the sensitivity of the VAD by changing some parameters in the source code, so when it is silence, the
2008 Apr 16
2
VAD CPU usage
Hi Jean-Marc, I am using Speex in my VoIP app (wideband mode). I have noticed that when VAD kicks in, CPU usage increases considerably (bitrate decreases to 4 kbps). It takes about 7 ms to encode one frame versus 0.5 ms (for speech). This happens regardless of whether the preprocessor is used. Although the issue seems to be driver dependent, it occurs on XP, Vista and Mac OS X. Besides complexity and quality, this
2007 Jun 08
2
VAD Questions
Hello Jean-Marc et al:

On 07/06/07, Jean-Marc Valin <jean-marc.valin@usherbrooke.ca> wrote:
> > - Is there a reference somewhere (other than the source itself) that
> > explains how the latest VAD algorithm works?
>
> Read the source, Luke :-) (sorry)

Okay. I had to ask :-)

> > - Is it possible to obtain the VAD status of a Speex stream
> >
2007 Jun 07
2
VAD Questions
Hello all: I am interested in using Speex for an application that streams audio from a (noisy) source, so I am interested in VAD and DTX operation. However, after browsing the archives of this list, I note that a number of people have not been satisfied with the operation of the VAD algorithm in Speex. This leads me to a few questions: - Is there a reference somewhere (other than the source
2004 Aug 06
1
rgding VAD
On Tue, 2003-04-15 at 11:31, Jean-Marc Valin wrote:
> > How do i detect whether there is silence in media using speex?
> > Is there any API which decides that the audio data only contains
> > silence?
> > Basically i will have PCM linear data, I want to know whether it is
> > complete silence.
>
> Well, the best way is probably to turn VAD *and*
2011 Aug 29
2
Speex VAD always returning 1
I have been trying to understand how to get the VAD algorithm working. I sent an input stream of all zeros into the preprocessor but still got a return value of 1 indicating that speech was detected. Is this feature not available with the latest release? I thought at the very least it would detect this as silence and return 0 but that does not seem to be the case. Does anyone have any
2007 Jan 08
1
VAD in preprocessor
Hi all and Jean-Marc, Firstly, thanks a lot for your help; I have finally managed to use speex-1.2beta1 in my speech application. I am trying to understand the Speex source code and improve speech quality. The problem in my application (as many people on the list say) is noise. So I've added VAD in the preprocessor as follows. VAD in the preprocessor works very well; there is no noise outside speech. I
2011 Aug 29
3
Speex VAD always returning 1
From 1.2rc1 source code:

preprocess.c:

   /* FIXME: This VAD is a kludge */
   st->speech_prob = Pframe;
   if (st->vad_enabled)
   {
      if (st->speech_prob > st->speech_prob_start || (st->was_speech && st->speech_prob > st->speech_prob_continue))
      {
         st->was_speech=1;
         return 1;
      } else
      {
         st->was_speech=0;
         return 0;
      }
   } else {
      return 1;
   }

AND case
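The two thresholds in the preprocess.c snippet quoted above act as a hysteresis: a frame must exceed speech_prob_start to enter the speech state, but only needs to stay above the lower speech_prob_continue to remain in it. Below is a minimal, self-contained sketch of just that decision rule. The VadHysteresis struct, the vad_decide function and the threshold values used in the usage note are hypothetical and for illustration only; they are not the Speex API, and the sketch omits the vad_enabled check (the quoted code returns 1 whenever VAD is disabled).

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical state for illustration. Field names mirror the quoted
   snippet, but this is not Speex's SpeexPreprocessState. */
typedef struct {
    int   was_speech;           /* decision for the previous frame (0 or 1) */
    float speech_prob_start;    /* threshold to enter the speech state */
    float speech_prob_continue; /* lower threshold to stay in the speech state */
} VadHysteresis;

/* Classify one frame from its speech probability.
   Returns 1 for speech, 0 for non-speech, and remembers the decision
   so the next frame can use the lower "continue" threshold. */
static int vad_decide(VadHysteresis *st, float speech_prob)
{
    assert(st != NULL);
    if (speech_prob > st->speech_prob_start ||
        (st->was_speech && speech_prob > st->speech_prob_continue)) {
        st->was_speech = 1;
    } else {
        st->was_speech = 0;
    }
    return st->was_speech;
}
```

With assumed thresholds of 0.35 (start) and 0.20 (continue), a frame with probability 0.25 is rejected when the previous frame was non-speech but accepted when the previous frame was speech, which is what keeps the decision from flickering at the edges of an utterance.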
2009 Mar 17
1
VAD speex 1.2rc1
My data is speech sampled as PCM mono at 32 kHz. I want to use VAD to regulate the AGC gain mechanism. So... if voice is not detected, I just reduce the AGC value in order not to amplify the 'silence'; otherwise, we just increase the AGC value if it is below the minimum. Thanks, Daniel ----- Original Message ----- From: Jean-Marc Valin <jean-marc.valin at usherbrooke.ca> To: danflu at
2006 Dec 25
1
A case including VAD bug
Hi all: I have been analyzing the Speex code these days and have learned much from it. Thank you so much! But I have a question about VAD: after preprocessing (denoise and AGC are off, VAD is on), the input PCM speech was seriously distorted. The input speech includes ten syllables (one to ten), but the output speech lost one! Can you tell me why? Please see the attachment for the input and output: (See attached
2017 Nov 27
3
Reg an issue with smoothing factor in VAD implementation
Hi, Can anyone let me know if this is a bug? Thank you, Chandrakala ----- Original Message ----- From: "Logan Stromberg" <loganstromberg at gmail.com> To: "Chandrakala Madhira" <chandrakala.madhira at soctronics.com> Cc: opus at xiph.org Sent: Wednesday, November 22, 2017 12:12:39 PM Subject: Re: [opus] Reg an issue with smoothing factor in VAD
2018 Feb 16
1
Reg an issue with smoothing factor in VAD implementation
Hi Chandrakala, Logan, Can you confirm that the attached patch fixes the overflow problem? Koen, can you confirm the fix makes sense? Cheers, Jean-Marc On 11/27/2017 12:10 PM, Logan Stromberg wrote: > Sorry, long holiday weekend in America. > I can say with pretty high certainty that there is an overflow occurring > and it is flipping smooth_coef_Q16 to be negative when it probably
2005 Jun 22
1
Speech detection in preprocessor with echo
agc_gain seemed to fit with the idea of what I wanted to do, it was easy to understand its units and behavior, and freezing it produced the desired results. Also I wanted to cap it, so that's done at the same place, and that definitely works. All I want to do is be able to freeze AGC adaptation and put an upper bound on the AGC (for example, 2x amplification). Both of these things seem
2013 Jan 16
1
Newbie question on VAD
Hi, I'm interested in speech recognition and it occurred to me to try to use VAD to automatically detect the start of speech prior to feeding the stream to the recognition engine. Speex has a preprocessor capability which presumably could be used for this, but I did not see anything similar in my quick perusal of the Opus docs. First, is this a dumb idea for some reason? If not, should I just use
2017 Nov 20
4
Reg an issue with smoothing factor in VAD implementation
Just for fun, I tried to reproduce such an overflow. I turned on all debug macros, assertions, and checked arithmetic and then encoded 2 hours of mixed speech/audio with these parameters:
   Sample rate = 48000
   Channels = 1
   Application = OPUS_APPLICATION_AUDIO
   Bitrate = 24 KB/s
   Force Mode = MODE_SILK_ONLY
   Signal Type = OPUS_SIGNAL_AUTO
   Complexity = 10
   Frame size = 480 samples (10ms)
No errors came
2017 Jun 16
2
[EXTERNAL] Re: Submitting a patch that exposes VAD voiced/unvoiced signal type
Hi Peter, Can you say a little bit more about what you're doing exactly with the information you're exposing and how? Unfortunately, I don't have a concrete proposal in mind right now. That's in part because I don't quite understand the use case, but also because it's really hard to expose this kind of information in a way that both avoids breaking application with new
2004 Aug 06
2
rgding VAD
hi, How do i detect whether there is silence in media using speex? Is there any API which decides that the audio data only contains silence? Basically i will have PCM linear data, I want to know whether it is complete silence. Is it that during encoding speex_bits_write will return a lesser number if the audio at byte_ptr contains silence? thanx, Rahul -- Rahul Arvind Jadhav
2004 Aug 06
2
Memory leak in denoiser + a few questions
On Mar 28, 2004, at 8:23 PM, Jean-Marc Valin wrote:
>> The st->zeta pointer isn't freed in the
>> speex_preprocess_state_destroy()
>> function of the preprocess.c file (alloced in line 167). It's in
>> Speex 1.1.4
>> by the way.
>
> Oops... Thanks for letting me know. I'll change that for the next
> release (in the mean time, the fix is
2004 Aug 06
1
reduction of noise due to high microphone gain
Hello, With high microphone gain, I seem to have problem making the silence detection work. The speech detection works well for the rare dish sample, which has very low noise amplitudes in silence regions. However, if the microphone gain is set to really high, noise samples are taken as speech, as indicated by the non-zero return value from speex_encode() calls. I had VAD turned on.
2017 Jun 07
2
Submitting a patch that exposes VAD voiced/unvoiced signal type
Hi Peter, There are two main issues with a patch like the one you're proposing. First, the data is only valid when SILK is being used and is essentially undefined in CELT mode. The second issue is that exposing internals makes it impossible to improve these algorithms, since doing so would break API compatibility. I'm not fundamentally against trying to expose some information, but