similar to: Opus VAD in 1.3 (and Music/Speech detection)


2005 Jun 22
1
Speech detection in preprocessor with echo
agc_gain seemed to fit with the idea of what I wanted to do, it was easy to understand its units and behavior, and freezing it produced the desired results. Also I wanted to cap it, so that's done at the same place, and that definitely works. All I want to do is be able to freeze AGC adaptation and put an upper bound on the AGC (for example, 2x amplification). Both of these things seem
2006 Jul 03
1
speex vad could process music?
I want to transmit music as 32 kHz audio, but the Speex VAD is designed for speech. What can I do to change it for music/noise activity detection?
2017 Sep 05
0
Suggest exposing VAD info. from opus frame
Hi. I would like to contribute to the project by exposing VAD information from an Opus frame. I think the VAD flag is very useful in jitter buffer management, and it can also be used for end-point detection in voice recognition. As you know, there is no VAD flag in CELT mode, so my proposal assumes that all frames are active signal in CELT mode. It works only on a single packet. In multiple
2012 Mar 09
0
Generating comfort noise with preprocessor VAD
Hello, I am trying to use the preprocessor VAD to encode at a lower bitrate during silence periods. I am able to run the preprocessor and get the VAD flag for each frame, and I am quite happy with its performance. I would like to know how to pass the preprocessor VAD flag to the Speex encoder -- basically, I want to force the encoder to generate comfort noise when the preprocessor detects silence.
2017 Jun 20
0
[EXTERNAL] Re: Submitting a patch that exposes VAD voiced/unvoiced signal type
Hi Jean-Marc, We're exposing the opus_internal_flags data structure so that we can access the value assigned to prevSignalType. Here's a snippet of our code: error = opus_encoder_get_internal_flags(vad->opus, &internalflags); if (error != OPUS_OK) { return OPUSVAD_OPUS_ERROR; } cur_signal_type = internalflags.prevSignalType; if
2007 Jun 08
2
VAD Questions
Hello Jean-Marc et al: On 07/06/07, Jean-Marc Valin <jean-marc.valin@usherbrooke.ca> wrote: > > - Is there a reference somewhere (other than the source itself) that > > explains how the latest VAD algorithm works? > > Read the source, Luke :-) (sorry) Okay. I had to ask :-) > > > - Is it possible to obtain the VAD status of a Speex stream > >
2018 Sep 27
0
Opus 1.2.1 crash on silk/VAD.c:315
Hi Dmitry, So it's not explicitly in your report, but it looks like the crash is due to a divide-by-zero at: min_coef = silk_DIV32_16( silk_int16_MAX, silk_RSHIFT( psSilk_VAD->counter, 4 ) + 1 ); which happens because counter is -16 (which means (-16 >> 4) + 1 == 0). Now, this could be caused by an integer wrap-around, but it should only happen after encoding around
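The arithmetic described in the message above can be checked in isolation. This is a minimal sketch, not the SILK source: `vad_divisor` is a hypothetical helper standing in for `silk_RSHIFT( psSilk_VAD->counter, 4 ) + 1`, and it assumes that `>>` on a negative value is an arithmetic shift, as it is on common compilers.

```c
#include <stdint.h>

/* Hypothetical stand-in for the divisor expression quoted in the message.
 * Assumption: >> on a negative int32_t is an arithmetic (sign-extending)
 * shift, which the C standard leaves implementation-defined. */
int32_t vad_divisor(int32_t counter)
{
    return (counter >> 4) + 1;
}

/* Returns 1 when the divisor would be zero, 0 otherwise. */
int divisor_is_zero(int32_t counter)
{
    return vad_divisor(counter) == 0;
}
```

Under that assumption, `vad_divisor(-16)` is 0, matching the message, and so is every counter value from -16 to -1. Non-negative counters always give a divisor of at least 1.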
2004 Aug 06
0
Kindly help me with VAD activation
Hi, I had the same experience. The VAD done in the encoder does not seem to work very well at all. However, the VAD done by speex_preprocess() works much better. You will also get better results if you enable the denoiser, I think. speex_preprocess_ctl(preprocess, SPEEX_PREPROCESS_SET_DENOISE, &denoise_enabled); speex_preprocess_ctl(preprocess, SPEEX_PREPROCESS_SET_VAD,
2005 Jun 20
1
Speech detection in preprocessor with echo
I think you'll have to modify Speex to get the functionality you're looking for. I've made a few simple modifications to the AGC so that 1) it cannot exceed a specified level of amplification and 2) adaptation can be enabled and disabled, so I can freeze it at a certain level while speech is not detected. It's mostly just a matter of doing this at the end of speex_compute_agc():
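The poster's actual change is cut off in this excerpt. As a rough illustration of the two behaviours described (an upper bound on gain and a switch that freezes adaptation), here is a minimal sketch. The `AgcLimit` struct and `agc_apply_limit` function are hypothetical names invented for this example and are not part of Speex.

```c
/* Hypothetical state for illustration only; these are not Speex fields. */
typedef struct {
    float gain;      /* gain currently applied */
    float max_gain;  /* upper bound, e.g. 2.0f for 2x amplification */
    int   frozen;    /* non-zero: ignore newly computed gains */
} AgcLimit;

/* Takes the gain the AGC would like to use next and returns the gain
 * that should actually be applied after freezing and capping. */
float agc_apply_limit(AgcLimit *st, float proposed_gain)
{
    if (!st->frozen) {
        st->gain = proposed_gain;   /* adapt only while not frozen */
    }
    if (st->gain > st->max_gain) {
        st->gain = st->max_gain;    /* enforce the upper bound */
    }
    return st->gain;
}
```

In this sketch a caller would set `frozen` while no speech is detected and clear it again when speech resumes, so the gain holds its last capped value during silence.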
2017 Nov 27
0
Reg an issue with smoothing factor in VAD implementation
Sorry, long holiday weekend in America. I can say with pretty high certainty that there is an overflow occurring and it is flipping smooth_coef_Q16 to be negative when it probably shouldn't be. I had originally thought it was only an issue where it was overflowing the 15th bit but not the 16th, which might still preserve the intended value for operations that ignore the sign bit (in cases
2017 Nov 22
0
Reg an issue with smoothing factor in VAD implementation
Yes, yes, I can reproduce it now, but only on platforms that define a 16-bit int by default (SA_Q15 is an opus_int rather than opus_int32). What system are you compiling this for? On Tue, Nov 21, 2017 at 8:34 PM, Chandrakala Madhira < chandrakala.madhira at soctronics.com> wrote: > Hi Logan, > > Please find attached the input stream we are using for testing. > > Thank you, >
2007 Jun 08
2
VAD Questions
Hello Jean-Marc: On 08/06/07, Jean-Marc Valin <jean-marc.valin@usherbrooke.ca> wrote: > > Either one. The question is: If we treat the software like a black > > box, and we feed in PCM audio, we get Speex encoded data out. Where is > > the information that indicates whether the encoded data contains > > speech or not? The API has a "get VAD status", but it
2004 Aug 06
0
Kindly help me with VAD activation
Hello. I have generated a DLL out of speex-1.1.4 using VC++6. I did not want all the Ogg/WAVE headers in the encoded stream, so I have replaced the code of speexenc/dec with that of sampleenc/dec from the manual. The DLL now accepts the raw input, encodes it and returns the encoded data. I also pass on nbBytes for each frame encoded to the decoder and no other information. I initialise the
2007 Jan 08
1
VAD in preprocessor
Hi all and Jean-Marc, Firstly, thanks a lot for your help; I have at last managed to use speex-1.2beta1 in my speech application. I am trying to understand the Speex source code and improve speech quality. The problem in my application (as many people on the list say) is noise. So I've added VAD in the preprocessor as follows. The VAD in the preprocessor is very good; there is no noise outside speech. I
2006 Dec 29
0
VAD in preprocessor
Hi all and Jean-Marc, Firstly, thanks a lot for your help; I have at last managed to use speex-1.2beta1 in my speech application. I am trying to understand the Speex source code and improve speech quality. The problem in my application (as many people on the list say) is noise. So I've added VAD in the preprocessor as follows: int param_enabled = 1; int param_disabled = 0; int sampling_rate
2009 Mar 17
1
VAD speex 1.2rc1
My data is speech sampled as mono PCM at 32 kHz. I want to use the VAD to regulate the AGC gain mechanism. So... if voice is not detected I just reduce the AGC value in order not to amplify the 'silence'; otherwise we just increase the AGC value if it is below the minimum. Thanks, Daniel ----- Original Message ----- From: Jean-Marc Valin <jean-marc.valin at usherbrooke.ca> To: danflu at
2007 Jun 08
0
VAD Questions
> Either one. The question is: If we treat the software like a black > box, and we feed in PCM audio, we get Speex encoded data out. Where is > the information that indicates whether the encoded data contains > speech or not? The API has a "get VAD status", but it seems like that > might only indicate whether VAD is currently enabled. Perhaps the VAD > status is
2006 Dec 25
1
A case including VAD bug
Hi all: I have been analysing the Speex code these days and have learned much from it. Thank you so much! But I have a question about VAD: after preprocessing (denoise and AGC off, VAD on), the input PCM speech was seriously distorted. The input speech includes ten syllables (one to ten), but the output speech lost one! Can you tell me why? Please see the attachment about input and output: (See attached
2007 Jun 07
0
VAD Questions
> - Is there a reference somewhere (other than the source itself) that > explains how the latest VAD algorithm works? Read the source, Luke :-) (sorry) > - Is it possible to obtain the VAD status of a Speex stream > asynchronously? The current API seems to imply that some kind of > polling is required to determine the voice/non-voice status. Don't understand your question.
2018 Sep 27
1
[Re:] Re: Opus 1.2.1 crash on silk/VAD.c:315
Hi Jean-Marc, the gdb output is "Program terminated with signal 8, Arithmetic exception.", so most likely it is this division by zero. You're right, the crash reproduces at seq number 4294967265 (20 ms RTP packets). This is about 994 days. "Jean-Marc Valin" <jmvalin at jmvalin.ca> wrote: Hi Dmitry, > >So it's not explicitly in your report, but it looks like the crash
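The "about 994 days" figure follows from the packet count and the 20 ms packet duration given in the message. A minimal sketch of that conversion, with `packets_to_days` as a hypothetical helper name:

```c
#include <stdint.h>

/* Convert a packet count into elapsed days, given the duration of one
 * packet in milliseconds. Hypothetical helper for illustration. */
double packets_to_days(uint64_t packets, uint32_t packet_ms)
{
    const double ms_per_day = 24.0 * 60.0 * 60.0 * 1000.0;
    return (double)packets * (double)packet_ms / ms_per_day;
}
```

Calling `packets_to_days(4294967265ULL, 20)` gives a value between 994 and 995 days. The sequence number itself is 31 short of 2^32.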