Displaying 20 results from an estimated 4000 matches similar to: "silent frame detection"
2004 Aug 06
1
auto-detection of frame boundary
I tried feeding in the 3 encoded frame in ONE BLOCK, and calling speex_decode() 3 times in a roll. Only the 1st frames came out perfectly. For the other 2, I got "corrupt" frame warning.
I was supposed to get 38 bytes consumed each frame (narrow-band, VBR off). I tried speex_bits_remaining() to peek on the # of bits consumed, and got variable (clearly wrong)#s returned.
But if I
2004 Aug 06
1
reduction of noise due to high microphone gain
Hello,
With high microphone gain, I seem to have problem making the silence
detection work.
The speech detection works well for the rare dish sample, which has very
low noise amplitudes in silence regions. However, if the microphone
gain is set to really high, noise samples are taken as speech, as
indicated by the non-zero return value from speex_encode() calls.
I had VAD turned on.
2004 Aug 06
1
DTX and VAD doesn't work on one of my computers.
Hi there,
I enable DTX and VAD in speex_encode to detect whether there is silence in voice data. But the same programme got different result on my computers. The function speex_encode always return 38 on one of my computers although it is silence. On all other computers the function speex_encode return 1 when it is silence. It means DTX and VAD doesn't work on particular OS or PC. Do anyone
2004 Aug 06
2
DTX in speech
Hello,
I am trying to use the DTX feature of speex to tell if the user is
speaking. I have VBR, DTX and VAD turned on, but the function
speex_encode always returns a non-zero value. It is my understanding
that it should return a zero value when the encoder detects silence?
Could anyone give me any clues to help figure out why this isn't working
for me? I am using the 1.13 version of speex on
2004 Aug 06
1
auto-detection of frame boundary
Here's the situation: 3 frames are being transferred back to back over
RTP. The total payload is known (thus the TOTAL frame size). We'd like
to decode all 3.
According to IETF RTP Profile (appendix C, of Jean-Marc's Speex Codec
Manual, verison 1.0), "it's only necessary to consider the sequence of
bits as output by the Speex encoder, and present the same sequence to
2004 Aug 06
2
rgding VAD
hi,
How do i detect whether there is silence in media using speex?
Is there any API which decides that the audio data only contains
silence?
Basically i will have PCM linear data, I want to know whether it is
complete silence.
Is it that during encoding speex_bits_write will return a lesser
number if the audio at byte_ptr contains silence?
thanx,
Rahul
--
Rahul Arvind Jadhav
2004 Aug 06
3
Error in my code
Hi
I apologise if the following message is inappropriate for
this list. However, I feel this is the only place where I
can get some help with my problem.
BOOL CALLBACK RecordProc(HRECORD hRecord, void *buffer,
DWORD length, DWORD user)
{
try
{
long sent = 0;
long inputOffset = 0;
long outputOffset = 0;
long nbBytes = 0;
char *outputBuffer = NULL;
2004 Aug 06
2
reduction of noise due to high microphone gain
This works really well for white noise reduction. However what I've noticed was the amplitudes of normal speech samples also get reduced.
Is this something by design, or is there a way to automatically recover the original speech sample volumes ?
<p>Thanks.
<p>Tongbiao
<p>-----Original Message-----
From: Jean-Marc Valin [mailto:jean-marc.valin@hermes.usherb.ca]
Sent:
2004 Aug 06
1
low sampling rate for Wideband ?
Hello,
We currently have code for sampling at 8 KHz at device layer. In order
to have minimal code change while still doing wideband I am thinking of
packing two 20 ms frames of the low band data as input to Wideband mode
for encoding. Is this a feasible idea?
I actually tried it out. Speech came out at the decoding end, but I
could occasionally hear low pitched, high frequency
2004 Aug 06
2
--dtx alone does nothing?
I'm running a:
for band in n w u
do
for quality in 0 1 2 3 4 5 6 7 8 9 10
do
for complexity in 3
do
for vad in "" "--vad"
do
for dtx in "" "--dtx"
do
echo speexenc -${band} --quality ${quality} --comp ${complexity} ${vad} ${dtx} input.${band}.wav
2004 Aug 06
1
rgding VAD
On Tue, 2003-04-15 at 11:31, Jean-Marc Valin wrote:
> > How do i detect whether there is silence in media using speex?
> > Is there any API which decides that the audio data only contains
> > silence?
> > Basically i will have PCM linear data, I want to know whether it is
> > complete silence.
>
> Well, the best way is probably to turn VAD *and*
2004 Aug 06
1
DTX and VAD doesn't work on one of my computers.
Dear Jean,
Thank you so much for reply my message.
I am sorry the function return 38 is speex_bits_write(). This function doesn't work on one of my computers. But after I add the function speex_preprocess(), it works well. I am wondering what is the possible reason? What the function speex_preprocess() does?
Another problem I have now is when I enable dtx and vad with function
2004 Aug 06
1
Frozen upper spectrum in WB VBR CNG
Jean-Marc Valin (jean-marc.valin@hermes.usherb.ca) wrote:
>
> > I've been using Speex in my voice-over-IP program on Win32, in
> > wideband (16kHz) mode. I just starting using VBR recently and
> > have run into something that might be a problem within Speex:
>
> Are you turning on DTX in addition to VBR? Also, what version are you
> using. As of 1.0, DTX is
2004 Aug 06
1
Frozen upper spectrum in WB VBR CNG
Hi,
I've been using Speex in my voice-over-IP program on Win32, in
wideband (16kHz) mode. I just starting using VBR recently and
have run into something that might be a problem within Speex:
If someone hasn't spoken for a little while, and the bitrate drops
to very low, sometimes the high half of the spectrum becomes frozen
with a looping sound. The bottom half of the spectrum is
2004 Aug 06
1
Query r/e multiple decoder reset...
Hello! I should have asked this earlier, but... :)
We're returning to voice processing in our app now, in which we're trying
to achieve n-way full duplex voice in a 3d environment. Since Speex is a
single voice stream CODEC, we've been looking at having each client upload
its stream to the servers, which multiplex several spatially nearby active
channels back down to the
2004 Aug 06
5
reommended settings for low bitrate voicecom codec ?
Hello,
the voice-communication TeamSpeak (www.teamspeak.org) is currently testing a
version that supports speex codecs. The quality for high bitrates is quite
good.
BUT, the low-bandwidth speex codecs that are currently used arent very good.
What I did to find this out:
I comprared a speex AVB with 6.3 KBit/sec (total, overhead for packets and
stuff included) and the 6.3 Kbit/sec Celp Codec
2006 Jul 18
1
Echo Cancellation for silent echo signal
Hi Jean-Marc,
I am developing a voice chat for win32 using speex,when I do echo cancellation, after 2 seconds adaptation,
the echo canceller work very well,but after process silent(zero) echo signal(when enabled VAD,DTX),it work very bad,I
have to reset it, it seems adapt faster then do not rest it, so I always here echo produce by my fisrt two second
speech,how can I solve this problem?
2006 Sep 20
1
Echo Cancellation for silent echo signal
?
?
Hi Jean-Marc,
When speech is played in speaker, we are feeding that speech to echo canceller for cancelling its echo from captured data and the echo canceller work very well. But when there is no speech, we are feeding samples of value zero to speaker and the same to echo canceller. But microphone is always capturing something. But when speech is played again after these zeros(silent
2006 Jan 17
2
Recommended GUI for Speex
Hi all,
I have recently added Speex support to my app (http://sbooth.org/Max/) and
am in the process of creating a GUI for the user to control the codec
parameters. I am new to Speex and as such I am not really sure which
parameters are more important than others, and deserve prominent
placement, etc. Is there a recommended GUI for people to follow or use as
an example?
Stephen
2006 Oct 24
2
Does VAD/DTX work without VBR and Preprocessor
Hello,
I'm try to run speex on some ARM processor.
I'd like to cut away some speex features including VBR, ABR and preprocessor
while still supporting VAD/DTX.
But I've found some puzzles in the source code regarding VAD as below,
1) there are VAD related codes in the source file preprocess.c.
Will VAD still work If I don't use the source file preprocess.c?
2)The speex manual