thr3ads.net - search: "speechdetected"

Displaying 4 results from an estimated 4 matches for "speechdetected".

2004 Aug 06

C++ wrapper for speex

...; int buflen = lpHdr->dwBufferLength; //160 float * buf = toFloat(lpHdr->lpData, buflen); // Encode some audio int nRet = enc.encode(buf, buflen); unsigned char *compressed = enc.getOutput(); int clen = enc.getOutputLength(); int speechDetected = enc.getSpeechDetected(); // delete temporary float buffer delete []buf; // Decode it int decodedSamples = dec.decode(compressed, clen); // Prepare decoded data for WAV BYTE pSound = new BYTE[decodedSamples * sizeof(short)]...

Kindly help me with VAD activation

2004 Aug 06

Kindly help me with VAD activation

...r, the VAD done by speex_preprocess() works much better. You will also get better results if you enable the denoiser, I think. peex_preprocess_ctl(preprocess, SPEEX_PREPROCESS_SET_DENOISE, &denoise_enabled); speex_preprocess_ctl(preprocess, SPEEX_PREPROCESS_SET_VAD, &vad_enabled); speechDetected = speex_preprocess(preprocess, floatbuf, NULL); Hope this helps, Tom Abhishek Parakh (abhishekp@ee.iitm.ac.in) wrote: > > Hello. > > I have generated a dll out of speex-1.1.4 using VC++6. I did not want all the > ogg/wave header in the encoded stream. So I have replaced to codes...

C++ wrapper for speex

2004 Aug 06

C++ wrapper for speex

...rt int[], buflen is multiple of frame size // (there is a getFrameSize() method for both classes) // Encode some audio tgAudioEncoder enc(16000); // you could do 8000 instead enc.enableVAD(1); enc.encode(buf, buflen); unsigned char *compressed = enc.getOutput(); int clen = enc.getOutputLen(); int speechDetected = enc.getSpeechDetected(); // Decode it tgAudioDecoder dec(16000); int decodedSamples = dec.decode(compressed, clen); memcpy(buf, dec.getOutput(), decodedSamples * sizeof(short int)); Anyway, that's the basic idea. I don't think it could get much easier. :) Now, this is only raw encodi...

C++ wrapper for speex

2004 Aug 06

C++ wrapper for speex

...aram; > > int buflen = lpHdr->dwBufferLength; //160 > float * buf = toFloat(lpHdr->lpData, buflen); > > // Encode some audio > int nRet = enc.encode(buf, buflen); > unsigned char *compressed = enc.getOutput(); > int clen = enc.getOutputLength(); > int speechDetected = enc.getSpeechDetected(); > > // delete temporary float buffer > delete []buf; > > // Decode it > int decodedSamples = dec.decode(compressed, clen); > > // Prepare decoded data for WAV > BYTE pSound = new BYTE[decodedSamples * > sizeof(short)]; >...

search for: speechdetected