thr3ads.net - search: "phonemes"

Displaying 19 results from an estimated 19 matches for "phonemes".

2 questions, frame size and SPEEX_GET_LOOKAHEAD

2006 Oct 31

...issues are not a concern at this stage. On Oct 31, 2006, at 3:40 PM, Andras Kadinger wrote: > [At the risk of educating you about something you might already know] > > Natural speech in most human languages gradually changes from one > phoneme to the next. > > Concatenating phonemes together from a fixed, prerecorded, > unflexible set would give rise to abrupt changes between them (both > in phoneme quality and in pitch), and thus make the resulting speech > hard to understand and/or uncomfortable to listen to. > > Most flexible (unlimited vocabulary), uni...

2 questions, frame size and SPEEX_GET_LOOKAHEAD

2006 Oct 31

Ok, let me first explain why 5ms matters, even if they are 0's, in my particular application. I am working on a speech synthesis system. The basic idea is concatenating pre-recorded phonemes or words into longer sentences. So any missing or extra samples, even as short as 5~10ms, cause very noticeable discontinuities. I want to use speex to compress/decompress that pre-recorded material. But I'm concerned that extra 0's might be padded at both ends. For th...

2 questions, frame size and SPEEX_GET_LOOKAHEAD

2006 Oct 31

[At the risk of educating you about something you might already know] Natural speech in most human languages gradually changes from one phoneme to the next. Concatenating phonemes together from a fixed, prerecorded, unflexible set would give rise to abrupt changes between them (both in phoneme quality and in pitch), and thus make the resulting speech hard to understand and/or uncomfortable to listen to. Most flexible (unlimited vocabulary), unit (e.g. "phoneme"...

de-essing into speex?

2004 Aug 06

are there plans to add "de-essing" as part of the speexenc procedure, to make it possible to use lower quality compression without getting those horrible computer-like "ess" sounds in the finished result? like if you say "someone said the sun is shining", there is a lot of ess sounds, and these will sound "computer-ish" at vbr qualities below 9. i know

de-essing into speex?

2004 Aug 06

...nt mixing and encoding of speex streams (VoIP phone lines) less effective and more costly in a resource sense. It is a good idea, though I would consider it a luxury filter, that's just me being overly assertive. || \/ If anyone is interested, from my knowledge of speech recognition all human phonemes, when converted from power vs. time to power vs. freq, exhibit 2 characteristic spikes. The primary spike defines the base for recognizing the phoneme and the next highest spike's relative location and power give a program a good probability match as to which phoneme it is. Humanizing spx audio deriv...
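The power vs. time to power vs. frequency conversion mentioned in this post can be sketched roughly as follows. This is only an illustration on a synthetic two-tone frame, not real speech and not code from Speex or any speech recognizer. The helper names `power_spectrum` and `top_peaks`, the 8000 Hz sample rate and the 500/1500 Hz test tones are all assumptions made for the example.

```python
import cmath
import math


def power_spectrum(frame, sample_rate):
    """Return (frequency_hz, power) pairs for the non-negative DFT bins."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        # Naive DFT: correlate the frame with a complex sinusoid at bin k.
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        spectrum.append((k * sample_rate / n, abs(acc) ** 2))
    return spectrum


def top_peaks(spectrum, count=2):
    """Pick the `count` strongest local maxima of the power spectrum."""
    peaks = [
        spectrum[i]
        for i in range(1, len(spectrum) - 1)
        if spectrum[i][1] > spectrum[i - 1][1]
        and spectrum[i][1] >= spectrum[i + 1][1]
    ]
    return sorted(peaks, key=lambda p: p[1], reverse=True)[:count]


# Synthetic 20 ms frame (assumption): a strong 500 Hz tone plus a weaker
# 1500 Hz tone, standing in for the "two spikes" the post describes.
sample_rate = 8000
frame = [
    math.sin(2 * math.pi * 500 * t / sample_rate)
    + 0.5 * math.sin(2 * math.pi * 1500 * t / sample_rate)
    for t in range(160)
]
peaks = top_peaks(power_spectrum(frame, sample_rate))
peak_frequencies = [round(freq) for freq, _ in peaks]
```

Real speech frames are far less tidy than this test signal, so a practical system would normally window the frame and smooth the spectrum before looking for peaks.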

skype and SIP hardware for linux

2006 Nov 05

I'm looking at the <http://support.a-link.com/phonemate/IPU1.htm> phone because it works with Skype (from Linux), but can do SIP, too. Not necessarily asterisk related, but possibly. My networking situation might require IAX if I'm running Linux and want to use SIP, I'm not certain (Skype works fine). Putting that unknown aside for the moment, how does this phone work under

de-essing into speex?

2004 Aug 06

...phone > lines) less effective and more costly in a resource sense. > > It is a good idea, though I would consider a luxury filter, that's > just me being overly assertive. > || > \/ > > If anyone is interested from my knowledge of speech recognition all > human phonemes when converted from power vs. time to power vs. freq > exhibit 2 characteristic spikes. The primary spike defines the base > for recognizing the phoneme and the next highest spike's relative > location and power give a program a good probability match as to > which phoneme it is. > &...

de-essing into speex?

2004 Aug 06

...re costly in a resource sense. > > > > It is a good idea, though I would consider a luxury filter, that's > > just me being overly assertive. > > || > > \/ > > > > If anyone is interested from my knowledge of speech recognition all > > human phonemes when converted from power vs. time to power vs. freq > > exhibit 2 characteristic spikes. The primary spike defines the base > > for recognizing the phoneme and the next highest spike's relative > > location and power give a program a good probability match as to > > which ph...

(m)simtest ?

2006 Feb 16

Hi, We have 2 values (first formant F1, second formant F2) for a given phoneme for six languages. We want to see whether the languages are significantly different from one another for this given phoneme. We have done a manova on our data and it works well, but it doesn't allow us to see which pairs of languages are different. If we had only one formant for the phoneme, we would use

German sound files available

2004 May 09

Hi there, today I made the German language prompts available for download: http://www.karl.aegee.org/asterisk.nsf/HT/sound-de Be aware: Asterisk doesn't yet fully support languages other than English, there are still (smaller) issues with voicemail and date/time announcements that require a patch. Cheers, Philipp

Getting at the LPC coefficients

2007 Feb 02

Hi everyone! It's my first time posting to this list, and I've got a fairly technical question. I'm interested in doing phoneme extraction, and one of the first steps in the algorithm I'm planning to use is to get the LPC coefficients for an input frame. Since Speex is CELP-based, the coefficients must be generated in there somewhere. I've tried digging around in the source

Getting at the LPC coefficients

2007 Feb 02

Hi Jean-Marc, I'm looking at the 1.0.5 source, and I'm not seeing an _spx_lpc(). There's an _spx_autocorr(), which is in lpc.c and is called near the start of the encoder function in nb_celp.c. The encoder seems to call the autocorr() function, then calls wld() to do something called Levinson-Durbin. Am I right in thinking that after the call to wld(), the st->lpc[] array
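For readers unfamiliar with the two steps named in this post, here is a minimal textbook sketch of autocorrelation followed by the Levinson-Durbin recursion. It is not the Speex implementation: `autocorrelation` and `levinson_durbin` are hypothetical stand-ins, not `_spx_autocorr()` or `wld()`, and the sign convention used here (a prediction-error filter with a[0] = 1) is an assumption that may differ from what Speex stores in `st->lpc[]`.

```python
def autocorrelation(frame, order):
    """Autocorrelation of `frame` at lags 0..order (no windowing applied)."""
    return [
        sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))
        for lag in range(order + 1)
    ]


def levinson_durbin(r, order):
    """Solve the LPC normal equations from autocorrelation values `r`.

    Returns (a, error), where a[0] == 1.0 and A(z) = sum(a[j] * z**-j)
    is the prediction-error filter, and `error` is the residual energy.
    """
    a = [1.0] + [0.0] * order
    error = r[0]
    for i in range(1, order + 1):
        if error == 0.0:
            break  # silent frame: nothing left to predict
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / error  # reflection coefficient for this order
        updated = a[:]
        for j in range(1, i):
            updated[j] = a[j] + k * a[i - j]
        updated[i] = k
        a = updated
        error *= 1.0 - k * k
    return a, error


# Test signal (assumption): impulse response of a one-pole filter with its
# pole at 0.9, so the ideal coefficients are a = [1, -0.9, 0].
frame = [0.9 ** n for n in range(200)]
coefficients, residual = levinson_durbin(autocorrelation(frame, 2), 2)
```

A real encoder would typically window the frame and condition the autocorrelation values before running the recursion. Those steps are left out here to keep the recursion itself visible.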

-Infinity for Doule type column

2006 Jul 12

...crashes: "/usr/local/stow/ruby/lib/ruby/gems/1.8/gems/activerecord-1.14.3/lib/active_record/connection_adapters/abstract_adapter.rb:120:in `log': Mysql::Error: Unknown column 'Infinity' in 'field list': INSERT INTO full_doc_indices (`etf`, `Id`, `phonemes`, `filename`, `end_time`, `start_time`) VALUES(-Infinity, NULL, 'IH N T IH', 'BN99EN_2', 4952.77044, 4939.515) (ActiveRecord::StatementInvalid)" Obviously, ActiveRecord is using "-Infinity" literally, and then MySQL treats "-Infinity"...
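One common way to avoid this class of error is to replace non-finite floats before the row reaches the database layer. The sketch below shows the idea in Python rather than in the poster's Ruby/ActiveRecord setup. The helper `sql_safe` is hypothetical, and mapping infinities to NULL is an assumption that only makes sense if the column allows NULL.

```python
import math


def sql_safe(value):
    """Map NaN and +/-Infinity to None (SQL NULL); pass other values through."""
    if isinstance(value, float) and not math.isfinite(value):
        return None
    return value


# Row modelled on the failing INSERT in the error message above.
row = {
    "etf": float("-inf"),
    "phonemes": "IH N T IH",
    "filename": "BN99EN_2",
    "end_time": 4952.77044,
    "start_time": 4939.515,
}
clean_row = {column: sql_safe(value) for column, value in row.items()}
```

Whether NULL, a sentinel value, or rejecting the row is the right choice depends on what `etf` means in the application.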

Psycho-acoustics research

2002 Mar 19

...f compression, then play them back on decent audio equipment for listening tests to see if listeners can still distinguish important parts of the sounds in the recordings. In particular, I'm interested in looking at the nature of degradation when the compression ratio is particularly high: what phonemes become more difficult to distinguish soonest, as the compression ratio goes up? And then, if possible, I'd like to come up with an analysis of *why* those particular sounds are poorly recreated, as opposed to others. My guess is that fricative sounds (/f/, /v/, /s/, /z/) will "degrade"...

Digital speech within 100 Hz bandwidth

2008 Aug 12

Dear Sir, Could you please forward this e-mail to your engineering department. I am a ham radio person who wants to transmit from the Earth to the moon and back to the Earth by phone, not by Morse code. This will be a very weak signal. I need an extra 13.8 dB of gain. Could you please help me distribute the attached paper to someone who could take this project to the next level? Please reply

Some additional attacks on Cookie Session

2007 Mar 30

Aside from the replay attacks discussed, there are some other attack vectors on the cookie_session store. I appreciate (and admire!) Jeremy's good humor on all of this: > Planting the seed here led to quick ripening and plenty of pesticide. > Thanks for the fish, all. > > jeremy Anyway, here's what we came up with: 1. Brute Force SHA512 can be computed _very_ fast.

About Omega in pda()

2000 May 04

** High Priority ** Hello R users, My issue is both theoretical and technical. I would like to run a penalised discriminant analysis with the fda() function, but I don't know all the details of spline theory. I am trying it on the phonemes example from the article "Penalised Discriminant Analysis" of Hastie, Buja and Tibshirani 1994 : 5 groups and 256 variables. The 256

2 questions, frame size and SPEEX_GET_LOOKAHEAD

2006 Oct 31

> >> 2. What does SPEEX_GET_LOOKAHEAD do? How to use it? >> In speexenc.c, there is following code: >> >> ... >> speex_encoder_ctl(st, SPEEX_GET_LOOKAHEAD, &lookahead); >> ... >> nb_encoded = -lookahead; >> >> Can someone explain what this means? > > The lookahead is the number of samples you need to discard at the > start.
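The reply's point about discarding the lookahead samples at the start can be pictured with a toy model. The sketch below does not call Speex at all. It only imitates a codec that delays its output by `lookahead` samples and pads the tail to a whole number of frames. The values 160 and 40 and the helper `trim_decoded` are assumptions for illustration. In a real program the lookahead would come from `speex_encoder_ctl(st, SPEEX_GET_LOOKAHEAD, &lookahead)`, as in the quoted speexenc.c code.

```python
def trim_decoded(decoded, lookahead, original_length):
    """Drop the leading `lookahead` samples and any trailing padding."""
    return decoded[lookahead:lookahead + original_length]


frame_size = 160  # assumed frame size in samples
lookahead = 40    # hypothetical value; query the codec for the real one

# Toy stand-in for an encode/decode round trip: the signal comes back
# delayed by `lookahead` samples and zero-padded to a frame boundary.
original = list(range(1, 401))
delayed = [0] * lookahead + original
padding = (-len(delayed)) % frame_size
decoded = delayed + [0] * padding

recovered = trim_decoded(decoded, lookahead, len(original))
```

For concatenative synthesis this means the length of each original recording has to be stored alongside the compressed data, so that both the leading delay and the trailing padding can be removed before units are joined.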

running TextAloudMP3

2005 Sep 22

Hello, I have trouble running TextAloudMP3 under wine. TextAloudMP3 version is 1.459 Wine version is 20050725 My TextAloudMp3 directory looks like: $ ls TextAloudMp3 -rw------- 1 atom users 842 Sep 9 14:31 27bluL2.gif -rw------- 1 atom users 842 Sep 9 14:31 27bluR2.gif -rw------- 1 atom users 842 Sep 9 14:31 27blulft.gif -rw------- 1 atom users 845 Sep 9 14:31 27blurgt.gif
