search for: phonemes

Displaying 19 results from an estimated 19 matches for "phonemes".

Did you mean: phoneme
2006 Oct 31
1
2 questions, frame size and SPEEX_GET_LOOKAHEAD
...issues are not a concern at this stage. On Oct 31, 2006, at 3:40 PM, Andras Kadinger wrote: > [At the risk of educating you about something you might already know] > > Natural speech in most human languages gradually changes from one > phoneme to the next. > > Concatenating phonemes together from a fixed, prerecorded, > unflexible set would give rise to abrupt changes between them (both > in phoneme quality and in pitch), and thus make the resulting speech > hard to understand and/or uncomfortable to listen to. > > Most flexible (unlimited vocabulary), uni...
2006 Oct 31
2
2 questions, frame size and SPEEX_GET_LOOKAHEAD
Ok, let me first explain why 5ms matters, even they are 0's, in my particular application. I am working on a speech synthesis system. The basic idea is concatenating pre-recorded phonemes or words into longer sentences. So any missing or extra samples, even it is as short as 5~10ms, cause very noticeable discontinuities. I want to use speex to compress/decompress those pre-recorded material. But I'm concerned about the extra 0's might be padded at both ends. For th...
2006 Oct 31
0
2 questions, frame size and SPEEX_GET_LOOKAHEAD
[At the risk of educating you about something you might already know] Natural speech in most human languages gradually changes from one phoneme to the next. Concatenating phonemes together from a fixed, prerecorded, unflexible set would give rise to abrupt changes between them (both in phoneme quality and in pitch), and thus make the resulting speech hard to understand and/or uncomfortable to listen to. Most flexible (unlimited vocabulary), unit (e.g. "phoneme"...
2004 Aug 06
2
de-essing into speex?
are there plans to add "de-essing" as part of the speexenc procedure, to make it possible to use lower quality compression without getting those horrible computer-like "ess" sounds in the finished result? like if you say "someone said the sun is shining", there is a lot of ess sounds, and these will sound "computer-ish" at vbr qualities below 9. i know
2004 Aug 06
0
de-essing into speex?
...nt mixing and encoding of speex streams (VoIP phone lines) less effective and more costly in a resource sense. It is a good idea, though I would consider a luxury filter, that's just me being overly assertive. || \/ If anyone is interested from my knowledge of speech recognition all human phonemes when converted from power vs. time to power vs. freq exibit 2 characteristic spikes. The primary spike defines the base for recognizing the phoneme and the next highest spikes relative location and power give a program a good probability match as to which phoneme it is. Humanizing spx audio deriv...
2006 Nov 05
1
skype and SIP hardware for linux
I'm looking at the <http://support.a-link.com/phonemate/IPU1.htm> phone because it works with Skype (from Linux), but can do SIP, too. Not necessarily asterisk related, but possibly. My networking situation might require IAX if I'm running Linux and want to use SIP, I'm not certain (Skype works fine). Putting that unknown aside for the moment, how does this phone work under
2004 Aug 06
3
de-essing into speex?
...phone > lines) less effective and more costly in a resource sense. > > It is a good idea, though I would consider a luxury filter, that's > just me being overly assertive. > || > \/ > > If anyone is interested from my knowledge of speech recognition all > human phonemes when converted from power vs. time to power vs. freq > exibit 2 characteristic spikes. The primary spike defines the base > for recognizing the phoneme and the next highest spikes relative > location and power give a program a good probability match as to > which phoneme it is. > &...
2004 Aug 06
0
de-essing into speex?
...re costly in a resource sense. > > > > It is a good idea, though I would consider a luxury filter, that's > > just me being overly assertive. > > || > > \/ > > > > If anyone is interested from my knowledge of speech recognition all > > human phonemes when converted from power vs. time to power vs. freq > > exibit 2 characteristic spikes. The primary spike defines the base > > for recognizing the phoneme and the next highest spikes relative > > location and power give a program a good probability match as to > > which ph...
2006 Feb 16
0
(m)simtest ?
Hi,. We have 2 values (first formant F1, second formant F2) for a given phoneme for six languages. We want to see whether the languages are significantly different one from another for this given phoneme. We have done a manova on our data and it works well, but we doesn't allow us to see which pair of languages are different. If we have only one formant for the phoneme, we would use
2004 May 09
3
German sound files available
Hi there, today I made the German language prompts available for download: http://www.karl.aegee.org/asterisk.nsf/HT/sound-de Be aware: Asterisk doesn't yet fully support languages other than English, there are still (smaller) issues with voicemail and date/time announcements that require a patch. Cheers, Philipp
2007 Feb 02
2
Getting at the LPC coefficients
Hi everyone! It's my first time posting to this list, and I've got a fairly technical question. I'm interested in doing phoneme extraction, and one of the first steps in the algorithm I'm planning to use is to get the LPC coefficients for an input frame. Since Speex is CELP-based, the coefficients must be generated in there somewhere. I've tried digging around in the source
2007 Feb 02
1
Getting at the LPC coefficients
Hi Jean-Marc I'm looking at the 1.0.5 source, and I'm not seeing an _spx_lpc(). There's an _spx_autocorr(), which is in lpc.c and is called near the start of the encoder function in nb_celp.c. The encoder seems to call the autocorr() function, then calls wld() to do something called Levinson-Durbin. Am I right in thinking that after the call to wld(), the st->lpc[] array
2006 Jul 12
1
-Infinity for Doule type column
...crashes: "/usr/local/stow/ruby/lib/ruby/gems/1.8/gems/activerecord-1.14.3/lib/active_record/connection_adapters/abstract_adapter.rb:120:in `log'': Mysql::Error: Unknown column ''Infinity'' in ''field list'': INSERT INTO full_doc_indices (`etf`, `Id`, `phonemes`, `filename`, `end_time`, `start_time`) VALUES(-Infinity, NULL, ''IH N T IH'', ''BN99EN_2'', 4952.77044, 4939.515) (ActiveRecord::StatementInvalid)" Obviously, ActiveRecord is using "-Infinity" literally, and then MySQL treats "-Infinity"...
2002 Mar 19
3
Psycho-acoustics research
...f compression, then play them back on decent audio equipment for listening tests to see if listeners can still distinguish important parts of the sounds in the recordings. In particular, I'm interested in looking at the nature of degradation when the compression ratio is particularly high: what phonemes become more difficult to distinguish soonest, as the compression ratio goes up? And then, if possible, I'd like to come up with an analysis of *why* those particular sounds are poorly recreated, as opposed to others. My guess is that fricative sounds (/f/, /v/, /s/, /z/) will "degrade&qu...
2008 Aug 12
3
Digital speech within 100 Hz bandwidth
Dear Sir, Could you please forward this e-mail to your engineering department. I am a ham radio person who wants to transmit from the Earth to the moon and back to the Earth by phone not by Morris code. This will be a very weak signal. I need an extra 13.8 dB of gain. Could you please help me distribute the attached paper to someone who could take this project into the next level? Please reply
2007 Mar 30
7
Some additional attacks on Cookie Session
Aside from the replay attacks discussed, there are some other attack vectors on the cookie_session store. I appreciate (and admire!) Jeremy''s good humor on all of this: > Planting the seed here led to quick ripening and plenty of pesticide. > Thanks for the fish, all. > > jeremy Anyway, here''s what we came up with: 1. Brute Force SHA512 can be computed _very_ fast.
2000 May 04
0
About Omega in pda()
** High Priority ** Hello R users My issue is both theorical and technical. I would like to run a penalised discriminant analysis with the fda() function, but I don''t know all the details of splines theory. I try on the example of the phonems from the article "Penalised Discriminant Analysis" of Hastie, Buja and Tibshirani 1994 : 5 groups and 256 variables. The 256
2006 Oct 31
2
2 questions, frame size and SPEEX_GET_LOOKAHEAD
> >> 2. What does SPEEX_GET_LOOKAHEAD do? How to use it? >> In speexenc.c, there is following code: >> >> ... >> speex_encoder_ctl(st, SPEEX_GET_LOOKAHEAD, &lookahead); >> ... >> nb_encoded = -lookahead; >> >> Can someone explain what this means? > > The lookahead is the number of samples you need to discard at the > start.
2005 Sep 22
1
running TextAloudMP3
Hello I have troubles running TextAloundMp3 under wine. TextAloudMP3 version is 1.459 Wine version is 20050725 My TextAloudMp3 directory looks like: $ ls TextAloudMp3 -rw------- 1 atom users 842 Sep 9 14:31 27bluL2.gif -rw------- 1 atom users 842 Sep 9 14:31 27bluR2.gif -rw------- 1 atom users 842 Sep 9 14:31 27blulft.gif -rw------- 1 atom users 845 Sep 9 14:31 27blurgt.gif