Displaying 6 results from an estimated 6 matches for "diphones".
Did you mean:
iphones
2004 Nov 05
1
Encoding problem
I am using speex for encoding and decoding speech files in a speech
synthesis program. I am using concatenative approach for speech synthesis
using diphones (small speech chunks) as basic unit.
Now in my application, the order of diphones to be joined after decoding is
determined at runtime.
Therefore I have to encode each diphone independent of others.
I find out that when I encode diphones in order of their concatenation with
out reinitializing...
2005 Jul 14
0
Changing the voice in Asterisk
> Has anyone had any luck in changing the voices for Festival and
> Asterisk?
>
> I have Festival installed and working, but can not get the voice
> different
> from the default.
>
> Thanks,
>
> Jason
>
Jason--
Assuming you follow the installation instructions, and install the Mbrola and
other goodies for all the possible different voices, then you can,
2006 Oct 31
1
2 questions, frame size and SPEEX_GET_LOOKAHEAD
Hi, Andras,
Thanks for the comments. Yes, I am aware of those issues. I probably
should have been more accurate on my usage of terms. Actually in my
project, the unit collection is a mixture of diphones and words.
However seems to me, these synthesizer specific issue is irrelevant to
my question about speex. As you said, i merely use speex as storage
methods. All I ask for is to get the samples as close to original
recording as possible after encoding and decoding. Blending, cross
fading,...
2006 Oct 31
2
2 questions, frame size and SPEEX_GET_LOOKAHEAD
Ok, let me first explain why 5ms matters, even they are 0's, in my
particular application.
I am working on a speech synthesis system. The basic idea is
concatenating pre-recorded phonemes or words into longer sentences. So
any missing or extra samples, even it is as short as 5~10ms, cause
very noticeable discontinuities.
I want to use speex to compress/decompress those pre-recorded
2006 Oct 31
0
2 questions, frame size and SPEEX_GET_LOOKAHEAD
...Most flexible (unlimited vocabulary), unit (e.g. "phoneme")
concatenation speech synthesizers therefore use some strategy to blend
the pieces of speech together, usually both in pitch and in phoneme
quality. One very conceptually simple and therefore popular approach is
storing "diphones" - phoneme transitions: e.g. the second half of "a"
and the first half of "p" from the hypothetical word "apa". Since
phonemes usually tend to reach their "most recognizable" state in the
"middle", cutting and splicing them together around t...
2003 Dec 08
9
IAX clients
Hi,
Is there IAX client in Applet JAVA which can be embeded in a web page ?
Best regards
Rattana
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.digium.com/pipermail/asterisk-users/attachments/20031208/c388ef61/attachment.htm