Tiido's speech synthesizer 3

Announce (tech) demos or games releases

Moderator: Mask of Destiny

Post Reply
TmEE co.(TM)
Very interested
Posts: 2373
Joined: Tue Dec 05, 2006 1:37 pm
Location: Estonia, Rapla City
Contact:

Tiido's speech synthesizer 3

Post by TmEE co.(TM) » Tue Feb 24, 2009 6:57 pm

http://www.fileden.com/files/2008/4/21/ ... PEECH3.BIN

(Lot) more work is needed around the samples, but things are shaping up :)

enjoy :)
Mida sa loed ? Nagunii aru ei saa ;)
http://www.tmeeco.eu
Files of all broken links and images of mine are found here : http://www.tmeeco.eu/FileDen

Jorge Nuno
Very interested
Posts: 374
Joined: Mon Jun 11, 2007 3:09 am
Location: Azeitão, PT

Post by Jorge Nuno » Tue Feb 24, 2009 7:37 pm

I couldn't understand a damn thing! :? Could be my fault, too...

TmEE co.(TM)
Very interested
Posts: 2373
Joined: Tue Dec 05, 2006 1:37 pm
Location: Estonia, Rapla City
Contact:

Post by TmEE co.(TM) » Tue Feb 24, 2009 7:42 pm

You're not the only one :P

"1 2 3 testing testing this is demonstration of tiido's speech synthesizer bla bla bla...."
Mida sa loed ? Nagunii aru ei saa ;)
http://www.tmeeco.eu
Files of all broken links and images of mine are found here : http://www.tmeeco.eu/FileDen

Jorge Nuno
Very interested
Posts: 374
Joined: Mon Jun 11, 2007 3:09 am
Location: Azeitão, PT

Post by Jorge Nuno » Tue Feb 24, 2009 8:13 pm

How is this done? Every letter is sampled in the bin then they're glued together, looking at a word and loading the sample that corresponds to the next char?

What if the letters have different sounds in different situations?

TmEE co.(TM)
Very interested
Posts: 2373
Joined: Tue Dec 05, 2006 1:37 pm
Location: Estonia, Rapla City
Contact:

Post by TmEE co.(TM) » Tue Feb 24, 2009 8:19 pm

It uses 31 samples (with silence), and everythign is based on Estonian pronounciation system where a letter is pronounced same in any word (with some exceptions). Russian works so too (with some exceptions).
Mida sa loed ? Nagunii aru ei saa ;)
http://www.tmeeco.eu
Files of all broken links and images of mine are found here : http://www.tmeeco.eu/FileDen

Jorge Nuno
Very interested
Posts: 374
Joined: Mon Jun 11, 2007 3:09 am
Location: Azeitão, PT

Post by Jorge Nuno » Tue Feb 24, 2009 8:27 pm

Well this is a lot harder in EN/ES/FR/PT/others-I-know-nothing-about, because letters can have a TON of different phonetics, however the accents/tilde can help a little :wink:

HardWareMan
Very interested
Posts: 727
Joined: Sat Dec 15, 2007 7:49 am
Location: Kazakhstan, Pavlodar

Post by HardWareMan » Wed Feb 25, 2009 3:40 am

I think you should use phonemes of syllables rather than letters. Then speech will be more clear. A long time ago I did the same thing for russian...

TmEE co.(TM)
Very interested
Posts: 2373
Joined: Tue Dec 05, 2006 1:37 pm
Location: Estonia, Rapla City
Contact:

Post by TmEE co.(TM) » Wed Feb 25, 2009 6:51 pm

I've got very good results using just letters, samples just have to be done right... I haven't done any work on this today :/
Mida sa loed ? Nagunii aru ei saa ;)
http://www.tmeeco.eu
Files of all broken links and images of mine are found here : http://www.tmeeco.eu/FileDen

Snake
Very interested
Posts: 203
Joined: Sat Sep 13, 2008 1:01 am

Post by Snake » Wed Feb 25, 2009 7:52 pm

HardWareMan wrote:I think you should use phonemes of syllables rather than letters. Then speech will be more clear. A long time ago I did the same thing for russian...
:D I've done this type of thing myself.

One thing I've always meant to do, but never seem to get around to it, is to attempt to extract samples from speeches by famous people... Could be fun :)

powerofrecall
Very interested
Posts: 237
Joined: Fri Apr 17, 2009 7:35 pm
Location: USA

Post by powerofrecall » Fri Apr 17, 2009 10:25 pm

What about rsynth?

http://www.speech.cs.cmu.edu/comp.speec ... synth.html

It's based on some really old, bad public domain code but it's marginally understandable and it sounds cool, haha

Chilly Willy
Very interested
Posts: 2785
Joined: Fri Aug 17, 2007 9:33 pm

Post by Chilly Willy » Sat Apr 18, 2009 8:21 am

powerofrecall wrote:What about rsynth?

http://www.speech.cs.cmu.edu/comp.speec ... synth.html

It's based on some really old, bad public domain code but it's marginally understandable and it sounds cool, haha
It use floats for it's computations. It would be a bit of work to convert to integer math. It's also not likely to be anything close to "real time".

Post Reply