FPGA Central - World's 1st FPGA / CPLD Portal

FPGA Central

World's 1st FPGA Portal

 

Go Back   FPGA Groups > NewsGroup > DSP

DSP comp.dsp newsgroup, mailing list

Reply
 
LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 11-14-2004, 07:52 PM
James Salsman
Guest
 
Posts: n/a
Default Re: is Festival the state-of-the-art system?

>>... understand that some of the really good speech synthesizers
>> "cheat" by concatenating human recordings of words, or otherwise
>> patching the synthesis on a word-by-word basis.
>>
>> I know that there's at least one TTS product using 60 MB.

>
> That doesn't sound like much. Festival takes up about 70MB on my system.


How much memory does it allocate when that runs?

How many different audio files does it have?

I don't think I've looked very closely at a speech synthesis
program since around 2000. When I found that very few people knew
how to synthesize sound directly from a spectrogram, I was
astonished. That had been proven reversible. But the algorithms
published on it were doing a terrible job. At one point I wasn't
even sure that anyone knew how to do it in less than something
like either O(N^2 log N) or O(N log N^2) time.

The O(N log N) phase vocodec which resulted is in:
http://www.bovik.org/fs.m.txt
http://www.bovik.org/af.m.txt

That should help any resynthesis system use a compressed audio
format and still be able to do real-time resynthesis with any
arbitrary spectral transforms. It can also be used to normalize
data sets so they all have the same relative pitch, tempo, etc.

Thanks are due to Malcolm Slaney, Miller Puckette, Mark Dolson,
Vaughan Pratt, and 1977 IEEE Medal of Honor recipient Michael Portnoff.

I wonder how much switching to a Portnoff window from a Hamming
window helps ordinary speech recognition, all other things equal.

CMU Drs. Mostow and Aist have a patent on the use of speech
synthesis in my line of work. I'm glad they took that one out.

Sincerely,
James
--
www.readsay.com - maker of the ReadSay PROnounce English literacy system
400 MHz PDA included: $499 -- http://www.readsay.com/PROnounce.html
Reply With Quote
Reply

Bookmarks

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
system verilog state machine maxascent FPGA 2 11-02-2008 08:23 PM
MI5 Persecution: Royal Festival Hall 15/4/99 (9060) [email protected] VHDL 1 11-18-2007 06:05 PM
MI5 Persecution: Royal Festival Hall 15/4/99 (9877) [email protected] Verilog 0 11-17-2007 10:32 AM
MI5 Persecution: Royal Festival Hall 15/4/99 (9808) [email protected] FPGA 0 11-17-2007 10:30 AM
HELP PLEASE!! - Finite State Machine - Automaton - Microprogrammed System deejayfred VHDL 0 10-02-2003 03:23 AM


All times are GMT +1. The time now is 02:20 AM.


Powered by vBulletin® Version 3.8.0
Copyright ©2000 - 2012, Jelsoft Enterprises Ltd.
Search Engine Friendly URLs by vBSEO 3.2.0
Copyright 2008 @ FPGA Central. All rights reserved