12 References

allen87: J. Allen, S. Hunnicutt, and D. Klatt. From Text to Speech: The MITalk system. Cambridge University Press, Cambridge, UK, 1987.
anderson84: M. Anderson, J. Pierrehumbert, and M. Liberman. Synthesis by rule of English intonation patterns. In Proceedings of ICASSP 84, pages 2.8.1--2.8.4, 1984.
bacchiani96: M. Bacchiani, M. Ostendorf, Y. Sagisaka, and K. Paliwal. Design of a speech recognition system based on acoustically derived segmental units. In ICASSP-96, volume 1, pages 443--446, Atlanta, Georgia, 1996.
bachenko90: J. Bachenko and E. Fitzpatrick. A computational grammar of discourse-neutral prosodic phrasing in English. Computational Linguistics, 16(3):155--170, 1990.
black96: A. Black and A. Hunt. Generating F0 contours from ToBI labels using linear regression. In ICSLP96, volume 3, pages 1385--1388, Philadelphia, PA, 1996.
black98b: A. Black, K. Lenzo, and V. Pagel. Issues in building general letter to sound rules. In Proc. ESCA Workshop on Speech Synthesis, pages 77--80, Australia, 1998.
black97b: A. Black and P. Taylor. Assigning phrase breaks from part-of-speech sequences. In Eurospeech97, volume 2, pages 995--998, Rhodes, Greece, 1997.
black97a: A. W. Black. Predicting the intonation of discourse segments from examples in dialogue speech. In Y. Sagisaka, N. Campbell, and N. Higuchi, editors, Computing Prosody, pages 117--128. Springer-Verlag, 1997.
black95a: A. W. Black. Comparison of algorithms for predicting accent placement in English speech synthesis. In Proceedings of the Acoustical Society of Japan, pages 275--276, 3--4--1, Spring, 1995.
black95d: A. W. Black and N. Campbell. Optimising selection of units from speech databases for concatenative synthesis. In Eurospeech95, volume 1, pages 581--584, Madrid, Spain, 1995.
campbell96: N. Campbell and A. Black. Prosody and the selection of source units for concatenative synthesis. In J. van Santen, R. Sproat, J. Olive, and J. Hirschberg, editors, Progress in speech synthesis, pages 279--282. Springer-Verlag, 1996.
campbell91: N. Campbell and S. Isard. Segment durations in a syllable frame. Journal of Phonetics, 19(1):37--47, 1991.
campbell92b: W. N. Campbell. Synthesis units for natural English speech. IEICE, SP 91-129:55--62, 1992.
conkie96: A. Conkie and S. Isard. Optimal coupling of diphones. In J. van Santen, R. Sproat, J. Olive, and J. Hirschberg, editors, Progress in speech synthesis, pages 293--305. Springer-Verlag, 1996.
DeRose88: S. DeRose. Grammatical category disambiguation by statistical optimization. Computational Linguistics, 14:31--39, 1988.
donovan95: R. Donovan and P. Woodland. Improvements in an HMM-based speech synthesiser. In Eurospeech95, volume 1, pages 573--576, Madrid, Spain, 1995.
dusterhoff97a: K. Dusterhoff and A. Black. Generating F0 contours for speech synthesis using the tilt intonation theory. In Proc. ESCA Workshop on Intonation, Athens, Greece, 1997.
dutoit93: T. Dutoit and H. Leich. MBR-PSOLA: Text-to-speech synthesis based on an MBE re-synthesis of the segments database. Speech Communication, 13:435--440, 1993.
fujimura93: O. Fujimura. C/d model: a computational model of phonetic implementation. In E. Ristad, editor, DIMACS Proceedings. Am. Math. Soc., 1993.
fujisaki83: H. Fujisaki. Dynamic characteristics of voice fundamental frequency in speech and singing. In P. MacNeilage, editor, The Production of Speech, pages 39--55. Springer-Verlag, 1983.
hess83: W. Hess. Pitch Detection in Speech Signals: Algorithms and Devices. Springer-Verlag, 1983.
hirschberg92: J. Hirschberg. Using discourse content to guide pitch accent decisions in synthetic speech. In G. Bailly and C. Benoit, editors, Talking Machines, pages 367--376. North-Holland, 1992.
hirschberg94: J. Hirschberg and P. Prieto. Training intonation phrase rules automatically for English and Spanish text-to-speech. In Proc. ESCA Workshop on Speech Synthesis, pages 159--162, Mohonk, NY, 1994.
huang97: X. Huang, A. Acero, H. Hon, Y. Ju, J. Liu, S. Meredith, and M. Plumpe. Recent improvements on Microsoft's trainable text-to-speech synthesizer: Whistler. In ICASSP-97, volume II, pages 959--962, Munich, Germany, 1997.
hunt96: A. Hunt and A. Black. Unit selection in a concatenative speech synthesis system using a large speech database. In ICASSP-96, volume 1, pages 373--376, Atlanta, Georgia, 1996.
hunt89: M. Hunt, D. Zwierynski, and R. Carr. Issues in high quality LPC analysis and synthesis. In Eurospeech89, volume 2, pages 348--351, Paris, France, 1989.
jilka96: M. Jilka. Regelbasierte Generierung natürlich klingender Intonationsmuster des amerikanischen Englisch (Rule-based generation of naturally sounding intonation patterns of American English). Master's thesis, University of Stuttgart, Institute of Natural Language Processing, 1996.
kain98: A. Kain and M. Macon. Spectral voice conversion for text-to-speech synthesis. In ICASSP-98, volume 1, pages 285--288, Seattle, Washington, 1998.
klatt87: D. Klatt. Review of text-to-speech conversion for English. Journal of the Acoustical Society of America, 82:737--793, 1987.
malfrere97: F. Malfrere and T. Dutoit. High quality speech synthesis for phonetic speech segmentation. In Eurospeech97, pages 2631--2634, Rhodes, Greece, 1997.
malfrere98: F. Malfrere, T. Dutoit, and P. Mertens. Automatic prosody generation using suprasegmental unit selection. In Proc. ESCA Workshop on Speech Synthesis, pages 323--327, Australia, 1998.
marcus93: M. Marcus, B. Santorini, and M. Marcinkiewicz. Building a large annotated corpus of English: the Penn Treebank. Computational Linguistics, 19:313--330, 1993.
moebius96: B. Moebius. Synthesizing German intonation contours. In J.P. van Santen, R. Sproat, J. Olive, and J. Hirschberg, editors, Progress in Speech Synthesis, pages 401--415. Springer, 1996.
moehler98: G. Moehler and A. Conkie. Parametric modelling of intonation using vector quantization. In Proc. ESCA Workshop on Speech Synthesis, pages 311--316, Australia, 1998.
moulines90: E. Moulines and F. Charpentier. Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Communication, 9(5/6):453--467, 1990.
ostendorf95: M. Ostendorf, P. Price, and S. Shattuck-Hufnagel. The Boston University Radio News Corpus. Technical Report ECS-95-001, Electrical, Computer and Systems Engineering Department, Boston University, Boston, MA, 1995.
ostendorf94: M. Ostendorf and N. Veilleux. A hierarchical stochastic model for automatic prediction of prosodic boundary location. Computational Linguistics, 20(1):27--55, 1994.
pierrehumbert80: Janet B. Pierrehumbert. The Phonology and Phonetics of English Intonation. PhD thesis, MIT, 1980. Published by Indiana University Linguistics Club.
ritchie92: G. Ritchie, G. Russell, A. Black, and S. Pulman. Computational Morphology. MIT Press, Cambridge, Mass., 1992.
ross96: K. Ross and M. Ostendorf. Prediction of abstract prosodic labels for speech synthesis. Computer Speech and Language, ??(?):?, 1996.
nuutalk92: Y. Sagisaka, N. Kaiki, N. Iwahashi, and K. Mimura. ATR -- ν-TALK speech synthesis system. In Proceedings of ICSLP 92, volume 1, pages 483--486, 1992.
sanders95: E. Sanders and P. Taylor. Using statistical models to predict phrase boundaries for speech synthesis. In Eurospeech95, volume 2, pages 1811--1814, Madrid, Spain, 1995.
silverman92: K. Silverman, M. Beckman, J. Pitrelli, M. Ostendorf, C. Wightman, P. Price, J. Pierrehumbert, and J. Hirschberg. ToBI: a standard for labelling English prosody. In Proceedings of ICSLP92, volume 2, pages 867--870, 1992.
sproat98b: R. Sproat, A. Hunt, M. Ostendorf, P. Taylor, A. Black, K. Lenzo, and M. Edgington. SABLE: A standard for TTS markup. In International Conference on Spoken Language Processing, Sydney, Australia, 1998.
sproat96b: R. Sproat, C. Shih, W. Gale, and N. Chang. A stochastic finite-state word-segmentation algorithm for Chinese. Computational Linguistics, 22(3), 1996.
sproat97: R. Sproat, P. Taylor, M. Tanenblatt, and A. Isard. A markup language for text-to-speech synthesis. In Eurospeech97, volume 2, pages 995--998, Rhodes, Greece, 1997.
syrdal98a: A. Syrdal, G. Moehler, K. Dusterhoff, A. Conkie, and A. Black. Three methods of intonation modelling. In Proc. ESCA Workshop on Speech Synthesis, pages 305--310, Australia, 1998.
taylor00a: P. Taylor. Analysis and synthesis of intonation using the tilt model. Journal of the Acoustical Society of America, 107(3):1697--1714, 2000.
taylor99b: P. Taylor and A. Black. Speech synthesis by phonological structure matching. In Eurospeech99, volume 4, pages 1531--1534, Budapest, Hungary, 1999.
taylor98b: P. Taylor, A. Black, and R. Caley. The architecture of the Festival speech synthesis system. In 3rd ESCA Workshop on Speech Synthesis, pages 147--151, Jenolan Caves, Australia, 1998.
taylor94b: P. Taylor and A. W. Black. Synthesizing conversational intonation from a linguistically rich input. In Proc. ESCA Workshop on Speech Synthesis, pages 175--178, Mohonk, NY, 1994.
taylor97b: P. Taylor and A. Isard. SSML: A speech synthesis markup language. Speech Communication, 21:123--133, 1997.
bosch98: A. van den Bosch, T. Weijters, and W. Daelemans. Modularity in inductive-learned word pronunciation systems. In Proc. NeMLaP3/CoNLL98, pages 185--194, Sydney, 1998.
yarowsky96: D. Yarowsky. Homograph disambiguation in text-to-speech synthesis. In J. van Santen, R. Sproat, J. Olive, and J. Hirschberg, editors, Progress in speech synthesis, pages 157--172. Springer-Verlag, 1996.