CMU_Pronouncing_Dictionary

CMU Pronouncing Dictionary

CMU Pronouncing Dictionary

Machine-readable pronunciations


The CMU Pronouncing Dictionary (also known as CMUdict) is an open-source pronouncing dictionary originally created by the Speech Group at Carnegie Mellon University (CMU) for use in speech recognition research.

Quick Facts Developer(s), Stable release ...

CMUdict provides a mapping orthographic/phonetic for English words in their North American pronunciations. It is commonly used to generate representations for speech recognition (ASR), e.g. the CMU Sphinx system, and speech synthesis (TTS), e.g. the Festival system. CMUdict can be used as a training corpus for building statistical grapheme-to-phoneme (g2p) models[1] that will generate pronunciations for words not yet included in the dictionary.

The most recent release is 0.7b; it contains over 134,000 entries. An interactive lookup version is available.[2]

Database format

The database is distributed as a plain text file with one entry to a line in the format "WORD  <pronunciation>" with a two-space separator between the parts. If multiple pronunciations are available for a word, variants are identified using numbered versions (e.g. WORD(1)). The pronunciation is encoded using a modified form of the ARPABET system, with the addition of stress marks on vowels of levels 0, 1, and 2. A line-initial ;;; token indicates a comment. A derived format, directly suitable for speech recognition engines is also available as part of the distribution; this format collapses stress distinctions (typically not used in ASR).

The following is a table of phonemes used by CMU Pronouncing Dictionary.[2]

More information ARPABET, Rspl. ...
More information ARPABET, Rspl. ...
More information AB, Description ...
More information ARPABET, Rspl. ...
More information ARPABET, Rspl. ...
More information ARPABET, Rspl. ...

History

More information Version, Release date ...

Applications

  • The Unifon converter is based on the CMU Pronouncing Dictionary.
  • The Natural Language Toolkit contains an interface to the CMU Pronouncing Dictionary.
  • The Carnegie Mellon Logios[5] tool incorporates the CMU Pronouncing Dictionary.
  • PronunDict, a pronunciation dictionary of American English, uses the CMU Pronouncing Dictionary as its data source. Pronunciation is transcribed in IPA symbols. This dictionary also supports searching by pronunciation.
  • Some singing voice synthesizer software like CeVIO Creative Studio and Synthesizer V uses modified version of CMU Pronouncing Dictionary for synthesizing English singing voices.
  • Transcriber, a tool for the full text phonetic transcription, uses the CMU Pronouncing Dictionary
  • 15.ai, a real-time text-to-speech tool using artificial intelligence, uses the CMU Pronouncing Dictionary

See also


References

  1. "Sequitur G2P - A trainable Grapheme-to-Phoneme converter".
  2. "The CMU Pronouncing Dictionary". CMU Pronouncing Dictionary. 2015-07-16. Archived from the original on 2022-06-03. Retrieved 2022-06-04.
  3. "Cmusphinx - Revision 10973: /Trunk/Logios". Archived from the original on 2011-05-20. Retrieved 2009-12-19.

Share this article:

This article uses material from the Wikipedia article CMU_Pronouncing_Dictionary, and is written by contributors. Text is available under a CC BY-SA 4.0 International License; additional terms may apply. Images, videos and audio are available under their respective licenses.