Go to the first, previous, next, last section, table of contents.


18 English phone lists

Relating phonemes to sounds is not obvious as people think. Even when one is familar with phone sets its easy to make mistakes when reading lists of phones alone. This is particularly true in reading diphone nonsense words. The table provided here are intended for both the experienced and inexperienced reader of phones, to help you decide on the pronunciation.

These tables are not supposed to be a substitute for a good phonetics course, they are intended to give people a basic idea of the pronunciation of the phone sets used in the particaulr examples in this document. Many simplifying assumptions have been made, and often aren't even mentioned. To the phoneticians out there I apologise, as much as the assumptions are wrong we are here listing atomic discrete phones which we have found useful in building practical systems, even though better sets probably exist.

18.1 US phoneset

Inspite of everyone telling you that there is one and only one US phoneset, when it comes to actually using one you quickly discover there are actually many standard one used by lots of different pieces of software, often the difference betwen them is trivial (e.g. case folding) but computers being fundamentally dumb can't take these trivial differences into account. Here we list the radio phoneset which is used by standard US voices in festival. The definition is in `festival/lib/radio_phones.scm'. This list was based on those phones that appear in the Boston University FM radio corpus with minor modifications. The list here is exactly those phones which appear in the diphone nonses words as used in the example explained in section 8.10 US/UK English Walkthrough.

aa
fAther, wAshington
ae
fAt, bAd
ah
bUt, hUsh
ao
lAWn, dOOr, mAll
aw
hOW, sOUth, brOWser
ax
About, cAnoe
ay
hIde, bIble
eh
gEt, fEAther
el
tabLE, usabLE
em
systEM, communisM
en
beatEN
er
fERtile, sEARch, makER
ey
gAte, Ate
ih
bIt, shIp
iy
bEAt, shEEp
ow
lOne, nOse
oy
tOY, OYster
uh
fUll, wOOd
uw
fOOl, fOOd
b
Book, aBrupt
ch
CHart, larCH
d
Done, baD
dh
THat, faTHer
f
Fat, lauGH
g
Good, biGGer
hh
Hello, loopHole
jh
diGit, Jack
k
Camera, jaCK, Kill
l
Late, fuLL
m
Man, gaMe
n
maN, New
ng
baNG, sittiNG
p
Pat, camPer
r
Reason, caR,
s
Sit, maSS
sh
SHip, claSH
t
Tap, baT
th
THeatre, baTH
v
Various, haVe
w
Water, cobWeb
y
Yellow, Yacht
z
Zero, quiZ, boyS
zh
viSion, caSual
pau
short silence

In addition to the phone sthemselves the nonsense word generated by the diphone schema also have some other notations to denote different type of phone.

The use of - (hyphen) in the nonsense word itself is used to denot an explicit syllable boundary. Thus pau t aa n - k aa pau is used to state that the word should be pronounced as tan ka rather than tank ah. Where no explicit syllable boundary is given the pronunciation should be pronounce naturally without any boundary (which is probabaly too underspecified in some cases).

The use of _ (underscore) in phone names is used to denote consonant clusters. That is t_-_r is the /tr/ as found in trip not that in cat run.

18.2 UK phoneset

This phoneset developed at CSTR a number of years ago is for Southern UK English (RP, "received pronunciation"). Its definition is in `festival/lib/mrpa_phones.scm'.

uh
cUp, dOne
e
bEt, chEck
a
cAt, mAtch
o
cOttage, hOt
i
bIt, shIp
u
pUll, fOOt, bOOk
ii
bEAt, shEEp
uu
pOOl, bOOt
oo
AUthor, cOURt
aa
ARt, hEARt
@@
sEARch, bURn
ai
bIte, mIght, lIke
ei
Ate, mAIl
oi
tOY, OYster
au
sOUth, hOW
ou
hOle, cOAt
e@
AIR, bARE, chAIR
i@
EAR, bEER
u@
sUre, jUry
@
About, arlAs, equipmEnt
p
Pat, camPer
t
Tap, baT
k
Camera, jaCK, Kill
b
Book, aBrupt
d
Done, baD
g
Good, biGGer
s
Sit, maSS
z
Zero, quiZ, boyS
sh
SHip, claSH
zh
viSion, caSual
f
Fat, lauGH
v
Various, haVe
th
THeatre, baTH
dh
THat, faTHer
ch
CHart, larCH
jh
diGit, Jack
h
Hello, loopHole
m
Man, gaMe
n
maN, New
ng
baNG, sittiNG
l
Late, bLack
y
Yellow, Yacht
r
Reason, caReer,
w
Water, cobWeb
#
short silence

In addition to the phone sthemselves the nonsense word generated by the diphone schema also have some other notations to denote different type of phone.

The use of - (hyphen) in the nonsense word itself is used to denot an explicit syllable boundary. Thus pau t aa n - k aa pau is used to state that the word should be pronounced as tan ka rather than tank ah. Where no explicit syllable boundary is given the pronunciation should be pronounce naturally without any boundary (which is probabaly too underspecified in some cases).

The use of _ (underscore) in phone names is used to denote consonant clusters. That is t_-_r is the /tr/ as found in trip not that in cat run.


Go to the first, previous, next, last section, table of contents.