A diphthong (DIF-thong or DIP-thong; from Greek: δίφθογγος, diphthongos, literally "double sound" or "double tone"), also known as a gliding vowel, is a combination of two adjacent vowel sounds within the same syllable. Technically, a diphthong is a vowel with two different targets: that is, the tongue (and/or other parts of the speech apparatus) moves during the pronunciation of the vowel. In most varieties of English, the phrase no highway cowboys has five distinct diphthongs, one in every syllable.
Diphthongs contrast with monophthongs, where the tongue or other speech organs do not move and the syllable contains only a single vowel sound. For instance, in English, the word ah is spoken as a monophthong (/ɑː/), while the word ow is spoken as a diphthong in most varieties (/aʊ/). Where two adjacent vowel sounds occur in different syllables, as in the English word re-elect, the result is described as hiatus, not as a diphthong. (The English word hiatus is itself an example of both hiatus and diphthongs.)
Diphthongs often form when separate vowels are run together in rapid speech during a conversation. However, there are also unitary diphthongs, as in the English examples above, which are heard by listeners as single-vowel sounds (phonemes).
In the International Phonetic Alphabet (IPA), monophthongs are transcribed with one symbol, as in English sun [sʌn], in which ⟨ʌ⟩ represents a monophthong. Diphthongs are transcribed with two symbols, as in English high [haɪ] or cow [kaʊ], in which ⟨aɪ⟩ and ⟨aʊ⟩ represent diphthongs.
Diphthongs may be transcribed with two vowel symbols or with a vowel symbol and a semivowel symbol. In the words above, the less prominent member of the diphthong can be represented with the symbols for the palatal approximant [j] and the labiovelar approximant [w], with the symbols for the close vowels [i] and [u], or the symbols for the near-close vowels [ɪ] and [ʊ]:
|vowel and semivowel||⟨haj kaw⟩||broad transcription|
|two vowel symbols||⟨hai̯ kau̯⟩||broad transcription|
|two vowel symbols||⟨haɪ̯ kaʊ̯⟩||narrow transcription|
Some transcriptions are broader or narrower (less precise or more precise phonetically) than others. Transcribing the English diphthongs in high and cow as ⟨aj aw⟩ or ⟨ai̯ au̯⟩ is a less precise or broader transcription, since these diphthongs usually end in a vowel sound that is more open than the semivowels [j w] or the close vowels [i u]. Transcribing the diphthongs as ⟨aɪ̯ aʊ̯⟩ is a more precise or narrower transcription, since the English diphthongs usually end in the near-close vowels [ɪ ʊ].
The non-syllabic diacritic, the inverted breve below ⟨◌̯⟩, is placed under the less prominent part of a diphthong to show that it is part of a diphthong rather than a vowel in a separate syllable: [aɪ̯ aʊ̯]. When there is no contrastive vowel sequence in the language, the diacritic may be omitted. Other common indications that the two sounds are not separate vowels are a superscript, ⟨aᶦ aᶷ⟩, or a tie bar, ⟨a͡ɪ a͡ʊ⟩ or ⟨a͜ɪ a͜ʊ⟩. The tie bar can be useful when it is not clear which symbol represents the syllable nucleus, or when they have equal weight. Superscripts are especially used when an on- or off-glide is particularly fleeting.
The period ⟨.⟩ is the opposite of the non-syllabic diacritic: it represents a syllable break. If two vowels next to each other belong to two different syllables (hiatus), meaning that they do not form a diphthong, they can be transcribed with two vowel symbols with a period in between. Thus, lower can be transcribed ⟨ˈloʊ.ər⟩, with a period separating the first syllable, /loʊ/, from the second syllable, /ər/.
The non-syllabic diacritic is used only when necessary. It is typically omitted when there is no ambiguity, as in ⟨haɪ kaʊ⟩. No words in English have the vowel sequences *[a.ɪ a.ʊ], so the non-syllabic diacritic is unnecessary.
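The notations described above are ordinary Unicode combining characters, so they can be attached to vowel symbols programmatically. The following is a minimal sketch, not a standard tool: the function names `mark_offglide`, `tie`, and `strip_non_syllabic` are hypothetical and chosen only for illustration. It assumes the less prominent element comes second, as in English [aɪ̯ aʊ̯].

```python
import unicodedata

# Unicode combining characters used in IPA diphthong notation.
NON_SYLLABIC = "\u032f"  # inverted breve below, placed after the vowel it marks
TIE_ABOVE = "\u0361"     # tie bar above, placed between the two symbols
TIE_BELOW = "\u035c"     # tie bar below, placed between the two symbols


def mark_offglide(nucleus: str, glide: str) -> str:
    """Mark the second element as non-syllabic (hypothetical helper)."""
    return nucleus + glide + NON_SYLLABIC


def tie(first: str, second: str, below: bool = False) -> str:
    """Join two symbols with a tie bar, without saying which is the nucleus."""
    return first + (TIE_BELOW if below else TIE_ABOVE) + second


def strip_non_syllabic(transcription: str) -> str:
    """Drop the diacritic where no contrastive vowel sequence exists."""
    return transcription.replace(NON_SYLLABIC, "")


if __name__ == "__main__":
    for char in (NON_SYLLABIC, TIE_ABOVE, TIE_BELOW):
        # Print the code point and its official Unicode character name.
        print(f"U+{ord(char):04X}", unicodedata.name(char))
    print(mark_offglide("a", "ɪ"), tie("a", "ɪ"), tie("a", "ɪ", below=True))
```

Because the diacritic is a separate code point following the vowel it modifies, omitting it (as in ⟨haɪ kaʊ⟩) amounts to a simple character removal.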
Falling and rising
Falling (or descending) diphthongs start with a vowel quality of higher prominence (higher pitch or volume) and end in a semivowel with less prominence, like [aɪ̯] in eye, while rising (or ascending) diphthongs begin with a less prominent semivowel and end with a more prominent full vowel, similar to the [ja] in yard. (Note that "falling" and "rising" in this context do not refer to vowel height; for that, the terms "opening" and "closing" are used instead. See below.) The less prominent component in the diphthong may also be transcribed as an approximant, thus [aj] in eye and [ja] in yard. However, when the diphthong is analysed as a single phoneme, both elements are often transcribed with vowel symbols (/aɪ̯/, /ɪ̯a/). Note also that semivowels and approximants are not equivalent in all treatments, and in the English and Italian languages, among others, many phoneticians do not consider rising combinations to be diphthongs, but rather sequences of approximant and vowel. There are many languages (such as Romanian) that contrast one or more rising diphthongs with similar sequences of a glide and a vowel in their phonetic inventory (see semivowel for examples).
Closing, opening, and centering
In closing diphthongs, the second element is more close than the first (e.g. [ai]); in opening diphthongs, the second element is more open (e.g. [ia]). Closing diphthongs tend to be falling ([ai̯]), and opening diphthongs are generally rising ([i̯a]), as open vowels are more sonorous and therefore tend to be more prominent. However, exceptions to this rule are not rare in the world's languages. In Finnish, for instance, the opening diphthongs /ie̯/ and /uo̯/ are true falling diphthongs, since they begin louder and with higher pitch and fall in prominence during the diphthong.
A third, rare type of diphthong, neither opening nor closing, is the height-harmonic diphthong, in which both elements are at the same vowel height. These occurred in Old English:
- beon [beo̯n] "be"
- ceald [kæɑ̯ld] "cold"
A centering diphthong is one that begins with a more peripheral vowel and ends with a more central one, such as [ɪə̯], [ɛə̯], and [ʊə̯] in Received Pronunciation or [iə̯] and [uə̯] in Irish. Many centering diphthongs are also opening diphthongs ([iə̯], [uə̯]).
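The closing, opening, height-harmonic, and centering categories can be illustrated with a small classifier. This is only a sketch under loud assumptions: the three-level `HEIGHT` scale and the `CENTRAL` set are simplified, hypothetical stand-ins for a full vowel chart, and `classify` is an illustrative name rather than an established procedure.

```python
# Coarse, hypothetical height scale: 3 = close (high), 2 = mid, 1 = open (low).
HEIGHT = {
    "i": 3, "ɪ": 3, "u": 3, "ʊ": 3,
    "e": 2, "ɛ": 2, "o": 2, "ə": 2,
    "a": 1, "æ": 1, "ɑ": 1,
}
CENTRAL = {"ə"}          # assumed set of central vowels for this sketch
NON_SYLLABIC = "\u032f"  # inverted breve below, ignored when classifying


def classify(first: str, second: str) -> set[str]:
    """Return the labels that apply to a two-element diphthong."""
    first = first.replace(NON_SYLLABIC, "")
    second = second.replace(NON_SYLLABIC, "")
    h1, h2 = HEIGHT[first], HEIGHT[second]
    labels = set()
    if h2 > h1:
        labels.add("closing")          # second element is closer
    elif h2 < h1:
        labels.add("opening")          # second element is more open
    else:
        labels.add("height-harmonic")  # both elements at the same height
    if second in CENTRAL and first not in CENTRAL:
        labels.add("centering")        # peripheral start, central end
    return labels


if __name__ == "__main__":
    for pair in [("a", "i"), ("i", "a"), ("e", "o"), ("i", "ə")]:
        print(pair, sorted(classify(*pair)))
```

With these assumed heights, [iə̯] receives both the opening and the centering label, which matches the observation that many centering diphthongs are also opening diphthongs.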
Diphthongs may contrast in how far they open or close. For example, Samoan contrasts low-to-mid with low-to-high diphthongs:
- ’ai [ʔai̯] 'probably'
- ’ae [ʔae̯] 'but'
- ’auro [ʔau̯ɾo] 'gold'
- ao [ao̯] 'a cloud'
Narrow and wide
Narrow diphthongs end with a vowel that lies quite close on a vowel chart to the one that begins the diphthong, for example Northern Dutch [eɪ], [øʏ] and [oʊ]. Wide diphthongs are the opposite: they require a greater tongue movement, and their offsets are farther away from their starting points on the vowel chart. Examples of wide diphthongs are RP/GA English [aɪ] and [aʊ].
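The narrow/wide distinction can be pictured as a distance between two points on a vowel chart. The sketch below is purely illustrative: the `CHART` coordinates are rough, hypothetical grid positions (backness, height) rather than measured formant values, and `WIDE_THRESHOLD` is an arbitrary cut-off assumed for the example.

```python
import math

# Hypothetical grid positions: (backness, height), with backness 0 = front,
# 4 = back, and height 0 = open, 6 = close. Rounding is ignored.
CHART = {
    "a": (0, 0),
    "e": (0, 4), "ø": (0, 4),
    "ɪ": (1, 5), "ʏ": (1, 5),
    "o": (4, 4),
    "ʊ": (3, 5),
}
WIDE_THRESHOLD = 3.0  # arbitrary assumption, not an established value


def glide_distance(start: str, end: str) -> float:
    """Euclidean distance between onset and offset on the assumed chart."""
    (b1, h1), (b2, h2) = CHART[start], CHART[end]
    return math.hypot(b2 - b1, h2 - h1)


def width(start: str, end: str) -> str:
    """Label a diphthong 'narrow' or 'wide' by the distance travelled."""
    return "wide" if glide_distance(start, end) >= WIDE_THRESHOLD else "narrow"


if __name__ == "__main__":
    for start, end in [("e", "ɪ"), ("ø", "ʏ"), ("o", "ʊ"), ("a", "ɪ"), ("a", "ʊ")]:
        print(start + end, width(start, end), round(glide_distance(start, end), 2))
```

Under these assumed coordinates, the Northern Dutch examples come out as narrow and the English [aɪ] and [aʊ] as wide; a different threshold or chart would shift the boundary.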
Languages differ in the length of diphthongs, measured in terms of morae. In languages with phonemically short and long vowels, diphthongs typically behave like long vowels, and are pronounced with a similar length. In languages with only one phonemic length for pure vowels, however, diphthongs may behave like pure vowels. For example, in Icelandic, both monophthongs and diphthongs are pronounced long before single consonants and short before most consonant clusters.
Some languages contrast short and long diphthongs. In some languages, such as Old English, these behave like short and long vowels, occupying one and two morae, respectively. Languages that contrast three quantities in diphthongs are extremely rare, but not unheard of; Northern Sami is known to contrast long, short and "finally stressed" diphthongs, the last of which are distinguished by a long second element.
In some languages, diphthongs are single phonemes, while in others they are analyzed as sequences of two vowels, or of a vowel and a semivowel.
Certain sound changes relate to diphthongs and monophthongs. Vowel breaking or diphthongization is a vowel shift in which a monophthong becomes a diphthong. Monophthongization or smoothing is a vowel shift in which a diphthong becomes a monophthong.
Difference from a vowel and semivowel
While there are a number of similarities, diphthongs are not the same phonologically as a combination of a vowel and an approximant or glide. Most importantly, diphthongs are fully contained in the syllable nucleus while a semivowel or glide is restricted to the syllable boundaries (either the onset or the coda). This often manifests itself phonetically by a greater degree of constriction, but the phonetic distinction is not always clear. The English word yes, for example, consists of a palatal glide followed by a monophthong rather than a rising diphthong. In addition, the segmental elements must be different in diphthongs [ii̯] and so when it occurs in a language, it does not contrast with [iː]. However, it is possible for languages to contrast [ij] and [iː].
In words coming from Middle English, most cases of the Modern English diphthongs [aɪ̯, oʊ̯, eɪ̯, aʊ̯] originate from the Middle English long monophthongs [iː, ɔː, aː, uː] through the Great Vowel Shift, although some cases of [oʊ̯, eɪ̯] originate from the Middle English diphthongs [ɔu̯, aɪ̯]. Due to complex regional variation Hiberno-English diphthongs are not enumerated below.
|RP (British)||Australian||North American|
- In Scottish, Upper Midwestern, and California English, /oʊ̯/ is monophthongal [oː].
- In Pittsburgh English, /aʊ̯/ is monophthongal [aː], leading to the stereotypical spelling "Dahntahn" for "downtown".
- Canadian English and some dialects of northern American English exhibit allophony of /aʊ̯/ and /aɪ̯/ called Canadian raising – in some places they have become separate phonemes. GA and RP have raising to a lesser extent in /aɪ̯/.
- In several American dialects such as Southern American English, /aɪ̯/ becomes monophthongal [aː] except before voiceless consonants.
- The erstwhile monophthongs /iː/ and /uː/ are diphthongized in many dialects. In many cases they might be better transcribed as [uu̯] and [ii̯], where the non-syllabic element is understood to be closer than the syllabic element. They are sometimes transcribed /uw/ and /ij/.
- Most Australian English speakers monophthongize "-ee-" vowels. However, Western Australian English is an exception, as it generally features centring diphthongs in words like fear and beard. See: Macquarie University, 2010, Regional Accents (30 January 2015).
- In rhotic dialects, words like pair, poor, and peer can be analyzed as diphthongs, although other descriptions analyze them as vowels with [ɹ] in the coda.
- In Received Pronunciation, the vowels in lair and lure may be monophthongized to [ɛː] and [oː] respectively (Roach (2004:240)).
- [eɪ̯], [øʏ̯], and [oʊ̯] are normally pronounced as closing diphthongs except when preceding [ɾ], in which case they are either centering diphthongs ([eə̯], [øə̯], and [oə̯]) or are lengthened and monophthongized to [ɪː], [øː], and [ʊː].
The dialect of Hamont (in Limburg) has five centring diphthongs and contrasts long and short forms of [ɛɪ̯], [œʏ̯], [ɔʊ̯], and [ɑʊ̯].
Phonemic diphthongs in German:
- /aɪ̯/ as in Ei ‘egg’
- /aʊ̯/ as in Maus ‘mouse’
- /ɔʏ̯/ as in neu ‘new’
In the varieties of German that vocalize the /r/ in the syllable coda, other diphthongal combinations may occur. These are only phonetic diphthongs, not phonemic diphthongs, since the vocalic pronunciation [ɐ̯] alternates with consonantal pronunciations of /r/ if a vowel follows, cf. du hörst [duː ˈhøːɐ̯st] ‘you hear’ – ich höre [ʔɪç ˈhøːʀə] ‘I hear’. These phonetic diphthongs may be as follows:
|/oːr/||[oːɐ̯]1||[tʰoːɐ̯]||Tor||gate/goal (in football)|
- ^1 Wiese (1996) notes that the length contrast is not very stable before non-prevocalic /r/ and that "Meinhold & Stock (1980:180), following the pronouncing dictionaries (Mangold (1990), Krech & Stötzer (1982)) judge the vowel in Art, Schwert, Fahrt to be long, while the vowel in Ort, Furcht, hart is supposed to be short. The factual basis of this presumed distinction seems very questionable." He goes on to state that in his own dialect, there is no length difference in these words, and that judgements on vowel length in front of non-prevocalic /r/ which is itself vocalized are problematic, in particular if /a/ precedes.
- According to the 'lengthless' analysis, the aforementioned 'long' diphthongs are analyzed as [iɐ̯], [yɐ̯], [uɐ̯], [eɐ̯], [øɐ̯], [oɐ̯], [ɛɐ̯] and [aɐ̯]. This makes non-prevocalic /aːr/ and /ar/ homophonous as [aɐ̯] or [aː]. Non-prevocalic /ɛːr/ and /ɛr/ may also merge, but the vowel chart in Kohler (1999:88) shows that they have somewhat different starting points.
- Wiese (1996) also states that "laxing of the vowel is predicted to take place in shortened vowels; it does indeed seem to go hand in hand with the vowel shortening in many cases."
The diphthongs of some German dialects differ from standard German diphthongs. The Bernese German diphthongs, for instance, correspond rather to the Middle High German diphthongs than to standard German diphthongs:
- /iə̯/ as in lieb ‘dear’
- /uə̯/ as in guet ‘good’
- /yə̯/ as in müed ‘tired’
- /ei̯/ as in Bei ‘leg’
- /ou̯/ as in Boum ‘tree’
- /øi̯/ as in Böim ‘trees’
Apart from these phonemic diphthongs, Bernese German has numerous phonetic diphthongs due to L-vocalization in the syllable coda, for instance the following ones:
- [au̯] as in Stau ‘stable’
- [aːu̯] as in Staau ‘steel’
- [æu̯] as in Wäut ‘world’
- [æːu̯] as in wääut ‘elects’
- [ʊu̯] as in tschúud ‘guilty’
Yiddish has three diphthongs:
- [ɛɪ̯] as in [plɛɪ̯tə] פּליטה ('refugee' f.)
- [aɛ̯] as in [naɛ̯n] נײַן ('nine')
- [ɔə̯] as in [ɔə̯fn̩] אופֿן ('way')
Diphthongs may reach a higher target position (towards /i/) in situations of coarticulatory phenomena or when words with such vowels are being emphasized.
There are five diphthongs in the Oslo dialect of Norwegian, all of them falling:
- [æɪ] as in nei, "no"
- [œʷʏʷ] as in øy, "island"
- [æʉ͍] as in sau, "sheep"
- [ɑɪ] as in hai, "shark"
- [ɔʷʏʷ] as in joik, "Sami song"
An additional diphthong, [ʉ͍ɪ], occurs only in the word hui in the expression i hui og hast "in great haste". The number and form of diphthongs vary between dialects.
Diphthongs in Faroese are:
- /ai/ as in bein (can also be short)
- /au/ as in havn
- /ɛa/ as in har, mær
- /ɛi/ as in hey
- /ɛu/ as in nevnd
- /œu/ as in nøvn
- /ʉu/ as in hús
- /ʊi/ as in mín, bý, ið (can also be short)
- /ɔa/ as in ráð
- /ɔi/ as in hoyra (can also be short)
- /ɔu/ as in sól, ovn
Diphthongs in Icelandic are the following:
- /au̯/ as in átta, "eight"
- /ou̯/ as in nóg, "enough"
- /øy/ as in auga, "eye"
- /ai̯/ as in kær, "dear"
- /ei̯/ as in þeir, "they"
- /ɔi/ as in koja, "bunk bed", "berth" (rare, only in a handful of words)
Combinations of semivowel /j/ and a vowel are the following:
- /jɛ/ as in éta, "eat"
- /ja/ as in jata, "manger"
- /jau̯/ as in já, "yes"
- /jo/ as in joð, "iodine", "jay", "yod" (only in a handful of words of foreign origin)
- /jou̯/ as in jól, "Christmas"
- /jœ/ as in jötunn, "giant"
- /jai̯/ as in jæja, "oh well"
- /ju/ as in jú, "yes"
In French, /wa/, /wɛ̃/, /ɥi/ and /ɥɛ̃/ may be considered true diphthongs (that is, fully contained in the syllable nucleus: [u̯a], [u̯ɛ̃], [y̯i], [y̯ɛ̃]). Other sequences are considered part of a glide formation process that turns a high vowel into a semivowel (and part of the syllable onset) when followed by another vowel.
- /wa/ [u̯a] as in roi "king"
- /wɛ̃/ [u̯ɛ̃] as in groin "muzzle"
- /ɥi/ [y̯i] as in huit "eight"
- /ɥɛ̃/ [y̯ɛ̃] as in juin "June"
- /wi/ as in oui "yes"
- /jɛ̃/ as in lien "bond"
- /jɛ/ as in Ariège
- /aj/ as in travail "work"
- /ɛj/ as in Marseille
- /ij/ as in bille "ball"
- /œj/ as in feuille "leaf"
- /uj/ as in grenouille "frog"
- /jø/ as in vieux "old"
- [ɑɔ̯] as in tard "late"
- [aɛ̯] as in père "father"
- [aœ̯] as in fleur "flower"
- [ou̯] as in autre "other"
- [øy̯] as in neutre "neutral"
- [ãʊ̯̃] as in banque "bank"
- [ẽɪ̯̃] as in mince "thin"
- [ɒ̃ʊ̯̃] as in bon "well"
- [œ̃ʏ̯̃] as in un "one"
Catalan possesses a number of phonetic diphthongs, all of which begin (rising diphthongs) or end (falling diphthongs) in [j] or [w].
|[əj]||mainada||'children'||[əw]||caurem||'we will fall'|
|[uj]||avui||'today'||[uw]||duu||'he/she is carrying'|
|[jə]||feia||'he/she was doing'||[wə]||qüestió||'question'|
- [j] in word initial position, e.g. iogurt.
- Both occur between vowels as in feia and veiem.
- In the sequences [ɡw] or [kw] and vowel, e.g. guant, quota, qüestió, pingüí (these exceptional cases even lead some scholars to hypothesize the existence of rare labiovelar phonemes /ɡʷ/ and /kʷ/).
There are also certain instances of compensatory diphthongization in the Majorcan dialect so that /ˈtroncs/ ('logs') (in addition to deleting the palatal plosive) develops a compensating palatal glide and surfaces as [ˈtrojns] (and contrasts with the unpluralized [ˈtronʲc]). Diphthongization compensates for the loss of the palatal stop (part of Catalan's segment loss compensation). There are other cases where diphthongization compensates for the loss of point of articulation features (property loss compensation) as in [ˈaɲ] ('year') vs [ˈajns] ('years'). The dialectal distribution of this compensatory diphthongization is almost entirely dependent on the dorsal plosive (whether it is velar or palatal) and the extent of consonant assimilation (whether or not it is extended to palatals).
The Portuguese diphthongs are formed by the labio-velar approximant [w] and palatal approximant [j] with a vowel. European Portuguese has 14 phonemic diphthongs (10 oral and 4 nasal), all of which are falling diphthongs formed by a vowel and a nonsyllabic high vowel. Brazilian Portuguese has roughly the same number, although the European and non-European dialects have slightly different pronunciations ([ɐj] is a distinctive feature of some southern and central Portuguese dialects, especially that of Lisbon). A [w] onglide after /k/ or /ɡ/ and before all vowels as in quando [ˈkwɐ̃du] ('when') or guarda [ˈɡwaɾðɐ ~ ˈɡwaʁdɐ] ('guard') may also form rising diphthongs and triphthongs. Additionally, in casual speech, adjacent heterosyllabic vowels may combine into diphthongs and triphthongs or even sequences of them.
In addition, phonetic diphthongs are formed in most Brazilian Portuguese dialects by the vocalization of /l/ in the syllable coda with words like sol [sɔw] ('sun') and sul [suw] ('south') as well as by yodization of vowels preceding /s/ or its allophone at syllable coda [ʃ ~ ɕ] in terms like arroz [aˈʁojs ~ ɐˈʁo(j)ɕ] ('rice'), and /z/ (or [ʒ ~ ʑ]) in terms such as paz mundial [ˈpajz mũdʒiˈaw ~ ˈpa(j)ʑ mũdʑiˈaw] ('world peace') and dez anos [ˌdɛjˈz‿ɐ̃nu(j)s ~ ˌdɛjˈz‿ɐ̃nuɕ] ('ten years').
Phonetically, Spanish has seven falling diphthongs and eight rising diphthongs. In addition, during fast speech, sequences of vowels in hiatus become diphthongs wherein one becomes non-syllabic (unless they are the same vowel, in which case they fuse together) as in poeta [ˈpo̯eta] ('poet') and maestro [ˈmae̯stɾo] ('teacher'). The Spanish diphthongs are:
|[ei̯]||potei||'I could' (past tense)||[eu̯]||pleurite||'pleurisy'|
The second table includes only 'false' diphthongs, composed of a semivowel + a vowel, not two vowels. The situation is more nuanced in the first table: a word such as 'baita' is actually pronounced ['baj.ta] and most speakers would syllabify it that way. A word such as 'voi' would instead be pronounced and syllabified as ['vo.i], yet again without a diphthong.
In general, unstressed /i e o u/ in hiatus can turn into glides in more rapid speech (e.g. biennale [bi̯enˈnaːle] 'biennial'; coalizione [ko̯alitˈtsi̯oːne] 'coalition') with the process occurring more readily in syllables further from stress.
Romanian has two true diphthongs: /e̯a/ and /o̯a/. There are, however, a host of other vowel combinations (more than any other major Romance language) which are classified as vowel glides. As a result of their origin (diphthongization of mid vowels under stress), the two true diphthongs appear only in stressed syllables and make morphological alternations with the mid vowels /e/ and /o/. To native speakers, they sound very similar to /ja/ and /wa/ respectively. There are no perfect minimal pairs to contrast /o̯a/ and /wa/, and because /o̯a/ does not appear in the final syllable of a prosodic word, there are no monosyllabic words with /o̯a/; exceptions might include voal ('veil') and trotuar ('sidewalk'), though Ioana Chițoran argues that these are best treated as containing glide-vowel sequences rather than diphthongs. In addition to these, the semivowels /j/ and /w/ can be combined (either before, after, or both) with most vowels. While this arguably forms additional diphthongs and triphthongs, only /e̯a/ and /o̯a/ can follow an obstruent-liquid cluster such as in broască ('frog') and dreagă ('to mend'), implying that /j/ and /w/ are restricted to the syllable boundary and therefore, strictly speaking, do not form diphthongs.
All Irish diphthongs are falling.
- [əi̯], spelled aigh, aidh, agh, adh, eagh, eadh, eigh, or eidh
- [əu̯], spelled abh, amh, eabh, or eamh
- [iə̯], spelled ia, iai
- [uə̯], spelled ua, uai
There are 9 diphthongs in Scottish Gaelic. Group 1 occur anywhere (eu is usually [eː] before -m, e.g. Seumas). Group 2 are reflexes that occur before -ll, -m, -nn, -bh, -dh, -gh and -mh.
|2||[ai]||ai||saill "grease", cainnt "speech", aimhreit "riot"|
|[ɤi]||oi, ei, ai||loinn "badge", greim "bite", saighdear "soldier"|
|[ɯi]||ui, aoi||druim "back", aoibhneas "joy"|
|[au]||a, ea||cam "crooked", ceann "head"|
|[ɔu]||o||tom "mound", donn "brown"|
For more detailed explanations of Gaelic diphthongs see Scottish Gaelic orthography.
Welsh is traditionally divided into Northern and Southern dialects. In the north, some diphthongs may be short or long according to regular vowel length rules but in the south they are always short (see Welsh phonology). Southern dialects tend to simplify diphthongs in speech (e.g. gwaith /ɡwaiθ/ is reduced to /ɡwaːθ/).
|aw||/au, ɑːu/||/au/||mawr 'big'|
|ei||/əi/||/əi/||gweithio 'to work'|
|ew||/ɛu, eːu/||/ɛu/||tew 'fat'|
|oe||/ɔɨ, ɔːɨ/||/ɔi/||moel 'bald'|
|ow||/ɔu, oːu/||/ɔu/||brown 'brown'|
|wy||/ʊɨ, uːɨ/||/ʊi/||pwyll 'sense'|
- † The plural ending -au is reduced to /a/ in the north and /e/ in the south, e.g. cadau 'battles' is /ˈkada/ (north) or /ˈkade/ (south).
There are three diphthongs in Czech:
- /aʊ̯/ as in auto (almost exclusively in words of foreign origin)
- /eʊ̯/ as in euro (in words of foreign origin only)
- /oʊ̯/ as in koule
The vowel groups ia, ie, ii, io, and iu in foreign words are not regarded as diphthongs; they are pronounced with /j/ between the vowels [ɪja, ɪjɛ, ɪjɪ, ɪjo, ɪju].
is conventionally considered a diphthong. However, it is actually [ie] in hiatus or separated by a semivowel, [ije].
Some Serbo-Croatian dialects also have uo, as in kuonj, ruod, uon whereas, in Standard Croatian and Serbian, these words are konj, rod, on.
All nine vowels can appear as the first component of an Estonian diphthong, but only [ɑ e i o u] occur as the second component.
"in spite of"
"face" (s. possessive)
There are additional diphthongs less commonly used, such as [eu] in Euroopa (Europe), [øɑ] in söandama (to dare), and [æu] in näuguma (to mew).
All Finnish diphthongs are falling. Notably, Finnish has true opening diphthongs (e.g. /uo/), which are not very common crosslinguistically compared to centering diphthongs (e.g. /uə/ in English). Vowel combinations across syllables may in practice be pronounced as diphthongs, when an intervening consonant has elided, as in näön [næøn] instead of [næ.øn] for the genitive of näkö ('sight').
- [ɑi̯] as in laiva (ship)
- [ei̯] as in keinu (swing)
- [oi̯] as in poika (boy)
- [æi̯] as in äiti (mother)
- [øi̯] as in öisin (at nights)
- [ɑu̯] as in lauha (mild)
- [eu̯] as in leuto (mild)
- [ou̯] as in koulu (school)
- [ey̯] as in leyhyä (to waft)
- [æy̯] as in täysi (full)
- [øy̯] as in löytää (to find)
- [ui̯] as in uida (to swim)
- [yi̯] as in lyijy (lead)
- [iu̯] as in viulu (violin)
- [iy̯] as in siistiytyä (to smarten up)
- [ie̯] as in kieli (tongue)
- [uo̯] as in suo (bog)
- [yø̯] as in yö (night)
The diphthong system in Northern Sami varies considerably from one dialect to another. The Western Finnmark dialects distinguish four different qualities of opening diphthongs:
- /eæ/ as in leat "to be"
- /ie/ as in giella "language"
- /oa/ as in boahtit "to come"
- /uo/ as in vuodjat "to swim"
In terms of quantity, Northern Sami shows a three-way contrast between long, short and finally stressed diphthongs. The last are distinguished from long and short diphthongs by a markedly long and stressed second component. Diphthong quantity is not indicated in spelling.
Maltese has seven falling diphthongs, though they may be considered VC sequences phonemically.
- [ɛɪ̯] ej or għi
- [ɐɪ̯] aj or għi
- [ɔɪ̯] oj
- [ɪʊ̯] iw
- [ɛʊ̯] ew
- [ɐʊ̯] aw or għu
- [ɔʊ̯] ow or għu
Rising sequences in Mandarin are usually regarded as a combination of a medial semivowel ([j], [w], or [ɥ]) plus a vowel, while falling sequences are regarded as one diphthong.
- ai: [ai̯], as in ài (愛, love)
- ei: [ei̯], as in lèi (累, tired)
- ao: [au̯], as in dào (道, way)
- ou: [ou̯], as in dòu (豆, bean)
Cantonese has eleven diphthongs.
- aai: [aːi̯], as in gaai1 (街, street)
- aau: [aːu̯], as in baau3 (爆, explode)
- ai: [ɐi̯], as in gai1 (雞, chicken)
- au: [ɐu̯], as in au1 (勾, hook)
- ei: [ei̯], as in gei1 (機, machine)
- eu: [ɛːu̯], as in deu6 (掉, throw)
- iu: [iːu̯], as in giu3 (叫, call)
- oi: [ɔːy̯], as in oi3 (愛, love)
- ou: [ou̯], as in gou1 (高, high)
- ui: [uːy̯], as in pui4 (陪, accompany)
- eui: [ɵy̯], as in zeoi3 (醉, drunk)
In addition to vowel nuclei following or preceding /j/ and /w/, Thai has three diphthongs which exist as long-short pairs:
- เอีย ia [iːa̯, ia̯]
- เอือ üa [ɯːa̯, ɯa̯]
- อัว ua [uːa̯, ua̯]
In addition to vowel nuclei following or preceding /j/ and /w/, Vietnamese has three diphthongs:
- [iə̯] ia~iê
- [ɨə̯] ưa~ươ
- [uə̯] ua~uô
Khmer has a rich vowel system, with an additional distinction of long and short register in both its vowels and its diphthongs.
Zulu has only monophthongs. Y and w are semi-vowels:
- [ja] as in [ŋijaɠuˈɓɛːɠa] ngiyakubeka (I am placing it)
- [wa] as in [ŋiːwa] ngiwa (I fall/I am falling)
- /ai̯/: balairung ('hall'), kedai ('shop'), pandai ('clever')
- /au̯/: autodidak ('autodidact'), taufik (Indonesian first name), kerbau ('buffalo'), limau ('lemon')
- /oi̯/ (or /ʊi̯/ in Indonesian): boikot ('boycott'), amboi (an expression when amazed)
- /ei̯/: eigendom ('property'), survei ('survey')
- "diphthong". Dictionary.com Unabridged. Random House.
- "Definition of DIPHTHONG". www.merriam-webster.com.
- definition of 'Diphthong' on SIL International, accessed 17 January 2008
- FileFormat.Info, page on combining inverted breve below
- Used e.g. by Donaldson, Bruce C. (1993), "1. Pronunciation", A Grammar of Afrikaans, Mouton de Gruyter, pp. 8–9, ISBN 9783110134261 The author states that the Afrikaans diphthongs /eə øə oə/ can be transcribed /eᵊ øᵊ oᵊ/.
- Used e.g. by Mangold, Max (2005), Das Aussprachewörterbuch (6th ed.), Duden, pp. 36–37, ISBN 978-3411040667. The author transcribes the diphthongs ⟨ai au eu⟩ as [a͜i a͜u ɔ͜y]. However, on page 36, he admits that phonetically, [aɪ̯ aʊ̯ ɔʏ̯] are more precise symbols.
- Battisti (2000) Fonetica generale, p 224
- E.g. Allen & Hawkins (1978) Development of Phonological Rhythm contrast ⟨aɪ⟩ from ⟨a͜ɪ⟩ from ⟨aᶦ⟩
- Chițoran (2002a:203)
- Crystal, David (2008). Dictionary of Linguistics and Phonetics. Wiley. pp. diphthong.
- Richard M. Hogg, Norman Blake, R. W. Burchfield, The Cambridge History of the English Language, CUP 1992, p. 49.
- Mangrio, Riaz Ahmed (22 June 2016). The Morphology of Loanwords in Urdu: The Persian, Arabic and English Strands. Cambridge Scholars Publishing. ISBN 9781443896634.
- Kaye & Lowenstamm (1984:139)
- Schane (1995:588)
- Padgett (2007:1938)
- Schane (1995:606)
- Schane (1995:589, 606)
- Gussenhoven (1992:46)
- Verhoeven (2005:245)
- Verhoeven (2007:221)
- Wiese (1996:198)
- Also supported by Tröster-Mutz (2011:20).
- Kleine (2003:263)
- Chițoran (2001:11)
- Carbonell & Llisterri (1992:54)
- Institut d'Estudis Catalans Archived 30 September 2010 at the Wayback Machine Els diftongs, els triftongs i els hiats – Gramàtica de la Llengua Catalana (provisional draft)
- e.g. Lleó (1970), Wheeler (1979)
- Wheeler (2005:101)
- Mascaró (2002:580–581)
- Mascaró (2002:581)
- Faria (2003:7)
- Cruz-Ferreira (1995:92)
- Barbosa & Albano (2004:230)
- Martínez-Celdrán, Fernández-Planas & Carrera-Sabaté (2003:256)
- Azevedo, Milton M. (2004). Introducción a la lingüística española (in Spanish) (2nd ed.). Upper Saddle River, NJ: Prentice Hall. ISBN 0-13-110959-6.
- Bertinetto & Loporcaro (2005:138)
- Bertinetto & Loporcaro (2005:139)
- Chițoran (2002a:204)
- Chițoran (2002a:206)
- Chițoran (2002b:217)
- See Chițoran (2001:8–9) for a brief overview of the views regarding Romanian semivowels
- Chițoran (2002b:213)
- (in Croatian) Vjesnik Archived 21 November 2000 at Archive.today Babić ne zagovara korijenski pravopis, nego traži da Hrvati ne piju mlijeko nego – mlieko
- Josip Lisac. "Štokavsko narječje: prostiranje i osnovne značajke". Kolo (in Croatian). Archived from the original on 17 February 2008.
- Borg & Azzopardi-Alexander (1997:299)
- Tingsabadh & Abramson (1993:25)
- Minister of Education and Culture Decree No: 50/2015, Jakarta, 2015.
- Barbosa, Plínio A.; Albano, Eleonora C. (2004), "Brazilian Portuguese", Journal of the International Phonetic Association, 34 (2): 227–232, doi:10.1017/S0025100304001756
- Bertinetto, Pier Marco; Loporcaro, Michele (2005), "The sound pattern of Standard Italian, as compared with the varieties spoken in Florence, Milan and Rome", Journal of the International Phonetic Association, 35 (2): 131–151, doi:10.1017/S0025100305002148
- Borg, Albert J.; Azzopardi-Alexander, Marie (1997), Maltese, Routledge, ISBN 0-415-02243-6
- Carbonell, Joan F.; Llisterri, Joaquim (1992), "Catalan", Journal of the International Phonetic Association, 22 (1–2): 53–56, doi:10.1017/S0025100300004618
- Chițoran, Ioana (2001), The Phonology of Romanian: A Constraint-based Approach, Berlin & New York: Mouton de Gruyter, ISBN 3-11-016766-2
- Chițoran, Ioana (2002a), "A perception-production study of Romanian diphthongs and glide-vowel sequences", Journal of the International Phonetic Association, 32 (2): 203–222, doi:10.1017/S0025100302001044
- Chițoran, Ioana (2002b), "The phonology and morphology of Romanian diphthongization" (PDF), Probus, 14 (2): 205–246, doi:10.1515/prbs.2002.009
- Cruz-Ferreira, Madalena (1995), "European Portuguese", Journal of the International Phonetic Association, 25 (2): 90–94, doi:10.1017/S0025100300005223
- Faria, Arlo (2003), Applied Phonetics: Portuguese Text-to-Speech, University of California, Berkeley, CiteSeerX 10.1.1.134.8785
- Gussenhoven, Carlos (1992), "Dutch", Journal of the International Phonetic Association, 22 (2): 45–47, doi:10.1017/S002510030000459X
- Kaye, Jonathan; Lowenstamm, Jean (1984), "De la syllabicité", in Dell, François; Vergnaud, Jean-Roger; Hirst, Daniel (eds.), La forme sonore du langage, Paris: Hermann, pp. 123–159, ISBN 9782705614119
- Kleine, Ane (2003), "Standard Yiddish", Journal of the International Phonetic Association, 33 (2): 261–265, doi:10.1017/S0025100303001385
- Kohler, Klaus J. (1999), "German", Handbook of the International Phonetic Association: A guide to the use of the International Phonetic Alphabet, Cambridge: Cambridge University Press, pp. 86–89, doi:10.1017/S0025100300004874, ISBN 0-521-65236-7
- Krech, Eva Maria; Stötzer, Ursula (1982), Großes Wörterbuch der deutschen Aussprache, Leipzig: VEB Bibliographisches Institut, ISBN 978-3323001404
- Martínez-Celdrán, Eugenio; Fernández-Planas, Ana Ma.; Carrera-Sabaté, Josefina (2003), "Castilian Spanish", Journal of the International Phonetic Association, 33 (2): 255–259, doi:10.1017/S0025100303001373
- Mangold, Max (1990). Das Aussprachewörterbuch (in German) (3rd ed.). Dudenverlag. ISBN 3-411-20916-X.
- Mascaró, Joan (1976), Catalan Phonology and the Phonological Cycle (Doctoral thesis), Massachusetts Institute of Technology, retrieved 12 December 2013
- Meinhold, Gottfried; Stock, Eberhard (1980), Phonologie der deutschen Gegenwartssprache, Leipzig: VEB Bibliographisches Institut
- Peters, Jörg (2010), "The Flemish–Brabant dialect of Orsmaal–Gussenhoven", Journal of the International Phonetic Association, 40 (2): 239–246, doi:10.1017/S0025100310000083
- Roach, Peter (2004), "British English: Received Pronunciation", Journal of the International Phonetic Association, 34 (2): 239–245, doi:10.1017/S0025100304001768
- Padgett, Jaye (2007), "Glides, Vowels, and Features", Lingua, 118 (12): 1937–1955, doi:10.1016/j.lingua.2007.10.002
- Schane, Sanford (1995), "Diphthongization in Particle Phonology", in Goldsmith, John A. (ed.), The Handbook of Phonological Theory, Blackwell Handbooks in Linguistics, Blackwell, pp. 586–608
- Tingsabadh, M.R. Kalaya; Abramson, Arthur (1993), "Thai", Journal of the International Phonetic Association, 23 (1): 24–28, doi:10.1017/S0025100300004746
- Tröster-Mutz, Stefan (2011), Variation of vowel length in German (PDF), Groningen
- Verhoeven, Jo (2005), "Belgian Standard Dutch", Journal of the International Phonetic Association, 35 (2): 243–247, doi:10.1017/S0025100305002173
- Verhoeven, Jo (2007), "The Belgian Limburg dialect of Hamont", Journal of the International Phonetic Association, 37 (2): 219–225, doi:10.1017/S0025100307002940
- Verhoeven, Jo; Van Bael, C. (2002), "Akoestische kenmerken van de Nederlandse klinkers in drie Vlaamse regio's", Taal en Tongval, 54: 1–23
- Wiese, Richard (1996), The Phonology of German, Oxford: Oxford University Press, ISBN 0-19-824040-6