Phoneme

In a language or dialect, a phoneme (from the Greek: φώνημα, phōnēma, "a sound uttered") is the smallest segmental unit of sound employed to form meaningful contrasts between utterances.[1]

Thus a phoneme is a sound or a group of different sounds perceived to have the same function by speakers of the language or dialect in question. An example of a phoneme is the /k/ sound in the words kit and skill. (In transcription, phonemes are placed between slashes, as here.) Although most native English speakers don't notice this, in most English dialects, the /k/ sounds in these two words are actually pronounced differently: they are different speech sounds, or phones (which, in transcription, are placed in square brackets). In our example, the /k/ in kit is aspirated, [kʰ], while the /k/ in skill is unaspirated. These different sounds are nonetheless considered to belong to the same phoneme in English because, if an English speaker used one instead of the other, the meaning of the word would not change: using [kʰ] in skill might sound odd, but the word would still be recognized. By contrast, some other phonemes could be substituted (creating a minimal pair) which would cause a change in meaning: producing words like still (substituting /t/), spill (substituting /p/) and swill (substituting /w/). These other sounds (/t/, /p/ and /w/) are, in English, different phonemes.

In some languages, however, [kʰ] and [k] are different phonemes, and are perceived as such by the speakers of those languages. For example, in Icelandic, /kʰ/ is the first sound of kátur meaning 'cheerful', while /k/ is the first sound of gátur meaning 'riddles'. The fact that these two different words have different meanings which can be readily identified by speakers of Icelandic tells us that Icelandic speakers perceive the sounds as different phonemes.

Phones that belong to the same phoneme, such as [t] and [tʰ] for English /t/, are called allophones. A common test to determine whether two phones are allophones of the same phoneme or separate phonemes relies on finding minimal pairs: words that differ by only the phone in question. For example, the words tip and dip illustrate that in English [t] and [d] are separate phonemes, /t/ and /d/, in English: the two words have different meanings that are readily recognizable, meaning that English speakers can readily distinguish between the two sounds. In other languages, though, including Korean; there are no such pairs available. The lack of minimal pairs distinguishing [t] and [d] in Korean indicates that in this language they are allophones of a single phoneme /t/. (/tʰata/ is pronounced [tʰada], for example. That is, when they hear this one word, Korean speakers perceive the same sound in both the beginning and middle of the word, whereas an English speaker would perceive different sounds in these two locations.)

Some linguists (such as Roman Jakobson, Morris Halle, and Noam Chomsky) consider phonemes to be further decomposable into features, such features being the true minimal constituents of language. Features overlap each other in time, as do suprasegmental phonemes in oral language and many phonemes in sign languages. Features could be designated as acoustic (Jakobson) or articulatory (Halle & Chomsky) in nature.

Contents

Background and related ideas

The term phonème was reportedly first used by A. Dufriche-Desgenettes in 1873, but it referred only to a speech sound. The term phoneme as an abstraction was developed by the Polish linguist Jan Niecisław Baudouin de Courtenay and his student Mikołaj Kruszewski during 1875–1895. The term used by these two was fonema, the basic unit of what they called psychophonetics. The concept of the phoneme was then elaborated in the works of Nikolai Trubetzkoi and others of the Prague School (during the years 1926–1935), and in those of structuralists like Ferdinand de Saussure, Edward Sapir, and Leonard Bloomfield. Some structuralists wished to eliminate a cognitive or psycholinguistic function for the phoneme.

Later, it was also used in generative linguistics, most famously by Noam Chomsky and Morris Halle, and remains central to many accounts of the development of modern phonology. As a theoretical concept or model, though, it has been supplemented and even replaced by others.

In some languages, the term chroneme may be used for contrastive length or duration of phonemes. In languages in which tones are phonemic, the tone phonemes may be called tonemes. Not all scholars working on such languages use these terms.

The distinction between phonetic and phonemic systems gave rise of Kenneth Pike's concepts of Emic and etic description.

Notation

A transcription that only indicates the different phonemes of a language is said to be phonemic. In languages that are morphophonemic (vowels in particular) , pronunciations that correspond to the canonical alphabet pronunciations are called alphaphonemic.

Such transcriptions are enclosed within virgules (slashes), / /; these show that each enclosed symbol is claimed to be phonemically meaningful. On the other hand, a transcription that indicates finer detail, including allophonic variation like the two English L's, is said to be phonetic, and is enclosed in square brackets, [ ].

The common notation used in linguistics employs virgules (slashes) (/ /) around the symbol that stands for the phoneme. For example, the phoneme for the initial consonant in the word "phoneme" would be written as /f/. In other words, the graphemes are <ph>, but this digraph represents one sound /f/. Allophones, more phonetically specific descriptions of how a given phoneme might be commonly instantiated, are often denoted in linguistics by the use of diacritical or other marks added to the phoneme symbols and then placed in square brackets ([ ]) to differentiate them from the phoneme in slant brackets (/ /). The conventions of orthography are then kept separate from both phonemes and allophones by the use of angle brackets < > to enclose the spelling.

The symbols of the International Phonetic Alphabet (IPA) and extended sets adapted to a particular language are often used by linguists to write phonemes of oral languages, with the principle being one symbol equals one categorical sound. Due to problems displaying some symbols in the early days of the Internet, systems such as X-SAMPA and Kirshenbaum were developed to represent IPA symbols in plain text. As of 2004, any modern web browser can display IPA symbols (as long as the operating system provides the appropriate fonts), and we use this system in this article.

Usually, long vowels and consonants are represented either by a length indicator or doubling of the symbol in question.

Examples

Examples of phonemes in the English language include consonant plosives like /p/ and /b/. These two are most often written consistently with one letter for each sound. These phonemes, however, might not be so apparent in written English, for example when they are typically represented by a group of more than one letter, called a digraph, like <sh> (pronounced /ʃ/) or <ch> (pronounced /tʃ/).

For a list of the phonemes in the English language, see IPA for English.

Two sounds which are allophones (sound variants belonging to the same phoneme) in one language may belong to separate phonemes in another language or dialect. In English, for example, /p/ has aspirated and non-aspirated allophones: aspirated as in /pɪn/, and non-aspirated as in /spɪn/. However, in many languages (e. g. Chinese), aspirated /pʰ/ is a phoneme distinct from unaspirated /p/. As another example, there is no distinction between [r] and [l] in Japanese: there is only one /r/ phoneme, though it has various allophones that can sound more like [l], [ɾ], or [r] to English speakers. The sounds [z] and [s] are distinct phonemes in English, but allophones in Spanish. The sounds [n] (as in run) and [ŋ] (as in rung) are also sometimes considered phonemes in English, but allophones in Italian and Spanish.

An important phoneme is the chroneme, a phonemically-relevant extension of the duration of a consonant or vowel. Some languages or dialects such as Finnish or Japanese allow chronemes after both consonants and vowels. Others, like Australian English use it after only one (in this case, vowels).

Restricted phonemes

A restricted phoneme is a phoneme that can only occur in a certain environment: There are restrictions as to where it can occur. English has several restricted phonemes:

Biuniqueness

Biuniqueness is a property of the phoneme in classic structuralist phonemics. The biuniqueness definition states that every phonetic allophone must unambiguously be assigned to one and only one phoneme. In other words, there is a many-to-one allophone-to-phoneme mapping instead of a many-to-many mapping.

The notion of biuniqueness was controversial among some pre-generative linguists and was prominently challenged by Morris Halle and Noam Chomsky in the late 1950s and early 1960s.

The unworkable aspects of the concept soon become apparent if you consider the phenomenon of flapping in North American English. In the right environment, this flapping can change either /t/ or /d/ into the allophone [ɾ] for many affected speakers. Here, one allophone is clearly assigned to two phonemes.

Neutralization, archiphoneme, and underspecification

Phonemes that are contrastive in certain environments may not be contrastive in all environments. In the environments where they don't contrast, the contrast is said to be neutralized.

In English there are three nasal phonemes, /m, n, ŋ/, as shown by the minimal triplet,

/sʌm/ sum
/sʌn/ sun
/sʌŋ/ sung

With rare exceptions, these phonemes are not contrastive before plosives such as /p, t, k/ within the same morpheme. Although all three phones appear before plosives, for example in limp, lint, link ( /limp/, /lint/, /liŋk/), only one of these may appear before each of the plosives. That is, the /m, n, ŋ/ distinction is neutralized before each of the plosives /p, t, k/:

Thus these phonemes are not contrastive in these environments, and according to some theorists, there is no evidence as to what the underlying representation might be. If one hypothesizes that one is dealing with only a single underlying nasal, there is no reason to pick one of the three phonemes /m, n, ŋ/ over the other two.

(In some languages there is only one phonemic nasal anywhere, and due to obligatory assimilation, it surfaces as [m, n, ŋ] in just these environments, so this idea is not as far-fetched as it might seem at first glance.)

In certain schools of phonology, such a neutralized distinction is known as an archiphoneme (Nikolai Trubetzkoy of the Prague school is often associated with this analysis). Archiphonemes are often notated with a capital letter. Following this convention, the neutralization of /m, n, ŋ/ before /p, t, k/ could be notated as |N|, and limp, lint, link would be represented as |lɪNp, lɪNt, lɪNk|. (The |pipes| indicate underlying representation.) Other ways this archiphoneme could be notated are |m-n-ŋ|, {m, n, ŋ}, or |n*|.

Another example from American English is the neutralization of the plosives /t, d/ following a stressed syllable. Phonetically, both are realized in this position as [ɾ], a voiced alveolar flap. This can be heard by comparing betting with bedding.

[bɛt] bet
[bɛd] bed

with the suffix -ing:

[ˈbɛɾɪŋ] betting
[ˈbɛɾɪŋ] bedding

Thus, one cannot say whether the underlying representation of the intervocalic consonant in either word is /t/ or /d/ without looking at the unsuffixed form. This neutralization can be represented as an archiphoneme |D|, in which case the underlying representation of betting or bedding could be |ˈbɛDɪŋ|.

Another way to talk about archiphonemes involves the concept of underspecification: phonemes can be considered fully specified segments while archiphonemes are underspecified segments. In Tuvan, phonemic vowels are specified with the articulatory features of tongue height, backness, and lip rounding. The archiphoneme |U| is an underspecified high vowel where only the tongue height is specified.

phoneme/
archiphoneme
height backness roundedness
/i/ high front unrounded
/ɯ/ high back unrounded
/u/ high back rounded
|U| high

Whether |U| is pronounced as front or back and whether rounded or unrounded depends on vowel harmony. If |U| occurs following a front unrounded vowel, it will be pronounced as the phoneme /i/; if following a back unrounded vowel, it will be as an /ɯ/; and if following a back rounded vowel, it will be an /u/. This can be seen in the following words:

-|Um| 'my' (the vowel of this suffix is underspecified)
|idikUm| [idikim] 'my boot' (/i/ is front and unrounded)
|xarUm| [xarɯm] 'my snow' (/a/ is back and unrounded)
|nomUm| [nomum] 'my book' (/o/ is back and rounded)

Minimal contrastive units in sign languages

In sign languages, the basic elements of gesture and location were formerly called cheremes (or cheiremes), but general usage changed to phoneme. Tonic phonemes are sometimes called tonemes, and timing phonemes chronemes.

In sign languages, phonemes may be classified as Tab (elements of location, from Latin tabula), Dez (the hand shape, from designator), Sig (the motion, from signation), and with some researchers, Ori (orientation). Facial expressions and mouthing are also phonemic.

There is one published set of phonemic symbols for sign language, the Stokoe notation, used for linguistic research and originally developed for American Sign Language. Stokoe notation has since been applied to British Sign Language by Kyle and Woll, and to Australian Aboriginal sign languages by Adam Kendon. Other sign notations, such as the Hamburg Notation System and SignWriting, are phonetic scripts capable of writing any sign language. However, because they are not constrained by phonology, they do not yield a specific spelling for a sign. The SignWriting form, for example, will be different depending on whether the signer is left or right-handed, despite the fact this makes no difference to the meaning of the sign.

Phonological extremes

Of all the sounds human vocal folds can produce, different languages vary considerably in the number of these sounds that are considered to be distinctive phonemes in the speech of that language. Ubyx and Arrernte have only two phonemic vowels, while at the other extreme, the Bantu language Ngwe has 14 vowel qualities, 12 of which may occur long or short, making 26 oral vowels, plus 6 nasalized vowels, long and short, making a total of 38 vowels; while !Xóõ achieves 31 pure vowels, not counting its additional variation by vowel length, by varying the phonation. Rotokas has only six consonants, while !Xóõ has somewhere in the neighborhood of 77, and Ubyx 81. French has no phonemic tone or stress, while several of the Kam–Sui languages have nine tones, and one of the Kru languages, Wobe, has been claimed to have 14, though this is disputed. The total phonemic inventory in languages varies from as few as eleven in Rotokas to as many as 112 in !Xóõ (including four tones). The English language uses a rather large set of 13 to 21 vowels, including diphthongs, though its 22 to 26 consonants are close to average. (There are 21 consonant and five vowel letters in the English alphabet, but this does not correspond to the number of consonant and vowel sounds.)

The most common vowel system consists of the five vowels /i/, /e/, /a/, /o/, /u/. The most common consonants are /p/, /t/, /k/, /m/, /n/. Very few languages lack any of these: Arabic lacks /p/, standard Hawaiian lacks /t/, Mohawk and Tlingit lack /p/ and /m/, Hupa lacks both /p/ and a simple /k/, colloquial Samoan lacks /t/ and /n/, while Rotokas and Quileute lack /m/ and /n/.

See also

References

  1. ^ International Phonetic Association (1999), "Phonetic description and the IPA chart", Handbook of the International Phonetic Association: a guide to the use of the international phonetic alphabet, Cambridge University Press, ISBN 9780521637510, http://www.cambridge.org/uk/catalogue/catalogue.asp?isbn=0521637511