C

This article is about the letter of the alphabet. For other uses, see C (disambiguation).

For technical reasons, "C#" redirects here. For C#, see C-sharp.

Aa	Bb	Cc	Dd
Ee	Ff	Gg	Hh
Ii	Jj	Kk	Ll
Mm	Nn	Oo	Pp
Qq	Rr	Ss	Tt
Uu	Vv	Ww	Xx
Yy	Zz

C in copyright symbol

C is the third letter in the English alphabet, and a letter of the alphabets of many other writing systems, which inherited it from the Latin alphabet. It is also the third letter of the ISO basic Latin alphabet. It is named cee (pronounced /ˈsiː/) in English.^[1]

History

Phoenician gaml	Arabic ǧīm	Hebrew gimel	Greek Gamma	Etruscan C	Old Latin C (G)

'C' comes from the same letter as 'G'. The Semites named it gimel. The sign is possibly adapted from an Egyptian hieroglyph for a staff sling, which may have been the meaning of the name gimel. Another possibility is that it depicted a camel, the Semitic name for which was gamal. Barry B. Powell, a specialist in the history of writing, states "It is hard to imagine how gimel = "camel" can be derived from the picture of a camel (it may show his hump, or his head and neck!)".^[2]

In the Etruscan language, plosive consonants had no contrastive voicing, so the Greek 'Γ' (Gamma) was adopted into the Etruscan alphabet to represent /k/. Already in the Western Greek alphabet, Gamma first took a '' form in Early Etruscan, then '' in Classical Etruscan. In Latin it eventually took the 'c' form in Classical Latin. In the earliest Latin inscriptions, the letters 'c k q' were used to represent the sounds /k/ and /ɡ/ (which were not differentiated in writing). Of these, 'q' was used to represent /k/ or /ɡ/ before a rounded vowel, 'k' before 'a', and 'c' elsewhere.^[3] During the 3rd century BC, a modified character was introduced for /ɡ/, and 'c' itself was retained for /k/. The use of 'c' (and its variant 'g') replaced most usages of 'k' and 'q'. Hence, in the classical period and after, 'g' was treated as the equivalent of Greek gamma, and 'c' as the equivalent of kappa; this shows in the romanization of Greek words, as in 'KAΔMOΣ', 'KYPOΣ', and 'ΦΩKIΣ' came into Latin as 'cadmvs', 'cyrvs' and 'phocis', respectively.

Other alphabets have letters homoglyphic to 'c' but not in use and derivation, like the Cyrillic letter Es (С, с) which derives from the lunate sigma, named due to its resemblance to the crescent moon.

Later use

When the Roman alphabet was introduced into Britain, 'c' represented only /k/ and this value of the letter has been retained in loanwords to all the insular Celtic languages: in Welsh, Irish, Gaelic, 'c' represents only /k/. The Old English or "Anglo-Saxon" writing was learned from the Celts, apparently of Ireland; hence 'c' in Old English also originally represented /k/; the Modern English words kin, break, broken, thick, and seek, all come from Old English words written with 'c': cyn, brecan, brocen, þicc, and séoc. But during the course of the Old English period, /k/ before front vowels (/e/ and /i/) were palatalized, having changed by the tenth century to [tʃ], though 'c' was still used, as in cir(i)ce, wrecc(e)a. On the continent, meanwhile, a similar phonetic change had also been going on (for example, in Italian).

In Vulgar Latin, /k/ became palatalized to [tʃ] in Italy and Dalmatia; in France and the Iberian peninsula, it became [ts]. Yet for these new sounds ⟨c⟩ was still used before front vowels ⟨e⟩,⟨ i⟩. The letter thus represented two distinct values. Subsequently, the Latin phoneme /kʷ/ (spelled ⟨qv⟩) de-labialized to /k/ meaning that the various Romance languages had /k/ before front vowels. In addition, Norman used the Greek letter 'k' so that the sound /k/ could be represented by either 'k' or 'c' the latter of which could represent either /k/ or /ts/ depending on whether it preceded a front vowel or not. The convention of using both c' and 'k' was applied to the writing of English after the Norman Conquest, causing a considerable re-spelling of the Old English words. Thus while Old English candel, clif, corn, crop, cú, remained unchanged, Cent, cæ´ᵹ (cé´ᵹ), cyng, brece, séoce, were now (without any change of sound) spelled 'Kent', 'keȝ', 'kyng', 'breke', and 'seoke'; even cniht ('knight') was subsequently changed to 'kniht' and þic ('thick') changed to 'thik' or 'thikk'. The Old English 'cw' was also at length displaced by the French 'qu' so that the Old English cwén ('queen') and cwic ('quick') became Middle English 'quen' 'quik', respectively. [tʃ] to which Old English palatalized /k/ had advanced, also occurred in French, chiefly from Latin /k/ before 'a'. In French it was represented by 'ch', as in champ (from Latin camp-um) and this spelling was introduced into English: the Hatton Gospels, written about 1160, have in Matt. i-iii, child, chyld, riche, mychel, for the cild, rice, mycel, of the Old English version whence they were copied. In these cases, the Old English 'c' gave place to 'k qu ch' but, on the other hand, 'c' in its new value of /ts/ came in largely in French words like processiun, emperice, grace, and was also substituted for 'ts' in a few Old English words, as miltse, bletsien, in early Middle English milce, blecien. By the end of the thirteenth century both in France and England, this sound /ts/ de-affricated to /s/; and from that time 'c' has represented /s/ before front vowels either for etymological reasons, as in lance, cent, or (in defiance of etymology) to avoid the ambiguity due to the "etymological" use of 's' for /z/, as in ace, mice, once, pence, defence.

Thus, to show the etymology, English spelling has advise, devise, instead of advize, devize, which while advice, device, dice, ice, mice, twice, etc., do not reflect etymology; example has extended this to hence, pence, defence, etc., where there is no etymological necessity for 'c'. Former generations also wrote sence for sense. Hence, today the Romance languages and English have a common feature inherited from Vulgar Latin where 'c' takes on either a "hard" or "soft" value depending on the following vowel.

Use in orthographies

English

In English orthography, 'c' generally represents a "soft" value of /s/ before the vowel letters 'e' (including the Latin-derived digraphs ae and oe), 'i' and 'y' and a "hard" value of /k/ before the vowel letters 'a', 'o' and 'u'. However, there are a number of exceptions in English: "soccer" and "Celt" are words that have /k/ where /s/ would be expected.

The soft c may represent the /ʃ/ sound in the digraph 'ci' when this precedes a vowel, as in the words 'delicious' and 'appreciate'.

The digraph 'ch' most commonly represents /tʃ/, but can take the value /k/ (mainly in words of Greek origin) or /ʃ/ (mainly in words of French origin); some dialects of English also have /x/ in words like loch where other speakers pronounce the final sound as /k/. The trigraph 'tch' always represents /tʃ/.

The digraph 'ck' is often used to represent the sound /k/ after short vowels.

Other languages

In the Romance languages French, Spanish, Italian, Romanian and Portuguese, 'c' generally has a "hard" value of /k/ and a "soft" value, the pronunciation of which varies by language. In French, Portuguese, and Spanish from Latin America and southern Spain, the soft 'c' value is /s/ as it is in English. In the Spanish spoken in northern and central Spain, the soft 'c' is a voiceless dental fricative /θ/. In Italian and Romanian, the soft 'c' is [t͡ʃ].

All Balto-Slavic languages that use the Latin alphabet, as well as Albanian, Hungarian, Pashto, several Sami languages, Esperanto, Ido, Interlingua, and Americanist phonetic notation (and those aboriginal languages of North America whose practical orthography derives from it) use 'c' to represent /t͡s/, the voiceless alveolar or voiceless dental sibilant affricate. In romanized Mandarin Chinese, the letter represents an aspirated version of this sound, /t͡sʰ/.

Among non-European languages that have adopted the Latin alphabet, 'c' represents a variety of sounds. Yup'ik, Indonesian, Malay, and a number of African languages such as Hausa, Fula, and Manding share the soft Italian value of /t͡ʃ/. In Azeri, Crimean Tatar, Kurdish, and Turkish 'c' stands for the voiced counterpart of this sound, the voiced postalveolar affricate /d͡ʒ/. In Yabem and similar languages, such as Bukawa, 'c' stands for a glottal stop /ʔ/. Xhosa and Zulu use this letter to represent the click /ǀ/. in some other African languages, such as Beninese Yoruba, 'c' is used for /ʃ/. In Fijian, 'c' stands for a voiced dental fricative /ð/, while in Somali it has the value of /ʕ/.

The letter 'c' is also used as a transliteration of the Cyrillic 'ц' in the Latinic forms of Serbian, Macedonian, and sometimes Ukrainian (along with the digraph 'ts').

Ch

There are several common digraphs with 'c', the most common being 'ch', which in some languages such as German is far more common than 'c' alone. 'Ch' takes various values in other languages, such as:

/t͡ʃ/ in Spanish
/ʃ/ in French and Portuguese
/k/ in Interlingua and Italian
/x/ in the West Slavic languages (e.g. Polish, Czech and Slovak)
/x/ (comprising the mostly allophonic sounds [x] and [ç]) or sometimes /k/ in German
/x/ or /χ/ in Dutch
/tʂʰ/ in Romanized Standard Chinese

Other digraphs and trigraphs

As in English, 'Ck', with the value /k/, is often used after short vowels in other Germanic languages such as German and Swedish (but some other Germanic languages use 'kk' instead, such as Dutch and Norwegian). The digraph 'cz' is found in Polish and 'cs' in Hungarian, both representing /t͡ʃ/. The digraph 'sc' represents /ʃ/ in Old English, Italian, and a few languages related to Italian, (however in Italian and related languages this only happens before front vowels, otherwise it represents /sk/). The trigraph 'sch' represents /ʃ/ in German.

Other usage

As a phonetic symbol, lowercase 'c' is the International Phonetic Alphabet (IPA) and X-SAMPA symbol for the voiceless palatal plosive, and capital 'C' is the X-SAMPA symbol for the voiceless palatal fricative.

It is used to represent one hundred in Roman numerals.

Related letters and other similar characters

Ć ć : Latin letter C with acute
Ĉ ĉ : Latin letter C with circumflex
Č č : Latin letter C with caron
Ċ ċ : Latin letter C with dot above
Ç ç : Latin letter C cedilla
Ḉ ḉ : Latin letter C with cedilla and acute
Ƈ ƈ : Latin letter C with hook
C̈ c̈: Latin letter C with diaresis
ɕ : IPA symbol C with back-curl
С с : Cyrillic letter Es – identical in shape with the Latin C c, but the equivalent of the Latin S s.
Ц ц : Cyrillic letter Tse

Ⅽ : Roman numeral
ʗ : stretched C
ℂ : double struck C
ℭ : blackletter C
℃ : degree Celsius
¢ : cent (currency)
₡ : colón (currency)
₢ : Brazilian cruzeiro (currency)
₵ : Ghana cedi (currency)

Computing codes

Character	C		c
Unicode name	LATIN CAPITAL LETTER C		LATIN SMALL LETTER C
Encodings	decimal	hex	decimal	hex
Unicode	67	U+0043	99	U+0063
UTF-8	67	43	99	63
Numeric character reference	C	C	c	c
EBCDIC family	195	C3	131	83
ASCII ¹	67	43	99	63

¹ Also for encodings based on ASCII, including the DOS, Windows, ISO-8859 and Macintosh families of encodings.

Other representations

NATO phonetic	Morse code
Charlie	–·–·


Signal flag	Flag semaphore	Braille dots-14

References

↑ "C" Oxford English Dictionary, 2nd edition (1989); Merriam-Webster's Third New International Dictionary of the English Language, Unabridged (1993); "cee", op. cit.
↑ Powell, Barry B. (27 Mar 2009). Writing: Theory and History of the Technology of Civilization. Wiley Blackwell. p. 182. ISBN 978-1405162562.
↑ Sihler, Andrew L. (1995). New Comparative Grammar of Greek and Latin (illustrated ed.). New York: Oxford University Press. p. 21. ISBN 0-19-508345-8.