Katakana
From Wikipedia, the free encyclopedia
Katakana カタカナ | |
---|---|
Type: | Syllabary |
Languages: | Japanese and Ainu |
Time period: | ~800 A.D. to the present |
Parent writing systems: | Kanji → Man'yōgana → Katakana カタカナ |
Sister writing systems: | Hiragana, Hentaigana |
Unicode range: | U+30A0–U+30FF |
ISO 15924 code: | Kana |
Katakana (片仮名) is a Japanese syllabary, one component of the Japanese writing system along with hiragana, kanji, and in some cases the Latin alphabet. The word katakana means "fragmentary kana," as the characters are derived from components of more complex kanji.
Katakana are characterized by short straight strokes and angular corners, and are the simplest of the Japanese scripts.
There are two main systems of ordering katakana: the old-fashioned iroha ordering and the more prevalent gojūon ordering.
Usage
In modern Japanese, katakana are most often used for transcription of words from foreign languages (called gairaigo). For example, "television" is written terebi (テレビ). Similarly, katakana are usually used for country names and for foreign place and personal names. For example, America is written アメリカ (Amerika), although it also has its own kanji spelling, amerika (亜米利加), and a shorter kanji name, beikoku (米国), meaning "the country of America"; John is written ジョン (Jon).
Katakana are also used for onomatopoeia, words that represent sounds. For example, pinpon (ピンポン), the "ding-dong" sound of a doorbell, would usually be written in katakana.
Technical and scientific terms, such as the names of animal and plant species and minerals, are also commonly written in katakana.
Katakana are often, though not always, used to write Japanese company names. For example, Suzuki is written スズキ, and Toyota is written トヨタ. Katakana are also used for emphasis, especially on signs, advertisements, and hoardings. For example, it is common to see ココ koko ("here"), ゴミ gomi ("trash") or メガネ megane ("glasses"). Words to be emphasized in a sentence are also sometimes written in katakana, mirroring the European use of italics.
Pre-World War II official documents mix katakana and kanji in the same way that hiragana and kanji are mixed in modern Japanese texts; that is, katakana were used for okurigana and for particles such as wa or o.
Katakana were also used for telegrams in Japan before 1988. Before the introduction of multibyte characters in computer systems in the 1980s, most computers used katakana instead of kanji or hiragana for output.
Although words borrowed from ancient Chinese are usually written in kanji, loanwords from modern Chinese dialects that are borrowed directly, rather than through the Sino-Japanese on'yomi readings, are often written in katakana. Examples include:
- マージャン (麻將/麻雀), mājan (mahjong), from Mandarin májiàng
- ウーロン茶 (烏龍茶), ūroncha (oolong tea), from Mandarin wūlóng
- チャーハン (炒飯), chāhan (fried rice)
- チャーシュー (叉焼), chāshū (roast pork), from Cantonese cha siu
- シューマイ (焼売), shūmai (a kind of dim sum), from Cantonese siu maai
The very common Chinese loanword ラーメン (rāmen) is rarely written with its kanji 拉麺.
There are rare cases where the opposite has occurred, with kanji forms created for words originally written in katakana. An example of this is コーヒー (kōhī), "coffee", which can alternatively be written as 珈琲. This kanji usage, although very rare, is occasionally employed by coffee manufacturers for novelty.
Katakana are sometimes used instead of hiragana as furigana to give the pronunciation of a word written in Roman characters, or of a foreign word that is written in kanji for its meaning but is intended to be pronounced as in the original language.
Katakana are also sometimes used to indicate words being spoken in a foreign or otherwise unusual accent, for example by foreign characters or robots. In a manga, the speech of a foreign character or a robot may be represented by コンニチワ (konnichiwa) instead of the more usual hiragana こんにちは (konnichi wa).
Katakana are also used to indicate the on'yomi (Chinese-derived) readings of a kanji in a kanji dictionary.
Some Japanese personal names are written in katakana. This was more common in the past, hence elderly women often have katakana names.
It is very common to write words with difficult-to-read kanji in katakana. This phenomenon is often seen with medical terminology. For example, in the word "dermatology", 皮膚科, hifuka, the second kanji, 膚, is considered difficult, and thus the word hifuka is commonly written as 皮フ科 or ヒフ科 in katakana. Similarly, difficult kanji such as 癌 gan, "cancer", are often written in katakana or hiragana.
Katakana is also used for traditional musical notations, as in the Tozan ryu of shakuhachi, and in sankyoku ensembles with koto, shamisen and shakuhachi.
Orthography
Foreign phrases are sometimes transliterated with a middle dot called nakaguro (中黒) or a space separating the words. However, in cases where it is assumed that the reader knows the separate gairaigo words in the phrase, the middle dot is not used. For example, the phrase コンピュータゲーム (kompyūta gēmu, "computer game"), containing two very well-known gairaigo, is not written with a middle dot.
Katakana spelling differs slightly from hiragana. While hiragana spells long vowels with the addition of a second vowel kana, katakana usually uses a vowel extender mark called a chōon. This mark is a short line following the direction of the text: horizontal in yokogaki (horizontal text) and vertical in tategaki (vertical text). However, the chōon is mostly used when writing foreign loanwords; long vowels in Japanese words written in katakana are usually written as they would be in hiragana. There are exceptions, such as ローソク (蝋燭, rōsoku, "candle") and ケータイ (携帯, kētai, "mobile phone").
A small tsu ッ called a sokuon indicates a geminate consonant, which is represented in rōmaji by doubling the following consonant. For example, bed is written in katakana as ベッド (beddo).
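The chōon and sokuon rules described above can be sketched in a few lines of Python. This is only an illustration: the `KANA` table, the `romanize` function and the constant names are hypothetical, the table covers just the handful of kana used in the examples, and yōon digraphs are not handled.

```python
# Tiny illustrative table: only the kana needed for the examples below.
# A real romanizer would need the full syllabary, including yōon digraphs.
KANA = {
    "ベ": "be", "ド": "do", "ゲ": "ge", "ム": "mu",
    "バ": "ba", "ハ": "ha", "マ": "ma",
}
MACRONS = {"a": "ā", "i": "ī", "u": "ū", "e": "ē", "o": "ō"}

SOKUON = "\u30c3"  # small tsu: doubles the following consonant
CHOON = "\u30fc"   # vowel extender: lengthens the preceding vowel


def romanize(word: str) -> str:
    """Romanize a katakana word using the small KANA table above."""
    out = []
    double_next = False
    for ch in word:
        if ch == SOKUON:
            # Remember to double the consonant of the next syllable.
            double_next = True
        elif ch == CHOON:
            if not out or out[-1][-1] not in MACRONS:
                raise ValueError("chōon must follow a vowel")
            # Replace the final vowel of the previous syllable with its macron form.
            out[-1] = out[-1][:-1] + MACRONS[out[-1][-1]]
        else:
            syllable = KANA[ch]  # KeyError for kana outside the sketch table
            if double_next:
                syllable = syllable[0] + syllable
                double_next = False
            out.append(syllable)
    return "".join(out)


print(romanize("ベッド"))  # beddo
print(romanize("ゲーム"))  # gēmu
print(romanize("バッハ"))  # bahha
```

Keeping the sokuon as a pending flag, rather than looking ahead in the string, keeps the loop to a single pass; a fuller version would also need to decide what to do with a sokuon at the end of a word.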
The sokuon is sometimes used in places which have no equivalent in native sounds. For example, a double h in place of ch is common in German names. Bach, for example, comes out as バッハ (Bahha); Mach is マッハ (Mahha). The doubling of the "h" in Bach and Mach (or rather the underlying small tsu) is probably the kana spelling that best fits those German names.
Related sounds in various languages are hard to express in Japanese, so Khrushchev becomes フルシチョフ (Furushichofu). Ali Khamenei is アリー・ハーメネイー (Arī Hāmeneī). The Japanese Wikipedia has references to both イツハク・パールマン (Itsuhaku Pāruman) and イツァーク・パールマン (Itsāku Pāruman) for Itzhak Perlman.
History
Katakana was developed in the early Heian Period from parts of man'yōgana characters as a form of shorthand. For example, ka カ comes from the left side of ka 加 "increase".
Computer encoding
Katakana have two forms of encoding, halfwidth hankaku (半角) and fullwidth zenkaku (全角). The halfwidth forms originally come from JIS X 0201, which places halfwidth katakana in the right-hand area of the code table, above the ASCII range. As a result, most halfwidth katakana can be represented by one byte each. In the late 1970s, two-byte character sets such as JIS X 0208 were introduced to represent hiragana, kanji and other characters. JIS X 0208 has its own katakana area, independent of one-byte character sets such as JIS X 0201. Katakana in JIS X 0208 take at least two bytes each, so many devices, especially older ones, output them at two-byte width. This is why the katakana of JIS X 0201 are called halfwidth and those of JIS X 0208 fullwidth. Hiragana, by contrast, were only ever included in the two-byte sets, so most encodings have no halfwidth hiragana.
Although often said to be obsolete, halfwidth katakana are in fact still used in many systems and encodings. For example, the titles of MiniDiscs can only be entered in ASCII or halfwidth katakana, and halfwidth katakana were commonly used in computerized cash register displays, on shop receipts, and in Japanese digital television and DVD subtitles. Several popular Japanese encodings, such as EUC-JP, Unicode and Shift-JIS, have halfwidth as well as fullwidth katakana codes. By contrast, ISO-2022-JP, which is mainly used over SMTP and NNTP, has no halfwidth katakana. Halfwidth katakana are commonly used to save memory space.
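The byte counts discussed above can be checked with Python's standard codecs. The following is a minimal sketch: the helper name `encoded_length` is hypothetical, and the codec names (`shift_jis`, `euc_jp`, `iso2022_jp`) are those of the Python standard library, whose coverage may differ in detail from other implementations of these encodings.

```python
# Halfwidth and fullwidth katakana KA, written as escapes so the two
# forms cannot be confused or silently normalized by an editor.
HALFWIDTH_KA = "\uff76"
FULLWIDTH_KA = "\u30ab"


def encoded_length(text: str, codec: str):
    """Return the number of bytes `text` takes in `codec`, or None if it cannot be encoded."""
    try:
        return len(text.encode(codec))
    except UnicodeEncodeError:
        return None


for codec in ("shift_jis", "euc_jp", "iso2022_jp", "utf-8"):
    print(codec, encoded_length(HALFWIDTH_KA, codec), encoded_length(FULLWIDTH_KA, codec))
```

In Shift-JIS the halfwidth form takes one byte and the fullwidth form two, which matches the origin of the names. In EUC-JP and UTF-8 the halfwidth form is no shorter than the fullwidth one, so "halfwidth" there describes display width only. ISO-2022-JP lengths include escape sequences and are not directly comparable; the helper returns `None` if the codec rejects the halfwidth form, which is what the description of ISO-2022-JP above would lead one to expect.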
Unicode
In Unicode, fullwidth katakana occupy code points U+30A0 to U+30FF [1]:
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
30A | ゠ | ァ | ア | ィ | イ | ゥ | ウ | ェ | エ | ォ | オ | カ | ガ | キ | ギ | ク |
30B | グ | ケ | ゲ | コ | ゴ | サ | ザ | シ | ジ | ス | ズ | セ | ゼ | ソ | ゾ | タ |
30C | ダ | チ | ヂ | ッ | ツ | ヅ | テ | デ | ト | ド | ナ | ニ | ヌ | ネ | ノ | ハ |
30D | バ | パ | ヒ | ビ | ピ | フ | ブ | プ | ヘ | ベ | ペ | ホ | ボ | ポ | マ | ミ |
30E | ム | メ | モ | ャ | ヤ | ュ | ユ | ョ | ヨ | ラ | リ | ル | レ | ロ | ヮ | ワ |
30F | ヰ | ヱ | ヲ | ン | ヴ | ヵ | ヶ | ヷ | ヸ | ヹ | ヺ | ・ | ー | ヽ | ヾ | ヿ |
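Because the fullwidth katakana block is a single contiguous range, membership can be tested with a simple comparison of code points. The sketch below uses hypothetical helper names (`is_fullwidth_katakana`, `chart_row`) and assumes Python 3.9 or later for the `list[str]` annotation.

```python
KATAKANA_FIRST = 0x30A0
KATAKANA_LAST = 0x30FF


def is_fullwidth_katakana(ch: str) -> bool:
    """True if `ch` is a single character inside U+30A0..U+30FF.

    Note that the block also contains marks such as the middle dot and
    the vowel extender, not only syllabic letters.
    """
    return len(ch) == 1 and KATAKANA_FIRST <= ord(ch) <= KATAKANA_LAST


def chart_row(prefix: int) -> list[str]:
    """Return the 16 characters of one chart row, e.g. 0x30A -> U+30A0..U+30AF."""
    return [chr(prefix * 16 + offset) for offset in range(16)]


# Print the first chart row, then pick the katakana out of a mixed-script phrase.
print(" ".join(chart_row(0x30A)))
print([ch for ch in "テレビを見る" if is_fullwidth_katakana(ch)])
```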
Halfwidth equivalents to the fullwidth katakana also exist. These are encoded within the Halfwidth and Fullwidth Forms block (U+FF00–U+FFEF) [2], starting at U+FF65 and ending at U+FF9F (characters U+FF61–U+FF64 are halfwidth punctuation marks):
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
FF6 | ｠ | ｡ | ｢ | ｣ | ､ | ･ | ｦ | ｧ | ｨ | ｩ | ｪ | ｫ | ｬ | ｭ | ｮ | ｯ |
FF7 | ｰ | ｱ | ｲ | ｳ | ｴ | ｵ | ｶ | ｷ | ｸ | ｹ | ｺ | ｻ | ｼ | ｽ | ｾ | ｿ |
FF8 | ﾀ | ﾁ | ﾂ | ﾃ | ﾄ | ﾅ | ﾆ | ﾇ | ﾈ | ﾉ | ﾊ | ﾋ | ﾌ | ﾍ | ﾎ | ﾏ |
FF9 | ﾐ | ﾑ | ﾒ | ﾓ | ﾔ | ﾕ | ﾖ | ﾗ | ﾘ | ﾙ | ﾚ | ﾛ | ﾜ | ﾝ | ﾞ | ﾟ |
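One common way to convert halfwidth katakana to their fullwidth equivalents is Unicode compatibility normalization (NFKC), available in Python's `unicodedata` module. The sketch below is not the only approach, and the helper name `to_fullwidth` is hypothetical. Note that NFKC also rewrites other compatibility characters, for example fullwidth Latin letters, so it may change more than the katakana.

```python
import unicodedata


def to_fullwidth(text: str) -> str:
    """Fold halfwidth katakana (and other compatibility forms) via NFKC."""
    return unicodedata.normalize("NFKC", text)


# Halfwidth KE + halfwidth voiced sound mark + halfwidth prolonged sound mark
# + halfwidth MU, written as escapes so the halfwidth forms are unambiguous.
halfwidth_gemu = "\uff79\uff9e\uff70\uff91"

fullwidth = to_fullwidth(halfwidth_gemu)
print(fullwidth)                                # ゲーム
print(len(halfwidth_gemu), "->", len(fullwidth))  # 4 -> 3
```

The length drops from four to three because the separate halfwidth voiced sound mark is composed with the preceding KE into the single fullwidth character GE.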
Code points U+32D0 to U+32FE contain the circled katakana. Note that there is no circled ン.
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
32D | ㋐ | ㋑ | ㋒ | ㋓ | ㋔ | ㋕ | ㋖ | ㋗ | ㋘ | ㋙ | ㋚ | ㋛ | ㋜ | ㋝ | ㋞ | ㋟ |
32E | ㋠ | ㋡ | ㋢ | ㋣ | ㋤ | ㋥ | ㋦ | ㋧ | ㋨ | ㋩ | ㋪ | ㋫ | ㋬ | ㋭ | ㋮ | ㋯ |
32F | ㋰ | ㋱ | ㋲ | ㋳ | ㋴ | ㋵ | ㋶ | ㋷ | ㋸ | ㋹ | ㋺ | ㋻ | ㋼ | ㋽ | ㋾ | |
Katakana for the Ainu language
Katakana is sometimes used to write the Ainu language. Unique to Ainu katakana usage, a consonant at the end of a syllable is represented by a small version of a katakana that combines that consonant with an arbitrary vowel. For instance, "up" is represented by ウㇷ゚ (u followed by small pu). In Unicode, the Katakana Phonetic Extensions block (U+31F0–U+31FF) [3] exists for Ainu language support. These characters are used mainly for the Ainu language:
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
31F | ㇰ | ㇱ | ㇲ | ㇳ | ㇴ | ㇵ | ㇶ | ㇷ | ㇸ | ㇹ | ㇺ | ㇻ | ㇼ | ㇽ | ㇾ | ㇿ |
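The Ainu example can be inspected code point by code point. This sketch assumes that the small pu is encoded as the small hu from the Katakana Phonetic Extensions block (U+31F7) followed by the combining semi-voiced sound mark (U+309A), since the block has no precomposed small pu; the helper name `in_phonetic_extensions` is hypothetical.

```python
import unicodedata

# "up" in Ainu katakana: U, then small HU plus a combining semi-voiced mark.
AINU_UP = "\u30a6\u31f7\u309a"


def in_phonetic_extensions(ch: str) -> bool:
    """True if `ch` lies in the Katakana Phonetic Extensions block (U+31F0..U+31FF)."""
    return 0x31F0 <= ord(ch) <= 0x31FF


# List each code point with its Unicode name and block membership.
for ch in AINU_UP:
    print(f"U+{ord(ch):04X}", unicodedata.name(ch), in_phonetic_extensions(ch))
```

Only the middle character belongs to the extensions block; the first is an ordinary katakana letter and the last is a combining mark shared with hiragana.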
Table of katakana
This is a table of katakana together with their Hepburn romanization. The first chart sets out the standard katakana (characters in red are obsolete, and characters in green are modern additions to the katakana, used mainly to represent sounds from other languages). Learning to read katakana is often complicated by the similarities between different characters. For example, shi シ and tsu ツ, as well as so ソ and n ン, look almost the same except for the slant and stroke shape.
vowels | | | | | yōon | | |
---|---|---|---|---|---|---|---|
ア a | イ i | ウ u | エ e | オ o | ャ ya | ュ yu | ョ yo |
カ ka | キ ki | ク ku | ケ ke | コ ko | キャ kya | キュ kyu | キョ kyo |
サ sa | シ shi | ス su | セ se | ソ so | シャ sha | シュ shu | ショ sho |
タ ta | チ chi | ツ tsu | テ te | ト to | チャ cha | チュ chu | チョ cho |
ナ na | ニ ni | ヌ nu | ネ ne | ノ no | ニャ nya | ニュ nyu | ニョ nyo |
ハ ha | ヒ hi | フ fu | ヘ he | ホ ho | ヒャ hya | ヒュ hyu | ヒョ hyo |
マ ma | ミ mi | ム mu | メ me | モ mo | ミャ mya | ミュ myu | ミョ myo |
ヤ ya | | ユ yu | イェ ye | ヨ yo | | | |
ラ ra | リ ri | ル ru | レ re | ロ ro | リャ rya | リュ ryu | リョ ryo |
ワ wa | (ヰ) ウィ wi | | (ヱ) ウェ we | ヲ (ウォ) wo | | | |
ン n | | | | | | | |
ガ ga | ギ gi | グ gu | ゲ ge | ゴ go | ギャ gya | ギュ gyu | ギョ gyo |
ザ za | ジ ji | ズ zu | ゼ ze | ゾ zo | ジャ ja | ジュ ju | ジョ jo |
ダ da | ヂ (ji) | ヅ (zu) | デ de | ド do | ヂャ (ja) | ヂュ (ju) | ヂョ (jo) |
バ ba | ビ bi | ブ bu | ベ be | ボ bo | ビャ bya | ビュ byu | ビョ byo |
パ pa | ピ pi | プ pu | ペ pe | ポ po | ピャ pya | ピュ pyu | ピョ pyo |
(ヷ) ヴァ va | (ヸ) ヴィ vi | ヴ vu | (ヹ) ヴェ ve | (ヺ) ヴォ vo | ヴャ vya | ヴュ vyu | ヴョ vyo |
| | | シェ she | | | | |
| | | ジェ je | | | | |
| | | チェ che | | | | |
| ティ ti | トゥ tu | | | | テュ tyu | |
| ディ di | ドゥ du | | | | デュ dyu | |
ツァ tsa | ツィ tsi | | ツェ tse | ツォ tso | | | |
ファ fa | フィ fi | | フェ fe | フォ fo | | フュ fyu | |
Example transcriptions of Katakana and foreign languages
Medicine
Original word | Katakana | Rōmaji |
---|---|---|
Vitamin (de) | ビタミン | Bitamin |
Mineral (en) | ミネラル | Mineraru |
Calcium (en) | カルシウム | Karushiumu |
Hormone (en) | ホルモン | Horumon |
Computing
Original word | Katakana | Rōmaji | Kanji and other words |
---|---|---|---|
Computer (en) | コンピューター | Konpyūtā | 計算機 keisanki 電算機 densanki 電子計算機 denshikeisanki |
Mouse (en) | マウス | Mausu | |
Keyboard (en) | キーボード | Kībōdo | |
Display (en) | ディスプレイ | Disupurei | 画面 gamen |
Pointer (en) | ポインタ | Pointa | |
Programming (en) | プログラミング | Puroguramingu | |
Software (en) | ソフトウェア | Sofutowea | |
Hardware (en) | ハードウェア | Hādowea | |
Operating system (en) | オペレーティング・システム | Operētingu shisutemu | 基本ソフト kihonsofuto OS ōesu |
Internet (en) | インターネット | Intānetto | |
Web (en) | ウェブ | Webu |
Names
Original word | Katakana | Rōmaji |
---|---|---|
John (en) | ジョン | Jon |
George (en) | ジョージ | Jōji |
Marie (en) | マリー | Marī |
Michael (en) | マイケル | Maikeru |
Maria (de) | マリア | Maria |
Michael (de) | ミハエル, ミヒャエル | Mihaeru, Mihyaeru |
Regions
Original word | Katakana | Rōmaji | Kanji |
---|---|---|---|
America (en) | アメリカ | Amerika | 米国 beikoku |
Latin America (en) | ラテンアメリカ | Raten Amerika | 中南米 chūnambei |
Europa (pt) | ヨーロッパ | Yōroppa | 欧州 ōshū |
Asia (en) | アジア | Ajia | 亜州 ashū |
Africa (en) | アフリカ | Afurika | 阿州 ashū |
Oceania (en) | オセアニア | Oseania | 大洋州 taiyōshū |
Nations and cities
Original word | Katakana | Rōmaji | English name | Local name |
---|---|---|---|---|
New York (en) | ニューヨーク | Nyūyōku | ||
Los Angeles (en) | ロサンゼルス | Rosanzerusu | ||
Canada (en) | カナダ | Kanada | ||
Toronto (en) | トロント | Toronto | ||
Brazil (en) | ブラジル | Burajiru | Brasil (pt) | |
London (en) | ロンドン | Rondon | ||
France (fr) (en) | フランス | Furansu | ||
Paris (fr) | パリ | Pari | ||
Deutschland (de), Duitsland (nl) | ドイツ | Doitsu | Germany (en) | |
Berlin (de) | ベルリン | Berurin | ||
Poland (en) | ポーランド | Pōrando | Polska (pl) | |
Italia (it) | イタリア | Itaria | Italy (en) | |
Roma (it) (lt) | ローマ | Rōma | Rome (en) | |
Spain (en) | スペイン | Supein | España (es) | |
Madrid (en) | マドリッド | Madoriddo | ||
Russia (en) | ロシア | Roshia | Росси́я, Rossiya (ru) | |
India (en) | インド | Indo | ||
Indonesia (id) | インドネシア | Indoneshia |
See also
- Japanese phonology for pronunciation.
- Hiragana
- Historical kana usage for a discussion of pre-war kana spelling
- Rōmaji for a comparison of romanization systems
- Transcribing English to Japanese
- Wikipedia:Technical assistance for katakana
External links
- Katakana code chart at Unicode.org
- katakana stroke order diagrams on nihongoresources.com
- Real Kana Practice katakana using different typefaces