U+8336 was added to Unicode in version 1.1 (1993). It belongs to the block CJK Unified Ideographs in the Basic Multilingual Plane.

This character is an Other Letter and is mainly used in the Han script. The Unihan Database defines it as tea. Its Pīnyīn pronunciation is chá.

The glyph is not a composition. Its East Asian Width is Wide. In bidirectional context it acts as Left To Right and is not mirrored. Under some circumstances the glyph can be confused with one other glyph. In text U+8336 behaves as Ideographic regarding line breaks. It has type OLetter for sentence breaks and Other for word breaks. The Grapheme Cluster Break is Any.
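Several of the properties listed above can be checked with Python's standard `unicodedata` module. The following is a minimal sketch; the helper name `describe` is hypothetical and only used for illustration. The line break, sentence break, word break and grapheme cluster break properties are not exposed by `unicodedata`, so they are left out here.

```python
import unicodedata


def describe(ch: str) -> dict:
    """Collect the properties of one character that unicodedata exposes."""
    return {
        "name": unicodedata.name(ch),
        # 'Lo' is the abbreviation for the general category Other Letter.
        "category": unicodedata.category(ch),
        # 'W' is the abbreviation for East Asian Width "Wide".
        "east_asian_width": unicodedata.east_asian_width(ch),
        # 'L' is the bidirectional class Left To Right.
        "bidi_class": unicodedata.bidirectional(ch),
        # mirrored() returns 0 or 1; convert it to a bool for readability.
        "mirrored": bool(unicodedata.mirrored(ch)),
        # An empty string means the character has no decomposition mapping.
        "decomposition": unicodedata.decomposition(ch),
    }


props = describe("\u8336")
for key, value in props.items():
    print(f"{key}: {value!r}")
```

The reported values depend on the Unicode version bundled with the Python interpreter, which is available as `unicodedata.unidata_version`.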

Wikipedia has the following information about this codepoint:

The etymology of tea can be traced back to the ancient Chinese form of the word. The Chinese character for tea is 茶, originally written with an extra horizontal stroke as 荼 (pronounced tu, used as a word for a bitter herb); it acquired its current form in the Tang Dynasty, when it was first used in the eighth-century treatise on tea, The Classic of Tea. The word is pronounced differently in the various Chinese languages, such as chá in Mandarin, zo and dzo in Wu Chinese, and ta and te in Min Chinese. One suggestion is that the different pronunciations may have arisen from the different words for tea in ancient China; for example, tu (荼) may have given rise to tê. Historical phonologists, however, have argued that cha, te and dzo all arose from the same root with a reconstructed pronunciation dra (dr- represents a single consonant for a retroflex d), which changed through sound shifts over the centuries. Other ancient words for tea include jia (檟, defined as "bitter tu" during the Han Dynasty), she (蔎), ming (茗) and chuan (荈), with ming the only other word still in use for tea.

Most Chinese languages, such as Mandarin and Cantonese, pronounce the word along the lines of cha, but Hokkien varieties along the southern coast of China and in Southeast Asia pronounce it like teh. These two pronunciations have made their separate ways into other languages around the world:

  • Te is from the Amoy tê of southern Fujian province. It reached the West from the port of Xiamen (Amoy), once a major point of contact with Western European traders such as the Dutch, who spread it to Western Europe.
  • Cha is from the Cantonese chàh of Guangzhou (Canton) and the ports of Hong Kong and Macau, also major points of contact, especially with the Portuguese, who spread it to India in the 16th century. The Korean and Japanese pronunciations of cha, however, did not come from Cantonese; rather, they were borrowed into Korean and Japanese during earlier periods of Chinese history.

The widespread form chai is likely to have come from Persian چای chay. Both the châ and chây forms are found in Persian dictionaries. They derive from the Northern Chinese pronunciation chá, which passed overland to Central Asia and Persia, where it picked up the Persian grammatical suffix -yi before passing on to Russian, Arabic, Urdu, Turkish, etc.

English has all three forms: cha or char (both pronounced /ˈtʃɑː/), attested from the 16th century; tea, from the 17th; and chai, from the 20th.

Languages in more intense contact with Chinese, that is, Sinospheric languages like Vietnamese, Zhuang, Tibetan, Korean, and Japanese, may have borrowed their words for tea at an earlier time and from a different variety of Chinese; these are the so-called Sino-Xenic pronunciations. Although the word is normally pronounced cha, Korean and Japanese also retain earlier, though less common, pronunciations ta and da. Japanese has different pronunciations for the word tea depending on when the pronunciation was first borrowed into the language: ta comes from the Tang Dynasty court at Chang'an, that is, from Middle Chinese; da, however, comes from the earlier Southern Dynasties court at Nanjing, a place where the consonant was still voiced, as it is today in neighbouring Shanghainese zo. Vietnamese and Zhuang have southern cha-type pronunciations.

System Representation
UTF-8 E8 8C B6
UTF-16 83 36
UTF-32 00 00 83 36
URL-Quoted %E8%8C%B6
HTML-Escape &#x8336;
Wrong windows-1252 Mojibake èŒ¶
Encoding: EUC-KR (hex bytes) D2 FE
Encoding: JIS0208 (hex bytes) C3 E3
Pīnyīn chá
IRG_GSource G0-3268
IRG_HSource HB1-
IRG_JSource J0-4363
IRG_KPSource KP0-D5A
IRG_KSource K0-527E
IRG_TSource T1-577D
IRG_VSource V1-647
BigFive AFF9
CCCII 21547B
CNS1986 1-577D
CNS1992 1-577D
EACC 21547B
GB0 1872
GB1 1872
Jis0 3567
KSC0 5094
MainlandTelegraph 5420
TaiwanTelegraph 5420
Xerox 244:075
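
The encoding rows of the table above can be reproduced with Python's built-in codecs. This is a sketch under a few assumptions: the helper names `hex_bytes` and `kuten_to_euc` are hypothetical, the JIS0208 row is assumed to show the EUC-JP byte form, and the four-digit GB0, Jis0 and KSC0 values are assumed to be decimal row-cell (kuten) codes, where adding 0xA0 to the row and to the cell yields the EUC bytes.

```python
import html
from urllib.parse import quote

CH = "\u8336"  # the character described on this page


def hex_bytes(data: bytes) -> str:
    """Format bytes as space-separated uppercase hex pairs."""
    return " ".join(f"{b:02X}" for b in data)


def kuten_to_euc(kuten: str) -> bytes:
    """Convert a 4-digit decimal row-cell code (e.g. '3567') to EUC bytes.

    Row and cell are each offset by 0x20 to give the 7-bit code and by a
    further 0x80 for the EUC form, i.e. 0xA0 in total.
    """
    row, cell = int(kuten[:2]), int(kuten[2:])
    return bytes([row + 0xA0, cell + 0xA0])


representations = {
    "UTF-8": hex_bytes(CH.encode("utf-8")),
    "UTF-16": hex_bytes(CH.encode("utf-16-be")),  # big-endian, no BOM
    "UTF-32": hex_bytes(CH.encode("utf-32-be")),  # big-endian, no BOM
    "URL-Quoted": quote(CH),                      # percent-encodes the UTF-8 bytes
    "HTML-Escape": f"&#x{ord(CH):X};",            # hexadecimal character reference
    # Mojibake: the UTF-8 bytes misread as windows-1252 text.
    "Mojibake": CH.encode("utf-8").decode("cp1252"),
    "EUC-KR": hex_bytes(CH.encode("euc_kr")),
    "EUC-JP": hex_bytes(CH.encode("euc_jp")),
}

for label, value in representations.items():
    print(f"{label}: {value}")

# Round trips: the escape and the row-cell codes lead back to the same bytes.
assert html.unescape(representations["HTML-Escape"]) == CH
print("Jis0 3567 ->", hex_bytes(kuten_to_euc("3567")))
print("KSC0 5094 ->", hex_bytes(kuten_to_euc("5094")))
```

The explicit `-be` codec variants are used so that no byte order mark is prepended; the plain `utf-16` and `utf-32` codecs would add one and use the platform's byte order.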

