Glyph for U+1A78
Source: Noto Sans Tai Tham

U+1A78 Tai Tham Sign Khuen Tone-4

U+1A78 was added in Unicode version 5.2 in 2009. It belongs to the block U+1A20 to U+1AAF Tai Tham in the U+0000 to U+FFFF Basic Multilingual Plane.

This character is a Nonspacing Mark and is mainly used in the Tai Tham script.

The glyph is not a composition. It has no designated width in East Asian texts. In bidirectional text it acts as a Nonspacing Mark. When changing direction it is not mirrored. U+1A78 offers a line break opportunity at its position, depending on the surrounding context.
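
The properties described above can be checked against the Unicode database bundled with a Python interpreter. This is a minimal sketch using the standard `unicodedata` module; the dictionary name `properties` and its keys are illustrative choices, and the values reported depend on the Unicode version your Python build ships with. The line break property is not exposed by `unicodedata`, so it is not covered here.

```python
import unicodedata

ch = "\u1A78"  # TAI THAM SIGN KHUEN TONE-4

# Collect the properties that the standard library exposes for this code point.
properties = {
    "name": unicodedata.name(ch),
    "general_category": unicodedata.category(ch),      # "Mn" = Nonspacing Mark
    "bidi_class": unicodedata.bidirectional(ch),       # "NSM" = Nonspacing Mark
    "combining_class": unicodedata.combining(ch),      # 230 = Above
    "mirrored": bool(unicodedata.mirrored(ch)),
    "decomposition": unicodedata.decomposition(ch),    # empty string = none
    "east_asian_width": unicodedata.east_asian_width(ch),  # "N" = neutral
}

for key, value in properties.items():
    print(f"{key}: {value!r}")
```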

Wikipedia has the following information about this codepoint:

Tai Tham script (Tham meaning "scripture") is an abugida writing system used mainly for a group of Southwestern Tai languages, i.e., Northern Thai, Tai Lü, Khün and Lao, as well as the liturgical languages of Buddhism, i.e., Pali and Sanskrit. It is historically known as Tua Tham (ᨲ᩠ᩅᩫᨵᨾ᩠ᨾ᩼ or ᨲ᩠ᩅᩫᨵᩢᨾ᩠ᨾ᩼). In Thailand and Myanmar, the script is often referred to as Lanna script (Thai: อักษรธรรมล้านนา RTGS: Akson Tham Lan Na; Burmese: လန်နာအက္ခရာ; MLCTS: Lanna Akkhara) in relation to the historical kingdom of Lan Na, situated in the Northern region of modern-day Thailand and a part of Shan state in Myanmar. Local people in Northern Thailand also call the script Tua Mueang (ᨲ᩠ᩅᩫᨾᩮᩥᩬᨦ, Northern Thai pronunciation: [tǔa̯.mɯ̄a̯ŋ]) in parallel to Kam Mueang, a local name for the Northern Thai language. In Laos and the Isan region of Thailand, a variation of the Tai Tham script, often dubbed Lao Tham, is also known by the locals as To Tham Lao (Northeastern Thai: โตธรรมลาว /toː˩.tʰam˧˥.laːw˧/, cf. Lao: ໂຕທຳ/ໂຕທັມ BGN/PCGN to tham) or Yuan script. Tai Tham script is traditionally written on a dried palm leaf as a palm-leaf manuscript.

The Northern Thai language is a close relative of (standard) Thai. It is spoken by nearly 6 million people in Northern Thailand and several thousand in Laos, of whom few are literate in Lanna script. The script is still read by older monks. Northern Thai has six linguistic tones and Thai only five, making transcription into the Thai alphabet problematic. There is some resurgent interest in the script among younger people, but an added complication is that the modern spoken form, called Kam Mueang, differs in pronunciation from the older form.

There are 670,000 speakers of Tai Lü; some of those born before 1950 are literate in Tham, also known as Old Tai Lue. The script has also continued to be taught in the monasteries. The New Tai Lue script is derived from Tham. There are 120,000 speakers of Khün, for which Lanna is the only script.

Representations

System Representation
Decimal 6776
UTF-8 E1 A9 B8
UTF-16 1A 78
UTF-32 00 00 1A 78
URL-Quoted %E1%A9%B8
HTML hex reference &#x1A78;
Wrong windows-1252 Mojibake á©¸
Encoding: GB18030 (hex bytes) 81 35 94 30
RFC 5137 \u'1A78'
Bash and Zsh inside echo -e \u1A78
C and C++ \u1A78
C# \u1A78
CSS \001A78
Excel =UNICHAR(6776)
Go \u1A78
JavaScript \u1A78
Modern JavaScript since ES6 \u{1a78}
JSON \u1A78
Java \u1A78
Lua \u{1A78}
Matlab char(6776)
Perl "\x{1A78}"
PHP \u{1a78}
PostgreSQL U&'\1A78'
PowerShell `u{1A78}
Python \u1A78
Ruby \u{1a78}
Rust \u{1a78}
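
The byte-level rows in the table above can be reproduced with the Python standard library. The following is a sketch, not the method the page itself uses; the helper `hex_bytes` and the dictionary `representations` are hypothetical names introduced for illustration. The mojibake row is obtained by decoding the UTF-8 bytes as if they were windows-1252 (`cp1252` in Python).

```python
from urllib.parse import quote

ch = chr(0x1A78)  # TAI THAM SIGN KHUEN TONE-4


def hex_bytes(data: bytes) -> str:
    """Format bytes as space-separated uppercase hex pairs."""
    return " ".join(f"{b:02X}" for b in data)


representations = {
    "decimal": ord(ch),
    "utf8": hex_bytes(ch.encode("utf-8")),
    "utf16": hex_bytes(ch.encode("utf-16-be")),   # big-endian, no BOM
    "utf32": hex_bytes(ch.encode("utf-32-be")),   # big-endian, no BOM
    "gb18030": hex_bytes(ch.encode("gb18030")),
    "url_quoted": quote(ch),                      # percent-encoded UTF-8
    "html_hex": f"&#x{ord(ch):X};",
    # UTF-8 bytes misread as windows-1252 produce the mojibake form.
    "mojibake_cp1252": ch.encode("utf-8").decode("cp1252"),
}

for label, value in representations.items():
    print(f"{label}: {value}")
```

The big-endian codecs are chosen so that the output matches the byte order shown in the table and no byte order mark is prepended.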

Complete Record

Property Value
Age (age) 5.2 (2009)
Unicode Name (na) TAI THAM SIGN KHUEN TONE-4
Unicode 1 Name (na1)
Block (blk) Tai Tham
General Category (gc) Nonspacing Mark
Script (sc) Tai Tham
Bidirectional Category (bc) Nonspacing Mark
Combining Class (ccc) Above
Decomposition Type (dt) none
Decomposition Mapping (dm) U+1A78
Lowercase (Lower)
Simple Lowercase Mapping (slc) U+1A78
Lowercase Mapping (lc) U+1A78
Uppercase (Upper)
Simple Uppercase Mapping (suc) U+1A78
Uppercase Mapping (uc) U+1A78
Simple Titlecase Mapping (stc) U+1A78
Titlecase Mapping (tc) U+1A78
Case Folding (cf) U+1A78
ASCII Hex Digit (AHex)
Alphabetic (Alpha)
Bidi Control (Bidi_C)
Bidi Mirrored (Bidi_M)
Composition Exclusion (CE)
Case Ignorable (CI)
Changes When Casefolded (CWCF)
Changes When Casemapped (CWCM)
Changes When NFKC Casefolded (CWKCF)
Changes When Lowercased (CWL)
Changes When Titlecased (CWT)
Changes When Uppercased (CWU)
Cased (Cased)
Full Composition Exclusion (Comp_Ex)
Default Ignorable Code Point (DI)
Dash (Dash)
Deprecated (Dep)
Diacritic (Dia)
Emoji Modifier Base (EBase)
Emoji Component (EComp)
Emoji Modifier (EMod)
Emoji Presentation (EPres)
Emoji (Emoji)
Extender (Ext)
Extended Pictographic (ExtPict)
FC NFKC Closure (FC_NFKC) U+1A78
Grapheme Cluster Break (GCB) Extend
Grapheme Base (Gr_Base)
Grapheme Extend (Gr_Ext)
Grapheme Link (Gr_Link)
Hex Digit (Hex)
Hyphen (Hyphen)
ID Continue (IDC)
ID Start (IDS)
IDS Binary Operator (IDSB)
IDS Trinary Operator (IDST)
IDSU (IDSU) 0
ID_Compat_Math_Continue (ID_Compat_Math_Continue) 0
ID_Compat_Math_Start (ID_Compat_Math_Start) 0
Ideographic (Ideo)
InCB (InCB) Extend
Indic Matra Category (InMC)
Indic Positional Category (InPC) Top
Indic Syllabic Category (InSC) Tone_Mark
Jamo Short Name (JSN)
Join Control (Join_C)
Logical Order Exception (LOE)
Modifier Combining Mark (MCM)
Math (Math)
Noncharacter Code Point (NChar)
NFC Quick Check (NFC_QC) Yes
NFD Quick Check (NFD_QC) Yes
NFKC Casefold (NFKC_CF) U+1A78
NFKC Quick Check (NFKC_QC) Yes
NFKC_SCF (NFKC_SCF) U+1A78
NFKD Quick Check (NFKD_QC) Yes
Other Alphabetic (OAlpha)
Other Default Ignorable Code Point (ODI)
Other Grapheme Extend (OGr_Ext)
Other ID Continue (OIDC)
Other ID Start (OIDS)
Other Lowercase (OLower)
Other Math (OMath)
Other Uppercase (OUpper)
Prepended Concatenation Mark (PCM)
Pattern Syntax (Pat_Syn)
Pattern White Space (Pat_WS)
Quotation Mark (QMark)
Regional Indicator (RI)
Radical (Radical)
Sentence Break (SB) Extend
Soft Dotted (SD)
Sentence Terminal (STerm)
Terminal Punctuation (Term)
Unified Ideograph (UIdeo)
Variation Selector (VS)
Word Break (WB) Extend
White Space (WSpace)
XID Continue (XIDC)
XID Start (XIDS)
Expands On NFC (XO_NFC)
Expands On NFD (XO_NFD)
Expands On NFKC (XO_NFKC)
Expands On NFKD (XO_NFKD)
Bidi Paired Bracket (bpb) U+1A78
Bidi Paired Bracket Type (bpt) None
East Asian Width (ea) neutral
Hangul Syllable Type (hst) Not Applicable
ISO 10646 Comment (isc)
Joining Group (jg) No_Joining_Group
Joining Type (jt) Transparent
Line Break (lb) Complex Context Dependent (South East Asian)
Numeric Type (nt) none
Numeric Value (nv) not a number
Simple Case Folding (scf) U+1A78
Script Extension (scx)
Vertical Orientation (vo) R
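
The record above lists "Yes" for all four normalization quick checks and maps every case operation back to U+1A78 itself. A short sketch with Python's standard library can confirm that the character is left unchanged by normalization and case mapping; the names `normalization_stable` and `case_stable` are illustrative, and the results reflect the Unicode data shipped with the Python build in use.

```python
import unicodedata

ch = "\u1A78"  # TAI THAM SIGN KHUEN TONE-4

# True for each form in which normalizing the character returns it unchanged.
normalization_stable = {
    form: unicodedata.normalize(form, ch) == ch
    for form in ("NFC", "NFD", "NFKC", "NFKD")
}

# True for each case operation that maps the character to itself.
case_stable = {
    "lower": ch.lower() == ch,
    "upper": ch.upper() == ch,
    "title": ch.title() == ch,
    "casefold": ch.casefold() == ch,
}

print(normalization_stable)
print(case_stable)
```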