[Glyph for U+1A48. Source: Noto Sans Tai Tham]

U+1A48 Tai Tham Letter High Sa

U+1A48 was added in Unicode version 5.2 in 2009. It belongs to the block U+1A20 to U+1AAF Tai Tham in the U+0000 to U+FFFF Basic Multilingual Plane.

This character is an Other Letter and is mainly used in the Tai Tham script.

The glyph is not a composition. It has no designated width in East Asian texts. In bidirectional text it is written from left to right. When changing direction it is not mirrored. Whether a line break is allowed at U+1A48 depends on the surrounding context.
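Several of these properties can be checked programmatically. The following is a minimal sketch using Python's standard `unicodedata` module; the values it reports depend on the Unicode version bundled with the interpreter, and the `props` dictionary is only an illustrative name. The line-break class is not exposed by `unicodedata`, so it is not covered here.

```python
import unicodedata

ch = "\u1A48"  # TAI THAM LETTER HIGH SA

# Collect the properties described in the paragraph above.
props = {
    "name": unicodedata.name(ch),
    "category": unicodedata.category(ch),                  # "Lo" = Other Letter
    "bidi_class": unicodedata.bidirectional(ch),           # "L" = left to right
    "mirrored": bool(unicodedata.mirrored(ch)),            # False = not mirrored
    "east_asian_width": unicodedata.east_asian_width(ch),  # "N" = neutral
    "decomposition": unicodedata.decomposition(ch),        # "" = no decomposition
}

for key, value in props.items():
    print(f"{key}: {value!r}")
```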

Wikipedia has the following information about this codepoint's script:

The Tai Tham script (Tham meaning "scripture") is an abugida used mainly for a group of Southwestern Tai languages, namely Northern Thai, Tai Lü, Khün and Lao, as well as for the liturgical languages of Buddhism, Pali and Sanskrit. It is historically known as Tua Tham (ᨲ᩠ᩅᩫᨵᨾ᩠ᨾ᩼​ or ᨲ᩠ᩅᩫᨵᩢᨾ᩠ᨾ᩼). In Thailand and Myanmar, the script is often referred to as the Lanna script (Thai: อักษรธรรมล้านนา RTGS: Akson Tham Lan Na; Burmese: လန်နအက္ခရာ RTGS: Lanna Akara), after the historical kingdom of Lan Na, situated in the northern region of modern-day Thailand and part of Shan State in Myanmar. Local people in Northern Thailand also call the script Tua Mueang (ᨲ᩠ᩅᩫᨾᩮᩥᩬᨦ, Northern Thai pronunciation: [tǔa̯.mɯ̄a̯ŋ]), in parallel to Kam Mueang, a local name for the Northern Thai language. In Laos and the Isan region of Thailand, a variant of the Tai Tham script, often dubbed Lao Tham, is also known locally as To Tham Lao (Northeastern Thai: โตธรรมลาว /toː˩.tʰam˧˥.laːw˧/, cf. Lao: ໂຕທຳ/ໂຕທັມ BGN/PCGN to tham) or Yuan script. The Tai Tham script is traditionally written on dried palm leaves, as palm-leaf manuscripts.

The Northern Thai language is a close relative of (standard) Thai. It is spoken by nearly 6 million people in Northern Thailand and several thousand in Laos, of whom few are literate in the Lanna script. The script is still read by older monks. Northern Thai has six linguistic tones and Thai only five, making transcription into the Thai alphabet problematic. There is some resurgent interest in the script among younger people, but an added complication is that the modern spoken form, called Kam Mueang, differs in pronunciation from the older form.

There are 670,000 speakers of Tai Lü; some of those born before 1950 are literate in Tham, also known as Old Tai Lue. The script has also continued to be taught in the monasteries. The New Tai Lue script is derived from Tham. There are 120,000 speakers of Khün, for which Lanna is the only script.

Representations

System Representation
Decimal 6728
UTF-8 E1 A9 88
UTF-16 1A 48
UTF-32 00 00 1A 48
URL-Quoted %E1%A9%88
HTML hex reference &#x1A48;
Wrong windows-1252 Mojibake á©ˆ
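The representations above can be derived from the code point itself. The following is a sketch using only the Python standard library; the variable names are illustrative, and the UTF-16 and UTF-32 forms are assumed to be big-endian without a byte order mark, which matches the byte order shown in the table. The mojibake row is what results when the UTF-8 bytes are wrongly decoded as windows-1252.

```python
from urllib.parse import quote

ch = "\u1A48"  # TAI THAM LETTER HIGH SA

decimal = ord(ch)                            # code point as a decimal number
utf8 = ch.encode("utf-8")                    # three bytes in UTF-8
utf16 = ch.encode("utf-16-be")               # one 16-bit code unit, big-endian
utf32 = ch.encode("utf-32-be")               # one 32-bit code unit, big-endian
url_quoted = quote(ch)                       # percent-encoded UTF-8 bytes
html_ref = f"&#x{ord(ch):X};"                # HTML hexadecimal character reference
mojibake = utf8.decode("windows-1252")       # UTF-8 bytes misread as windows-1252

print("Decimal:", decimal)
print("UTF-8:", utf8.hex(" ").upper())
print("UTF-16:", utf16.hex(" ").upper())
print("UTF-32:", utf32.hex(" ").upper())
print("URL-Quoted:", url_quoted)
print("HTML hex reference:", html_ref)
print("Mojibake:", mojibake)
```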

Complete Record

Property Value
Age (age) 5.2 (2009)
Unicode Name (na) TAI THAM LETTER HIGH SA
Unicode 1 Name (na1)
Block (blk) Tai Tham
General Category (gc) Other Letter
Script (sc) Tai Tham
Bidirectional Category (bc) Left To Right
Combining Class (ccc) Not Reordered
Decomposition Type (dt) none
Decomposition Mapping (dm) U+1A48 Tai Tham Letter High Sa
Lowercase (Lower)
Simple Lowercase Mapping (slc) U+1A48 Tai Tham Letter High Sa
Lowercase Mapping (lc) U+1A48 Tai Tham Letter High Sa
Uppercase (Upper)
Simple Uppercase Mapping (suc) U+1A48 Tai Tham Letter High Sa
Uppercase Mapping (uc) U+1A48 Tai Tham Letter High Sa
Simple Titlecase Mapping (stc) U+1A48 Tai Tham Letter High Sa
Titlecase Mapping (tc) U+1A48 Tai Tham Letter High Sa
Case Folding (cf) U+1A48 Tai Tham Letter High Sa
ASCII Hex Digit (AHex)
Alphabetic (Alpha)
Bidi Control (Bidi_C)
Bidi Mirrored (Bidi_M)
Composition Exclusion (CE)
Case Ignorable (CI)
Changes When Casefolded (CWCF)
Changes When Casemapped (CWCM)
Changes When NFKC Casefolded (CWKCF)
Changes When Lowercased (CWL)
Changes When Titlecased (CWT)
Changes When Uppercased (CWU)
Cased (Cased)
Full Composition Exclusion (Comp_Ex)
Default Ignorable Code Point (DI)
Dash (Dash)
Deprecated (Dep)
Diacritic (Dia)
Emoji Modifier Base (EBase)
Emoji Component (EComp)
Emoji Modifier (EMod)
Emoji Presentation (EPres)
Emoji (Emoji)
Extender (Ext)
Extended Pictographic (ExtPict)
FC NFKC Closure (FC_NFKC) U+1A48 Tai Tham Letter High Sa
Grapheme Cluster Break (GCB) Any
Grapheme Base (Gr_Base)
Grapheme Extend (Gr_Ext)
Grapheme Link (Gr_Link)
Hex Digit (Hex)
Hyphen (Hyphen)
ID Continue (IDC)
ID Start (IDS)
IDS Binary Operator (IDSB)
IDS Trinary Operator (IDST)
IDSU (IDSU) 0
ID_Compat_Math_Continue (ID_Compat_Math_Continue) 0
ID_Compat_Math_Start (ID_Compat_Math_Start) 0
Ideographic (Ideo)
InCB (InCB) None
Indic Matra Category (InMC)
Indic Positional Category (InPC) NA
Indic Syllabic Category (InSC) Consonant
Jamo Short Name (JSN)
Join Control (Join_C)
Logical Order Exception (LOE)
Math (Math)
Noncharacter Code Point (NChar)
NFC Quick Check (NFC_QC) Yes
NFD Quick Check (NFD_QC) Yes
NFKC Casefold (NFKC_CF) U+1A48 Tai Tham Letter High Sa
NFKC Quick Check (NFKC_QC) Yes
NFKC_SCF (NFKC_SCF) U+1A48 Tai Tham Letter High Sa
NFKD Quick Check (NFKD_QC) Yes
Other Alphabetic (OAlpha)
Other Default Ignorable Code Point (ODI)
Other Grapheme Extend (OGr_Ext)
Other ID Continue (OIDC)
Other ID Start (OIDS)
Other Lowercase (OLower)
Other Math (OMath)
Other Uppercase (OUpper)
Prepended Concatenation Mark (PCM)
Pattern Syntax (Pat_Syn)
Pattern White Space (Pat_WS)
Quotation Mark (QMark)
Regional Indicator (RI)
Radical (Radical)
Sentence Break (SB) Other Letter
Soft Dotted (SD)
Sentence Terminal (STerm)
Terminal Punctuation (Term)
Unified Ideograph (UIdeo)
Variation Selector (VS)
Word Break (WB) Other
White Space (WSpace)
XID Continue (XIDC)
XID Start (XIDS)
Expands On NFC (XO_NFC)
Expands On NFD (XO_NFD)
Expands On NFKC (XO_NFKC)
Expands On NFKD (XO_NFKD)
Bidi Paired Bracket (bpb) U+1A48 Tai Tham Letter High Sa
Bidi Paired Bracket Type (bpt) None
East Asian Width (ea) neutral
Hangul Syllable Type (hst) Not Applicable
ISO 10646 Comment (isc)
Joining Group (jg) No_Joining_Group
Joining Type (jt) Non Joining
Line Break (lb) Complex Context Dependent (South East Asian)
Numeric Type (nt) none
Numeric Value (nv) not a number
Simple Case Folding (scf) U+1A48 Tai Tham Letter High Sa
Script Extension (scx)
Vertical Orientation (vo) R
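
The record lists "Yes" for all four normalization quick checks and maps every case operation back to the character itself. As a rough check, the following Python sketch (standard library only, with illustrative variable names) confirms that normalization and case mapping leave U+1A48 unchanged. It also tests whether the character can start an identifier, which Python decides from the XID Start property; the results depend on the Unicode version bundled with the interpreter.

```python
import unicodedata

ch = "\u1A48"  # TAI THAM LETTER HIGH SA

# Apply each normalization form to the single character.
normal_forms = {
    form: unicodedata.normalize(form, ch)
    for form in ("NFC", "NFD", "NFKC", "NFKD")
}

# Apply the case operations available on Python strings.
case_maps = {
    "lower": ch.lower(),
    "upper": ch.upper(),
    "title": ch.title(),
    "casefold": ch.casefold(),
}

# True if every operation returned the character unchanged.
unchanged = all(
    result == ch
    for result in list(normal_forms.values()) + list(case_maps.values())
)

# Python identifiers must begin with an XID_Start character (or underscore).
can_start_identifier = ch.isidentifier()

print("unchanged by normalization and case mapping:", unchanged)
print("valid identifier start:", can_start_identifier)
```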