U+FE9C ARABIC LETTER THEH MEDIAL FORM: ﺜ – Unicode

U+FE9C was added in Unicode version 1.1 in 1993. It belongs to the block U+FE70 to U+FEFF Arabic Presentation Forms-B in the U+0000 to U+FFFF Basic Multilingual Plane.

This character is a Other Letter and is mainly used in the Arabic script.

The glyph is a medial version of the glyph Arabic Letter Theh. It has no designated width in East Asian texts. In bidirectional text it is written as Arabic letter from right to left. When changing direction it is not mirrored. The word that U+FE9C forms with similar adjacent characters prevents a line break inside it. The glyph can be confused with one other glyph.

The Wikipedia has the following information about this codepoint:

Ṯāʾ (ث) is one of the six letters the Arabic alphabet added to the twenty-two from the Phoenician alphabet (the others being ḫāʾ, ḏāl, ḍād, ẓāʾ, ġayn). It is also one of the ten letters the Persian alphabet added from the twenty-two inherited from the Phoenician alphabet (the others being xe, ẕâl, zâd, ẓâ, ġayn, pe, che, že and gaf). In Modern Standard Arabic it represents the voiceless dental fricative [θ], also found in English as the "th" in words such as "thank" and "thin". In Persian, Urdu, and Kurdish it is pronounced as s as in "sister" in English. Ṯāʾ, along those with the letter shīn, are the only two surviving Arabic letters with three dots above. In most European languages, it is mostly romanized as the digraph th. In other languages, such as Indonesian, this Arabic letter is often romanized as ts and Ṡ.
The most common transliteration in English is "th", e.g. Ethiopia (إثيوبيا), thawb (ثوب).
In name and shape, it is a variant of tāʾ (ت). Its numerical value is 500 (see Abjad numerals).
The Arabic letter ث is named ثَاءْ ṯāʾ. It is written in several ways depending in its position in the word:

In contemporary spoken Arabic, pronunciation of ṯāʾ as [θ] is found in the Arabian Peninsula, Iraqi, and Tunisian and other dialects and in highly educated pronunciations of Modern Standard and Classical Arabic. Pronunciation of the letter varies between and within the various varieties of Arabic: while it is consistently pronounced as the voiceless dental plosive [t] in Maghrebi Arabic (except Tunisian and eastern Libyan), on the other hand in the Arabic varieties of the Mashriq (in the broad sense, including Egyptian, Sudanese and Levantine) and Hejazi Arabic, it is pronounced as the sibilant voiceless alveolar fricative [s] in loanwords from Literary Arabic.
When representing this sound in transliteration of Arabic into Hebrew, it is written as ת׳.

Representations

System	Representation
Nº	65180
UTF-8	EF BA 9C
UTF-16	FE 9C
UTF-32	00 00 FE 9C
URL-Quoted	%EF%BA%9C
HTML hex reference	ﺜ
Wrong windows-1252 Mojibake	ïºœ
Encoding: GB18030 (hex bytes)	84 31 8B 34
Adobe Glyph List	thehmedialarabic
digraph	tk.

Related Characters

Confusables

Elsewhere

Complete Record

Property	Value
Age (age)	1.1 (1993)
Unicode Name (na)	ARABIC LETTER THEH MEDIAL FORM
Unicode 1 Name (na1)	GLYPH FOR MEDIAL ARABIC THAA
Block (blk)	Arabic Presentation Forms-B
General Category (gc)	Other Letter
Script (sc)	Arabic
Bidirectional Category (bc)	Arabic Letter
Combining Class (ccc)	Not Reordered
Decomposition Type (dt)	medial
Decomposition Mapping (dm)	Arabic Letter Theh
Lowercase (Lower)	✘︎
Simple Lowercase Mapping (slc)	Arabic Letter Theh Medial Form
Lowercase Mapping (lc)	Arabic Letter Theh Medial Form
Uppercase (Upper)	✘︎
Simple Uppercase Mapping (suc)	Arabic Letter Theh Medial Form
Uppercase Mapping (uc)	Arabic Letter Theh Medial Form
Simple Titlecase Mapping (stc)	Arabic Letter Theh Medial Form
Titlecase Mapping (tc)	Arabic Letter Theh Medial Form
Case Folding (cf)	Arabic Letter Theh Medial Form
ASCII Hex Digit (AHex)	✘︎
Alphabetic (Alpha)	✔︎
Bidi Control (Bidi_C)	✘︎
Bidi Mirrored (Bidi_M)	✘︎
Composition Exclusion (CE)	✘︎
Case Ignorable (CI)	✘︎
Changes When Casefolded (CWCF)	✘︎
Changes When Casemapped (CWCM)	✘︎
Changes When NFKC Casefolded (CWKCF)	✔︎
Changes When Lowercased (CWL)	✘︎
Changes When Titlecased (CWT)	✘︎
Changes When Uppercased (CWU)	✘︎
Cased (Cased)	✘︎
Full Composition Exclusion (Comp_Ex)	✘︎
Default Ignorable Code Point (DI)	✘︎
Dash (Dash)	✘︎
Deprecated (Dep)	✘︎
Diacritic (Dia)	✘︎
Emoji Modifier Base (EBase)	✘︎
Emoji Component (EComp)	✘︎
Emoji Modifier (EMod)	✘︎
Emoji Presentation (EPres)	✘︎
Emoji (Emoji)	✘︎
Extender (Ext)	✘︎
Extended Pictographic (ExtPict)	✘︎
FC NFKC Closure (FC_NFKC)	Arabic Letter Theh Medial Form
Grapheme Cluster Break (GCB)	Any
Grapheme Base (Gr_Base)	✔︎
Grapheme Extend (Gr_Ext)	✘︎
Grapheme Link (Gr_Link)	✘︎
Hex Digit (Hex)	✘︎
Hyphen (Hyphen)	✘︎
ID Continue (IDC)	✔︎
ID Start (IDS)	✔︎
IDS Binary Operator (IDSB)	✘︎
IDS Trinary Operator and (IDST)	✘︎
IDSU (IDSU)	0
ID_Compat_Math_Continue (ID_Compat_Math_Continue)	0
ID_Compat_Math_Start (ID_Compat_Math_Start)	0
Ideographic (Ideo)	✘︎
InCB (InCB)	None
Indic Mantra Category (InMC)	—
Indic Positional Category (InPC)	NA
Indic Syllabic Category (InSC)	Other
Jamo Short Name (JSN)	—
Join Control (Join_C)	✘︎
Logical Order Exception (LOE)	✘︎
Modifier Combining Mark (MCM)	✘︎
Math (Math)	✘︎
Noncharacter Code Point (NChar)	✘︎
NFC Quick Check (NFC_QC)	Yes
NFD Quick Check (NFD_QC)	Yes
NFKC Casefold (NFKC_CF)	Arabic Letter Theh
NFKC Quick Check (NFKC_QC)	No
NFKC_SCF (NFKC_SCF)	Arabic Letter Theh
NFKD Quick Check (NFKD_QC)	No
Other Alphabetic (OAlpha)	✘︎
Other Default Ignorable Code Point (ODI)	✘︎
Other Grapheme Extend (OGr_Ext)	✘︎
Other ID Continue (OIDC)	✘︎
Other ID Start (OIDS)	✘︎
Other Lowercase (OLower)	✘︎
Other Math (OMath)	✘︎
Other Uppercase (OUpper)	✘︎
Prepended Concatenation Mark (PCM)	✘︎
Pattern Syntax (Pat_Syn)	✘︎
Pattern White Space (Pat_WS)	✘︎
Quotation Mark (QMark)	✘︎
Regional Indicator (RI)	✘︎
Radical (Radical)	✘︎
Sentence Break (SB)	Other Letter
Soft Dotted (SD)	✘︎
Sentence Terminal (STerm)	✘︎
Terminal Punctuation (Term)	✘︎
Unified Ideograph (UIdeo)	✘︎
Variation Selector (VS)	✘︎
Word Break (WB)	Alphabetic Letter
White Space (WSpace)	✘︎
XID Continue (XIDC)	✔︎
XID Start (XIDS)	✔︎
Expands On NFC (XO_NFC)	✘︎
Expands On NFD (XO_NFD)	✘︎
Expands On NFKC (XO_NFKC)	✘︎
Expands On NFKD (XO_NFKD)	✘︎
Bidi Paired Bracket (bpb)	Arabic Letter Theh Medial Form
Bidi Paired Bracket Type (bpt)	None
East Asian Width (ea)	neutral
Hangul Syllable Type (hst)	Not Applicable
ISO 10646 Comment (isc)	—
Joining Group (jg)	No_Joining_Group
Joining Type (jt)	Non Joining
Line Break (lb)	Alphabetic
Numeric Type (nt)	none
Numeric Value (nv)	not a number
Simple Case Folding (scf)	Arabic Letter Theh Medial Form
Script Extension (scx)
Vertical Orientation (vo)	R

U+FE9C Arabic Letter Theh Medial Form

Representations

Related Characters

Confusables

Elsewhere

Complete Record