
U+FE99 ARABIC LETTER THEH ISOLATED FORM

U+FE99 was added to Unicode in version 1.1 (1993). It belongs to the block U+FE70 to U+FEFF Arabic Presentation Forms-B in the U+0000 to U+FFFF Basic Multilingual Plane.

This character is an Other Letter and is mainly used in the Arabic script.

The glyph is an isolated-form composition of U+062B ARABIC LETTER THEH. It has a Neutral East Asian Width. In bidirectional context it acts as Arabic Letter and is not mirrored. Under some circumstances the glyph can be confused with one other glyph. In text, U+FE99 behaves as Alphabetic regarding line breaks. It has type Other Letter for sentence breaks and Alphabetic Letter for word breaks. The Grapheme Cluster Break is Any.
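Several of the properties described above can be checked against the Unicode database bundled with a programming language. The following is a minimal sketch using Python's standard `unicodedata` module; the dictionary keys are illustrative labels chosen here, not official property names, and the values depend on the Unicode version shipped with the interpreter.

```python
import unicodedata

ch = "\uFE99"  # ARABIC LETTER THEH ISOLATED FORM

# Collect a few core properties; the keys are labels chosen for this sketch.
properties = {
    "name": unicodedata.name(ch),
    "general_category": unicodedata.category(ch),          # "Lo" = Other Letter
    "bidi_class": unicodedata.bidirectional(ch),            # "AL" = Arabic Letter
    "mirrored": bool(unicodedata.mirrored(ch)),
    "east_asian_width": unicodedata.east_asian_width(ch),   # "N" = Neutral
    "decomposition": unicodedata.decomposition(ch),         # type tag plus mapping
    "combining_class": unicodedata.combining(ch),           # 0 = Not Reordered
}

for label, value in properties.items():
    print(f"{label}: {value}")
```

The decomposition string combines the decomposition type (`<isolated>`) with the code point it maps to (`062B`).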

Wikipedia has the following information about this codepoint:

Ṯāʾ (ث) is one of the six letters the Arabic alphabet added to the twenty-two from the Phoenician alphabet (the others being ḫāʾ, ḏāl, ḍād, ẓāʾ, ġayn). In Modern Standard Arabic it represents the voiceless dental fricative [θ], also found in English as the "th" in words such as "thank" and "thin". In Persian, Urdu, and Kurdish it is pronounced as s as in "sister" in English.

In name and shape, it is a variant of tāʾ (ت). Its numerical value is 500 (see Abjad numerals).

The Arabic letter ث is named ثَاءْ ṯāʾ. It is written in several ways depending on its position in the word.
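The positional variants are encoded as separate presentation-form code points next to U+FE99. As a rough illustration, the sketch below reads the decomposition type of U+FE99 through U+FE9C from Python's `unicodedata` module and groups them by position; the `forms` dictionary and the `BASE` name are introduced here for illustration only.

```python
import unicodedata

BASE = "\u062B"  # ARABIC LETTER THEH, the letter all four forms decompose to

# Map each positional label (taken from the decomposition type tag)
# to its presentation-form character.
forms = {}
for cp in range(0xFE99, 0xFE9D):
    ch = chr(cp)
    tag, target = unicodedata.decomposition(ch).split()
    assert chr(int(target, 16)) == BASE
    forms[tag.strip("<>")] = ch

for position, ch in forms.items():
    print(f"{position:9} U+{ord(ch):04X} {unicodedata.name(ch)}")
```

In ordinary text the base letter U+062B is stored and the rendering engine selects the positional shape; the presentation forms exist mainly for compatibility with older encodings.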

In contemporary spoken Arabic, pronunciation of ṯāʾ as [θ] is found in the dialects of the Arabian Peninsula, Iraq, and Tunisia, among others, and in highly educated pronunciations of Modern Standard and Classical Arabic. Pronunciation of the letter varies between and within the varieties of Arabic. It is consistently pronounced as the voiceless dental plosive [t] in Maghrebi Arabic (except Tunisian and eastern Libyan). In the Arabic varieties of the Mashriq (in the broad sense, including Egyptian, Sudanese and Levantine) and in Hejazi Arabic, it can be pronounced either as [t] or as the sibilant voiceless alveolar fricative [s], depending on the word in question; words pronounced with [s] are generally more technical or "sophisticated." Regardless of these regional differences, the pattern of the speaker's variety of Arabic frequently intrudes into otherwise Modern Standard speech. This is widely accepted, and is the norm when speaking the mesolect known alternately as lugha wusṭā ("middling/compromise language") or ʿAmmiyyat/Dārijat al-Muṯaqqafīn ("Educated/Cultured Colloquial"), used in the informal speech of educated Arabs of different countries.

When representing this sound in transliteration of Arabic into Hebrew, it is written as ת׳.

Representations

System Representation
Decimal 65177
UTF-8 EF BA 99
UTF-16 FE 99
UTF-32 00 00 FE 99
URL-Quoted %EF%BA%99
HTML-Escape &#xFE99;
Wrong windows-1252 Mojibake ïº™
digraph tk-
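Most of the representations listed above can be derived from the code point itself. The following is a minimal sketch using only the Python standard library; the `representations` dictionary and its keys are names chosen for this example, and the UTF-16 and UTF-32 values are shown in big-endian byte order to match the listing.

```python
import html
from urllib.parse import quote

ch = "\uFE99"  # ARABIC LETTER THEH ISOLATED FORM
utf8 = ch.encode("utf-8")

representations = {
    "decimal": ord(ch),
    "utf8": utf8.hex(" ").upper(),
    "utf16": ch.encode("utf-16-be").hex(" ").upper(),
    "utf32": ch.encode("utf-32-be").hex(" ").upper(),
    "url_quoted": quote(ch),                # percent-encodes the UTF-8 bytes
    "html_escape": f"&#x{ord(ch):X};",      # hexadecimal character reference
    # Mojibake: the UTF-8 bytes misread as windows-1252 text.
    "mojibake": utf8.decode("cp1252"),
}

for label, value in representations.items():
    print(f"{label}: {value}")

# The HTML escape round-trips back to the original character.
assert html.unescape(representations["html_escape"]) == ch
```

The mojibake entry shows what a reader sees when UTF-8 encoded text containing this character is displayed by software that assumes windows-1252.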

Complete Record

Property Value
Age 1.1 (1993)
Unicode Name ARABIC LETTER THEH ISOLATED FORM
Unicode 1 Name GLYPH FOR ISOLATE ARABIC THAA
Block Arabic Presentation Forms-B
General Category Other Letter
Script Arabic
Bidirectional Category Arabic Letter
Combining Class Not Reordered
Decomposition Type Isolated
Decomposition Mapping U+062B ARABIC LETTER THEH
Lowercase
Simple Lowercase Mapping U+FE99 ARABIC LETTER THEH ISOLATED FORM
Lowercase Mapping U+FE99 ARABIC LETTER THEH ISOLATED FORM
Uppercase
Simple Uppercase Mapping U+FE99 ARABIC LETTER THEH ISOLATED FORM
Uppercase Mapping U+FE99 ARABIC LETTER THEH ISOLATED FORM
Simple Titlecase Mapping U+FE99 ARABIC LETTER THEH ISOLATED FORM
Titlecase Mapping U+FE99 ARABIC LETTER THEH ISOLATED FORM
Case Folding U+FE99 ARABIC LETTER THEH ISOLATED FORM
ASCII Hex Digit
Alphabetic
Bidi Control
Bidi Mirrored
Bidi Paired Bracket U+FE99 ARABIC LETTER THEH ISOLATED FORM
Bidi Paired Bracket Type None
Cased
Composition Exclusion
Case Ignorable
Full Composition Exclusion
Changes When Casefolded
Changes When Casemapped
Changes When NFKC Casefolded
Changes When Lowercased
Changes When Titlecased
Changes When Uppercased
Dash
Deprecated
Default Ignorable Code Point
Diacritic
East Asian Width Neutral
Emoji Modifier Base
Emoji Component
Emoji Modifier
Emoji
Emoji Presentation
Extender
Extended Pictographic
FC NFKC Closure U+FE99 ARABIC LETTER THEH ISOLATED FORM
Grapheme Cluster Break Any
Grapheme Base
Grapheme Extend
Grapheme Link
Hex Digit
Hangul Syllable Type Not Applicable
Hyphen
ID Continue
Ideographic
ID Start
IDS Binary Operator
IDS Trinary Operator
Indic Matra Category
Indic Positional Category NA
Indic Syllabic Category Other
ISO 10646 Comment
Joining Group No_Joining_Group
Join Control
Jamo Short Name
Joining Type Non Joining
Line Break Alphabetic
Logical Order Exception
Math
Noncharacter Code Point
NFC Quick Check Yes
NFD Quick Check Yes
NFKC Casefold U+062B ARABIC LETTER THEH
NFKC Quick Check No
NFKD Quick Check No
Numeric Type None
Numeric Value not a number
Other Alphabetic
Other Default Ignorable Code Point
Other Grapheme Extend
Other ID Continue
Other ID Start
Other Lowercase
Other Math
Other Uppercase
Pattern Syntax
Pattern White Space
Prepended Concatenation Mark
Quotation Mark
Radical
Regional Indicator
Sentence Break Other Letter
Simple Case Folding U+FE99 ARABIC LETTER THEH ISOLATED FORM
Script Extension
Soft Dotted
Sentence Terminal
Terminal Punctuation
Unified Ideograph
Vertical Orientation R
Variation Selector
Word Break Alphabetic Letter
White Space
XID Continue
XID Start
Expands On NFC
Expands On NFD
Expands On NFKC
Expands On NFKD
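The normalization and case-mapping entries in the record can be reproduced with a short check. The sketch below assumes Python 3.8 or later (for `unicodedata.is_normalized`); the `normalized` and `already_normalized` dictionaries are names chosen for this example.

```python
import unicodedata

ch = "\uFE99"  # ARABIC LETTER THEH ISOLATED FORM
FORMS = ("NFC", "NFD", "NFKC", "NFKD")

# Result of applying each normalization form to the single character.
normalized = {form: unicodedata.normalize(form, ch) for form in FORMS}

# Whether the character is already in each form (compare the Quick Check rows).
already_normalized = {form: unicodedata.is_normalized(form, ch) for form in FORMS}

for form in FORMS:
    result = normalized[form]
    print(f"{form}: U+{ord(result):04X} {unicodedata.name(result)} "
          f"(already normalized: {already_normalized[form]})")

# The character has no case distinction, so the case mappings return it unchanged.
print(ch.lower() == ch, ch.upper() == ch, ch.casefold() == ch)
```

The canonical forms (NFC, NFD) leave the character as it is, while the compatibility forms (NFKC, NFKD) replace it with the base letter U+062B, which matches the Yes/No quick-check values listed in the record.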