Home: go to the homepage U+0600 to U+06FF Arabic
Glyph for U+0601
Source: Noto Sans Arabic

U+0601 Arabic Sign Sanah

U+0601 was added in Unicode version 4.0 in 2003. It belongs to the block U+0600 to U+06FF Arabic in the U+0000 to U+FFFF Basic Multilingual Plane.

This character is a Format and is mainly used in the Arabic script.

The glyph is not a composition. It has no designated width in East Asian texts. In bidirectional text it is written as Arabic number from right to left. When changing direction it is not mirrored. This number joins with other adjacent letters and numbers to form a word. U+0601 forms a number with similar characters, which prevents a line break inside it.

The Wikipedia has the following information about this codepoint:

Many scripts in Unicode, such as Arabic, have special orthographic rules that require certain combinations of letterforms to be combined into special ligature forms. In English, the common ampersand (&) developed from a ligature in which the handwritten Latin letters e and t (spelling et, Latin for and) were combined. The rules governing ligature formation in Arabic can be quite complex, requiring special script-shaping technologies such as the Arabic Calligraphic Engine by Thomas Milo's DecoType.

As of Unicode 15.1, the Arabic script is contained in the following blocks:

  • Arabic (0600–06FF, 256 characters)
  • Arabic Supplement (0750–077F, 48 characters)
  • Arabic Extended-B (0870–089F, 41 characters)
  • Arabic Extended-A (08A0–08FF, 96 characters)
  • Arabic Presentation Forms-A (FB50–FDFF, 631 characters)
  • Arabic Presentation Forms-B (FE70–FEFF, 141 characters)
  • Rumi Numeral Symbols (10E60–10E7F, 31 characters)
  • Arabic Extended-C (10EC0-10EFF, 3 characters)
  • Indic Siyaq Numbers (1EC70–1ECBF, 68 characters)
  • Ottoman Siyaq Numbers (1ED00–1ED4F, 61 characters)
  • Arabic Mathematical Alphabetic Symbols (1EE00–1EEFF, 143 characters)

The basic Arabic range encodes the standard letters and diacritics, but does not encode contextual forms (U+0621–U+0652 being directly based on ISO 8859-6); and also includes the most common diacritics and Arabic-Indic digits. The Arabic Supplement range encodes letter variants mostly used for writing African (non-Arabic) languages. The Arabic Extended-B and Arabic Extended-A ranges encode additional Qur'anic annotations and letter variants used for various non-Arabic languages. The Arabic Presentation Forms-A range encodes contextual forms and ligatures of letter variants needed for Persian, Urdu, Sindhi and Central Asian languages. The Arabic Presentation Forms-B range encodes spacing forms of Arabic diacritics, and more contextual letter forms. The presentation forms are present only for compatibility with older standards, and are not currently needed for coding text. The Arabic Mathematical Alphabetical Symbols block encodes characters used in Arabic mathematical expressions. The Indic Siyaq Numbers block contains a specialized subset of Arabic script that was used for accounting in India under the Mughal Empire by the 17th century through the middle of the 20th century. The Ottoman Siyaq Numbers block contains a specialized subset of Arabic script, also known as Siyakat numbers, used for accounting in Ottoman Turkish documents.

Representations

System Representation
1537
UTF-8 D8 81
UTF-16 06 01
UTF-32 00 00 06 01
URL-Quoted %D8%81
HTML hex reference ؁
Wrong windows-1252 Mojibake ؁

Elsewhere

Complete Record

Property Value
Age (age) 4.0 (2003)
Unicode Name (na) ARABIC SIGN SANAH
Unicode 1 Name (na1)
Block (blk) Arabic
General Category (gc) Format
Script (sc) Arabic
Bidirectional Category (bc) Arabic Number
Combining Class (ccc) Not Reordered
Decomposition Type (dt) none
Decomposition Mapping (dm) Glyph for U+0601 Arabic Sign Sanah
Lowercase (Lower)
Simple Lowercase Mapping (slc) Glyph for U+0601 Arabic Sign Sanah
Lowercase Mapping (lc) Glyph for U+0601 Arabic Sign Sanah
Uppercase (Upper)
Simple Uppercase Mapping (suc) Glyph for U+0601 Arabic Sign Sanah
Uppercase Mapping (uc) Glyph for U+0601 Arabic Sign Sanah
Simple Titlecase Mapping (stc) Glyph for U+0601 Arabic Sign Sanah
Titlecase Mapping (tc) Glyph for U+0601 Arabic Sign Sanah
Case Folding (cf) Glyph for U+0601 Arabic Sign Sanah
ASCII Hex Digit (AHex)
Alphabetic (Alpha)
Bidi Control (Bidi_C)
Bidi Mirrored (Bidi_M)
Composition Exclusion (CE)
Case Ignorable (CI)
Changes When Casefolded (CWCF)
Changes When Casemapped (CWCM)
Changes When NFKC Casefolded (CWKCF)
Changes When Lowercased (CWL)
Changes When Titlecased (CWT)
Changes When Uppercased (CWU)
Cased (Cased)
Full Composition Exclusion (Comp_Ex)
Default Ignorable Code Point (DI)
Dash (Dash)
Deprecated (Dep)
Diacritic (Dia)
Emoji Modifier Base (EBase)
Emoji Component (EComp)
Emoji Modifier (EMod)
Emoji Presentation (EPres)
Emoji (Emoji)
Extender (Ext)
Extended Pictographic (ExtPict)
FC NFKC Closure (FC_NFKC) Glyph for U+0601 Arabic Sign Sanah
Grapheme Cluster Break (GCB) Prepend
Grapheme Base (Gr_Base)
Grapheme Extend (Gr_Ext)
Grapheme Link (Gr_Link)
Hex Digit (Hex)
Hyphen (Hyphen)
ID Continue (IDC)
ID Start (IDS)
IDS Binary Operator (IDSB)
IDS Trinary Operator and (IDST)
IDSU (IDSU) 0
ID_Compat_Math_Continue (ID_Compat_Math_Continue) 0
ID_Compat_Math_Start (ID_Compat_Math_Start) 0
Ideographic (Ideo)
InCB (InCB) None
Indic Mantra Category (InMC)
Indic Positional Category (InPC) NA
Indic Syllabic Category (InSC) Other
Jamo Short Name (JSN)
Join Control (Join_C)
Logical Order Exception (LOE)
Math (Math)
Noncharacter Code Point (NChar)
NFC Quick Check (NFC_QC) Yes
NFD Quick Check (NFD_QC) Yes
NFKC Casefold (NFKC_CF) Glyph for U+0601 Arabic Sign Sanah
NFKC Quick Check (NFKC_QC) Yes
NFKC_SCF (NFKC_SCF) Glyph for U+0601 Arabic Sign Sanah
NFKD Quick Check (NFKD_QC) Yes
Other Alphabetic (OAlpha)
Other Default Ignorable Code Point (ODI)
Other Grapheme Extend (OGr_Ext)
Other ID Continue (OIDC)
Other ID Start (OIDS)
Other Lowercase (OLower)
Other Math (OMath)
Other Uppercase (OUpper)
Prepended Concatenation Mark (PCM)
Pattern Syntax (Pat_Syn)
Pattern White Space (Pat_WS)
Quotation Mark (QMark)
Regional Indicator (RI)
Radical (Radical)
Sentence Break (SB) Numeric
Soft Dotted (SD)
Sentence Terminal (STerm)
Terminal Punctuation (Term)
Unified Ideograph (UIdeo)
Variation Selector (VS)
Word Break (WB) Numeric
White Space (WSpace)
XID Continue (XIDC)
XID Start (XIDS)
Expands On NFC (XO_NFC)
Expands On NFD (XO_NFD)
Expands On NFKC (XO_NFKC)
Expands On NFKD (XO_NFKD)
Bidi Paired Bracket (bpb) Glyph for U+0601 Arabic Sign Sanah
Bidi Paired Bracket Type (bpt) None
East Asian Width (ea) neutral
Hangul Syllable Type (hst) Not Applicable
ISO 10646 Comment (isc)
Joining Group (jg) No_Joining_Group
Joining Type (jt) Non Joining
Line Break (lb) Numeric
Numeric Type (nt) none
Numeric Value (nv) not a number
Simple Case Folding (scf) Glyph for U+0601 Arabic Sign Sanah
Script Extension (scx)
Vertical Orientation (vo) R