Home: go to the homepage U+0600 to U+06FF Arabic
Glyph for U+0640
Source: Noto Sans Arabic

U+0640 Arabic Tatweel

U+0640 was added in Unicode version 1.1 in 1993. It belongs to the block U+0600 to U+06FF Arabic in the U+0000 to U+FFFF Basic Multilingual Plane.

This character is a Modifier Letter and is commonly used, that is, in no specific script. It is also used in the scripts Adlam, Arabic, Mandaic, Manichaean, Old Uyghur, Psalter Pahlavi, Hanifi Rohingya, Sogdian, Syriac. The character is also known as kashida.

The glyph is not a composition. It has no designated width in East Asian texts. In bidirectional text it is written as Arabic letter from right to left. When changing direction it is not mirrored. The word that U+0640 forms with similar adjacent characters prevents a line break inside it.

The Wikipedia has the following information about this codepoint:

Kashida or Kasheeda (Persian: کَشِیدَه; kašīda; lit. "extended", "stretched", "lengthened"), also known as Tatweel or Tatwīl (Arabic: تَطْوِيل, taṭwīl), is a type of justification in the Arabic language and in some descendant cursive scripts. In contrast to white-space justification, which increases the length of a line of text by expanding spaces between words or individual letters, kasheeda creates justification by elongating characters at certain points. Kasheeda justification can be combined with white-space justification.

The analog in European (Latin-based) typography (expanding or contracting letters to improve spacing) is sometimes called expansion, and falls within microtypography. Kasheeda is considerably easier and more flexible, however, because Arabic–Persian scripts feature prominent horizontal strokes, whose lengths are accordingly flexible.

For example, al-ḥamdu and Raḥīm with and without kasheeda may look like the following:

The terms Kasheeda and Tatweel can also refer to a character that represents this elongation ( ـ ) or to one of a set of glyphs of varying lengths that implement this elongation in a font. The Unicode standard assigns code point U+0640 as Arabic Tatweel.

The kasheeda can take a subtle downward curvature in some calligraphic styles and handwriting. However, the curvilinear stroke is not feasible for most basic fonts, which merely use a completely flat underscore-like stroke for kashida.

In addition to letter spacing and justification, calligraphers also use kasheeda for emphasis and as book or chapter titles. In modern Arabic mathematical notation, kasheeda appears in some operation symbols that must stretch to accommodate associated contents above or below.

Kasheeda generally only appears in one word per line, and one letter per word. Furthermore, experts recommend kasheeda only between certain combinations of letters (typically those that cannot form a ligature). Some calligraphers who were paid by the page used an inordinate number of kasheeda to stretch content over more pages.

The branding of the 2022 FIFA World Cup in Qatar applies kasheeda to Latin script, connecting the bottom of the "t" and the second "a" in the host country's name.


System Representation
UTF-8 D9 80
UTF-16 06 40
UTF-32 00 00 06 40
URL-Quoted %D9%80
HTML hex reference ـ
Wrong windows-1252 Mojibake ◌ـ
alias kashida
Encoding: ISO-8859-6 (hex bytes) E0
Encoding: WINDOWS-1256 (hex bytes) DC
Adobe Glyph List afii57440
Adobe Glyph List kashidaautoarabic
Adobe Glyph List kashidaautonosidebearingarabic
Adobe Glyph List tatweelarabic
digraph ++

Related Characters


Complete Record

Property Value
Age (age) 1.1 (1993)
Unicode Name (na) ARABIC TATWEEL
Unicode 1 Name (na1)
Block (blk) Arabic
General Category (gc) Modifier Letter
Script (sc) Common
Bidirectional Category (bc) Arabic Letter
Combining Class (ccc) Not Reordered
Decomposition Type (dt) none
Decomposition Mapping (dm) Glyph for U+0640 Arabic Tatweel
Lowercase (Lower)
Simple Lowercase Mapping (slc) Glyph for U+0640 Arabic Tatweel
Lowercase Mapping (lc) Glyph for U+0640 Arabic Tatweel
Uppercase (Upper)
Simple Uppercase Mapping (suc) Glyph for U+0640 Arabic Tatweel
Uppercase Mapping (uc) Glyph for U+0640 Arabic Tatweel
Simple Titlecase Mapping (stc) Glyph for U+0640 Arabic Tatweel
Titlecase Mapping (tc) Glyph for U+0640 Arabic Tatweel
Case Folding (cf) Glyph for U+0640 Arabic Tatweel
ASCII Hex Digit (AHex)
Alphabetic (Alpha)
Bidi Control (Bidi_C)
Bidi Mirrored (Bidi_M)
Composition Exclusion (CE)
Case Ignorable (CI)
Changes When Casefolded (CWCF)
Changes When Casemapped (CWCM)
Changes When NFKC Casefolded (CWKCF)
Changes When Lowercased (CWL)
Changes When Titlecased (CWT)
Changes When Uppercased (CWU)
Cased (Cased)
Full Composition Exclusion (Comp_Ex)
Default Ignorable Code Point (DI)
Dash (Dash)
Deprecated (Dep)
Diacritic (Dia)
Emoji Modifier Base (EBase)
Emoji Component (EComp)
Emoji Modifier (EMod)
Emoji Presentation (EPres)
Emoji (Emoji)
Extender (Ext)
Extended Pictographic (ExtPict)
FC NFKC Closure (FC_NFKC) Glyph for U+0640 Arabic Tatweel
Grapheme Cluster Break (GCB) Any
Grapheme Base (Gr_Base)
Grapheme Extend (Gr_Ext)
Grapheme Link (Gr_Link)
Hex Digit (Hex)
Hyphen (Hyphen)
ID Continue (IDC)
ID Start (IDS)
IDS Binary Operator (IDSB)
IDS Trinary Operator and (IDST)
ID_Compat_Math_Continue (ID_Compat_Math_Continue) 0
ID_Compat_Math_Start (ID_Compat_Math_Start) 0
Ideographic (Ideo)
InCB (InCB) None
Indic Mantra Category (InMC)
Indic Positional Category (InPC) NA
Indic Syllabic Category (InSC) Other
Jamo Short Name (JSN)
Join Control (Join_C)
Logical Order Exception (LOE)
Math (Math)
Noncharacter Code Point (NChar)
NFC Quick Check (NFC_QC) Yes
NFD Quick Check (NFD_QC) Yes
NFKC Casefold (NFKC_CF) Glyph for U+0640 Arabic Tatweel
NFKC Quick Check (NFKC_QC) Yes
NFKC_SCF (NFKC_SCF) Glyph for U+0640 Arabic Tatweel
NFKD Quick Check (NFKD_QC) Yes
Other Alphabetic (OAlpha)
Other Default Ignorable Code Point (ODI)
Other Grapheme Extend (OGr_Ext)
Other ID Continue (OIDC)
Other ID Start (OIDS)
Other Lowercase (OLower)
Other Math (OMath)
Other Uppercase (OUpper)
Prepended Concatenation Mark (PCM)
Pattern Syntax (Pat_Syn)
Pattern White Space (Pat_WS)
Quotation Mark (QMark)
Regional Indicator (RI)
Radical (Radical)
Sentence Break (SB) Other Letter
Soft Dotted (SD)
Sentence Terminal (STerm)
Terminal Punctuation (Term)
Unified Ideograph (UIdeo)
Variation Selector (VS)
Word Break (WB) Alphabetic Letter
White Space (WSpace)
XID Continue (XIDC)
XID Start (XIDS)
Expands On NFC (XO_NFC)
Expands On NFD (XO_NFD)
Expands On NFKC (XO_NFKC)
Expands On NFKD (XO_NFKD)
Bidi Paired Bracket (bpb) Glyph for U+0640 Arabic Tatweel
Bidi Paired Bracket Type (bpt) None
East Asian Width (ea) neutral
Hangul Syllable Type (hst) Not Applicable
ISO 10646 Comment (isc)
Joining Group (jg) No_Joining_Group
Joining Type (jt) Join Causing
Line Break (lb) Alphabetic
Numeric Type (nt) none
Numeric Value (nv) not a number
Simple Case Folding (scf) Glyph for U+0640 Arabic Tatweel
Script Extension (scx) Adlam Arabic Mandaic Manichaean Old Uyghur Psalter Pahlavi Hanifi Rohingya Sogdian Syriac
Vertical Orientation (vo) R