Home: go to the homepage U+2000 to U+206F General Punctuation
Glyph for U+200D
Source: Noto Emoji

U+200D Zero Width Joiner

U+200D was added in Unicode version 1.1 in 1993. It belongs to the block U+2000 to U+206F General Punctuation in the U+0000 to U+FFFF Basic Multilingual Plane.

This character is a Format and inherits its script property from the preceding character.

The glyph is not a composition. It has no designated width in East Asian texts. In bidirectional text it acts as Boundary Neutral. When changing direction it is not mirrored. U+200D prohibits a line break around it.

The Wikipedia has the following information about this codepoint:

The zero-width joiner (ZWJ, ; rendered: ; HTML entity: ‍ or ‍) is a non-printing character used in the computerized typesetting of writing systems in which the shape or positioning of a grapheme depends on its relation to other graphemes (complex scripts), such as the Arabic script or any Indic script. Sometimes the Roman script is to be counted as complex, e.g. when using a Fraktur typeface. When placed between two characters that would otherwise not be connected, a ZWJ causes them to be printed in their connected forms.

The exact behaviour of the ZWJ varies depending on whether the use of a conjunct consonant or ligature (where multiple characters are shown with a single glyph) is expected by default; for instance, it suppresses the use of conjuncts in Devanagari (whilst still allowing the use of the individual joining form of a dead consonant, as opposed to a halant form as would be required by the zero-width non-joiner), but induces the use of conjuncts in Sinhala (which does not use them by default). Similarly to Sinhala, when a ZWJ is placed between two emoji characters (or interspersed between multiple), it can result in a single glyph being shown, such as the family emoji, made up of two adult emoji and one or two child emoji.

In some cases, such as the second Devanagari example below, the ZWJ can be used to display a joining form in isolation, when included after the character and combining halant code.

The character's code point is U+200D ZERO WIDTH JOINER (‍). In the InScript keyboard layout for Indian languages, it is typed by the key combination Ctrl+Shift+1. However, many layouts use the position of QWERTY's ']' key for this character.

Representations

System Representation
8205
UTF-8 E2 80 8D
UTF-16 20 0D
UTF-32 00 00 20 0D
URL-Quoted %E2%80%8D
HTML hex reference ‍
Wrong windows-1252 Mojibake ‍
HTML named entity ‍
abbreviation ZWJ
Encoding: CP1256 (hex bytes) 9E
Encoding: GB18030 (hex bytes) 81 36 A4 39
AGL: Latin-5 uni200D
Adobe Glyph List afii301

Elsewhere

Complete Record

Property Value
Age (age) 1.1 (1993)
Unicode Name (na) ZERO WIDTH JOINER
Unicode 1 Name (na1)
Block (blk) General Punctuation
General Category (gc) Format
Script (sc) Inherited
Bidirectional Category (bc) Boundary Neutral
Combining Class (ccc) Not Reordered
Decomposition Type (dt) none
Decomposition Mapping (dm) Glyph for U+200D Zero Width Joiner
Lowercase (Lower)
Simple Lowercase Mapping (slc) Glyph for U+200D Zero Width Joiner
Lowercase Mapping (lc) Glyph for U+200D Zero Width Joiner
Uppercase (Upper)
Simple Uppercase Mapping (suc) Glyph for U+200D Zero Width Joiner
Uppercase Mapping (uc) Glyph for U+200D Zero Width Joiner
Simple Titlecase Mapping (stc) Glyph for U+200D Zero Width Joiner
Titlecase Mapping (tc) Glyph for U+200D Zero Width Joiner
Case Folding (cf) Glyph for U+200D Zero Width Joiner
ASCII Hex Digit (AHex)
Alphabetic (Alpha)
Bidi Control (Bidi_C)
Bidi Mirrored (Bidi_M)
Composition Exclusion (CE)
Case Ignorable (CI)
Changes When Casefolded (CWCF)
Changes When Casemapped (CWCM)
Changes When NFKC Casefolded (CWKCF)
Changes When Lowercased (CWL)
Changes When Titlecased (CWT)
Changes When Uppercased (CWU)
Cased (Cased)
Full Composition Exclusion (Comp_Ex)
Default Ignorable Code Point (DI)
Dash (Dash)
Deprecated (Dep)
Diacritic (Dia)
Emoji Modifier Base (EBase)
Emoji Component (EComp)
Emoji Modifier (EMod)
Emoji Presentation (EPres)
Emoji (Emoji)
Extender (Ext)
Extended Pictographic (ExtPict)
FC NFKC Closure (FC_NFKC) Glyph for U+200D Zero Width Joiner
Grapheme Cluster Break (GCB) Zero Width Joiner
Grapheme Base (Gr_Base)
Grapheme Extend (Gr_Ext)
Grapheme Link (Gr_Link)
Hex Digit (Hex)
Hyphen (Hyphen)
ID Continue (IDC)
ID Start (IDS)
IDS Binary Operator (IDSB)
IDS Trinary Operator and (IDST)
IDSU (IDSU) 0
ID_Compat_Math_Continue (ID_Compat_Math_Continue) 0
ID_Compat_Math_Start (ID_Compat_Math_Start) 0
Ideographic (Ideo)
InCB (InCB) Extend
Indic Mantra Category (InMC)
Indic Positional Category (InPC) NA
Indic Syllabic Category (InSC) Joiner
Jamo Short Name (JSN)
Join Control (Join_C)
Logical Order Exception (LOE)
Modifier Combining Mark (MCM)
Math (Math)
Noncharacter Code Point (NChar)
NFC Quick Check (NFC_QC) Yes
NFD Quick Check (NFD_QC) Yes
NFKC Quick Check (NFKC_QC) Yes
NFKD Quick Check (NFKD_QC) Yes
Other Alphabetic (OAlpha)
Other Default Ignorable Code Point (ODI)
Other Grapheme Extend (OGr_Ext)
Other ID Continue (OIDC)
Other ID Start (OIDS)
Other Lowercase (OLower)
Other Math (OMath)
Other Uppercase (OUpper)
Prepended Concatenation Mark (PCM)
Pattern Syntax (Pat_Syn)
Pattern White Space (Pat_WS)
Quotation Mark (QMark)
Regional Indicator (RI)
Radical (Radical)
Sentence Break (SB) Extend
Soft Dotted (SD)
Sentence Terminal (STerm)
Terminal Punctuation (Term)
Unified Ideograph (UIdeo)
Variation Selector (VS)
Word Break (WB) Zero Width Joiner
White Space (WSpace)
XID Continue (XIDC)
XID Start (XIDS)
Expands On NFC (XO_NFC)
Expands On NFD (XO_NFD)
Expands On NFKC (XO_NFKC)
Expands On NFKD (XO_NFKD)
Bidi Paired Bracket (bpb) Glyph for U+200D Zero Width Joiner
Bidi Paired Bracket Type (bpt) None
East Asian Width (ea) neutral
Hangul Syllable Type (hst) Not Applicable
ISO 10646 Comment (isc)
Joining Group (jg) No_Joining_Group
Joining Type (jt) Join Causing
Line Break (lb) Zero Width Joiner
Numeric Type (nt) none
Numeric Value (nv) not a number
Simple Case Folding (scf) Glyph for U+200D Zero Width Joiner
Script Extension (scx)
Vertical Orientation (vo) R