U+205B FOUR DOT MARK

U+205B was added to Unicode in version 4.1 (2005). It belongs to the block General Punctuation in the Basic Multilingual Plane.

This character is an Other Punctuation mark and belongs to the Common script, that is, it is not tied to any specific script.

The glyph is not a composition. It has a Neutral East Asian Width. In bidirectional context it acts as Other Neutral and is not mirrored. In text U+205B behaves as Break After regarding line breaks. It has type Other for sentence and Other for word breaks. The Grapheme Cluster Break is Any.
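Several of the properties above can be checked programmatically. The following is a minimal sketch using Python's standard `unicodedata` module; the helper name `describe` is an illustrative assumption, and the values returned depend on the Unicode version bundled with the interpreter. The standard library does not expose the line, word, sentence, or grapheme cluster break properties, so those are not covered here.

```python
import unicodedata

ch = "\u205B"  # FOUR DOT MARK


def describe(c):
    """Collect the properties the standard library exposes for one character.

    `describe` is a hypothetical helper written for this example.
    """
    return {
        "name": unicodedata.name(c),
        "general_category": unicodedata.category(c),     # short alias, e.g. "Po"
        "bidi_class": unicodedata.bidirectional(c),      # short alias, e.g. "ON"
        "mirrored": bool(unicodedata.mirrored(c)),
        "east_asian_width": unicodedata.east_asian_width(c),
        "combining_class": unicodedata.combining(c),
        "decomposition": unicodedata.decomposition(c),   # empty string if none
    }


props = describe(ch)
for key, value in props.items():
    print(f"{key}: {value!r}")
```

The short aliases map onto the long names used in this record: `Po` is Other Punctuation, `ON` is Other Neutral, and `N` is Neutral.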

Wikipedia has the following information about this codepoint:

Obelism is the practice of annotating manuscripts with marks set in the margins. Modern obelisms are used by editors when blue-penciling a manuscript or typescript. Examples are "stet" (which is Latin for "Let it stand," used in this context to mean "disregard the previous mark") and "dele" (for "Delete").

The obelos symbol (see obelus) gets its name from the spit, or sharp end of a lance in ancient Greek. An obelos was placed by editors on the margins of manuscripts, especially in Homer, to indicate lines that were doubtfully Homer's. The system was developed by Aristarchus and notably used later by Origen in his Hexapla. Origen marked spurious words between obelos and metobelos.

There were many other such shorthand symbols, to indicate corrections, emendations, deletions, additions, and so on. Most used are the editorial coronis, the paragraphos, the forked paragraphos, the reversed forked paragraphos, the hypodiastole, the downwards ancora, the upwards ancora, and the dotted right-pointing angle, which is also known as the diple periestigmene. Loosely, all these symbols, and the act of annotation by means of them, are obelism.

These nine ancient Greek textual annotation symbols are also included in the supplemental punctuation list of ISO/IEC 10646, the international standard for character sets.

Unicode encodes the following:

  • U+2058 ⁘ four dot punctuation
  • U+2059 ⁙ five dot punctuation (Greek pentonkion)
  • U+205A ⁚ two dot punctuation
  • U+205B ⁛ four dot mark
  • U+205C ⁜ dotted cross
  • U+2E0E ⸎ editorial coronis
  • U+2E0F ⸏ paragraphos
  • U+2E10 ⸐ forked paragraphos
  • U+2E11 ⸑ reversed forked paragraphos
  • U+2E12 ⸒ hypodiastole
  • U+2E13 ⸓ dotted obelos
  • U+2E14 ⸔ downwards ancora
  • U+2E15 ⸕ upwards ancora
  • U+2E16 ⸖ dotted right-pointing angle (diple periestigmene)

Some of these were also used in Ancient Greek punctuation as word dividers.

Representations

System Representation
Decimal 8283
UTF-8 E2 81 9B
UTF-16 20 5B
UTF-32 00 00 20 5B
URL-Quoted %E2%81%9B
HTML-Escape &#x205B;
Wrong windows-1252 Mojibake â›
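The representations in this table can be reproduced with a few lines of Python. This is a sketch using only the standard library; the helper `hex_bytes` and the variable names are assumptions made for the example. The big-endian codecs are used so that no byte order mark is emitted. For the mojibake row, note that byte 0x81 has no assignment in Python's `cp1252` codec, so the sketch decodes with `errors="replace"`, which substitutes U+FFFD for that byte.

```python
import html
import urllib.parse

ch = "\u205B"  # FOUR DOT MARK


def hex_bytes(data):
    """Format a bytes object as space-separated uppercase hex pairs."""
    return " ".join(f"{b:02X}" for b in data)


decimal = ord(ch)
utf8 = hex_bytes(ch.encode("utf-8"))
utf16 = hex_bytes(ch.encode("utf-16-be"))   # big-endian, no BOM
utf32 = hex_bytes(ch.encode("utf-32-be"))   # big-endian, no BOM
url_quoted = urllib.parse.quote(ch)         # percent-encodes the UTF-8 bytes
html_escape = f"&#x{ord(ch):04X};"          # hexadecimal character reference

# Misreading the UTF-8 bytes as windows-1252 produces mojibake.
# Byte 0x81 is undefined in Python's cp1252 codec, hence errors="replace".
mojibake = ch.encode("utf-8").decode("cp1252", errors="replace")

print(decimal, utf8, utf16, utf32, url_quoted, html_escape, sep="\n")
```

Passing the character reference back through `html.unescape` returns the original character, which is a quick way to confirm that the escape is well formed.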

Complete Record

Property Value
Age (age) 4.1
Unicode Name (na) FOUR DOT MARK
Unicode 1 Name (na1)
Block (blk) General Punctuation
General Category (gc) Other Punctuation
Script (sc) Common
Bidirectional Category (bc) Other Neutral
Combining Class (ccc) Not Reordered
Decomposition Type (dt) None
Decomposition Mapping (dm) ⁛
Lowercase (Lower)
Simple Lowercase Mapping (slc) ⁛
Lowercase Mapping (lc) ⁛
Uppercase (Upper)
Simple Uppercase Mapping (suc) ⁛
Uppercase Mapping (uc) ⁛
Simple Titlecase Mapping (stc) ⁛
Titlecase Mapping (tc) ⁛
Case Folding (cf) ⁛
ASCII Hex Digit (AHex)
Alphabetic (Alpha)
Bidi Control (Bidi_C)
Bidi Mirrored (Bidi_M)
Bidi Paired Bracket (bpb) ⁛
Bidi Paired Bracket Type (bpt) None
Cased (Cased)
Composition Exclusion (CE)
Case Ignorable (CI)
Full Composition Exclusion (Comp_Ex)
Changes When Casefolded (CWCF)
Changes When Casemapped (CWCM)
Changes When NFKC Casefolded (CWKCF)
Changes When Lowercased (CWL)
Changes When Titlecased (CWT)
Changes When Uppercased (CWU)
Dash (Dash)
Deprecated (Dep)
Default Ignorable Code Point (DI)
Diacritic (Dia)
East Asian Width (ea) Neutral
Extender (Ext)
FC NFKC Closure (FC_NFKC) ⁛
Grapheme Cluster Break (GCB) Any
Grapheme Base (Gr_Base)
Grapheme Extend (Gr_Ext)
Hex Digit (Hex)
Hangul Syllable Type (hst) Not Applicable
Hyphen (Hyphen)
ID Continue (IDC)
Ideographic (Ideo)
ID Start (IDS)
IDS Binary Operator (IDSB)
IDS Trinary Operator (IDST)
InMC (InMC)
Indic Positional Category (InPC) NA
Indic Syllabic Category (InSC) Other
ISO 10646 Comment (isc)
Joining Group (jg) No_Joining_Group
Join Control (Join_C)
Jamo Short Name (JSN)
Joining Type (jt) Non Joining
Line Break (lb) Break After
Logical Order Exception (LOE)
Math (Math)
Noncharacter Code Point (NChar)
NFC Quick Check (NFC_QC) Yes
NFD Quick Check (NFD_QC) Yes
NFKC Casefold (NFKC_CF) ⁛
NFKC Quick Check (NFKC_QC) Yes
NFKD Quick Check (NFKD_QC) Yes
Numeric Type (nt) None
Numeric Value (nv) NaN
Other Alphabetic (OAlpha)
Other Default Ignorable Code Point (ODI)
Other Grapheme Extend (OGr_Ext)
Other ID Continue (OIDC)
Other ID Start (OIDS)
Other Lowercase (OLower)
Other Math (OMath)
Other Uppercase (OUpper)
Pattern Syntax (Pat_Syn)
Pattern White Space (Pat_WS)
Quotation Mark (QMark)
Radical (Radical)
Sentence Break (SB) Other
Simple Case Folding (scf) ⁛
Script Extension (scx) Common
Soft Dotted (SD)
STerm (STerm)
Terminal Punctuation (Term)
Unified Ideograph (UIdeo)
Variation Selector (VS)
Word Break (WB) Other
White Space (WSpace)
XID Continue (XIDC)
XID Start (XIDS)
Expands On NFC (XO_NFC)
Expands On NFD (XO_NFD)
Expands On NFKC (XO_NFKC)
Expands On NFKD (XO_NFKD)
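The record lists the character itself for every case mapping and a quick-check value of Yes for all four normalization forms, which means U+205B should pass through those operations unchanged. A minimal sketch of that check in Python follows; the variable names are assumptions for the example, and the results reflect the Unicode data shipped with the interpreter.

```python
import unicodedata

ch = "\u205B"  # FOUR DOT MARK

# Apply each normalization form; a quick-check value of Yes implies no change.
normalized = {
    form: unicodedata.normalize(form, ch)
    for form in ("NFC", "NFD", "NFKC", "NFKD")
}

# Apply the case operations that Python's str type provides.
case_variants = {
    "lower": ch.lower(),
    "upper": ch.upper(),
    "title": ch.title(),
    "casefold": ch.casefold(),
}

is_stable = all(v == ch for v in normalized.values()) and all(
    v == ch for v in case_variants.values()
)

# The empty binary properties in the record correspond to False here.
flags = {
    "alphabetic": ch.isalpha(),
    "white_space": ch.isspace(),
    "identifier": ch.isidentifier(),
    "numeric_value": unicodedata.numeric(ch, None),  # None when no value exists
}

print(is_stable, flags)
```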