Glyph for U+2420 (source: Noto Sans Symbols 2)

U+2420 Symbol for Space

U+2420 was added in Unicode version 1.1 in 1993. It belongs to the block U+2400 to U+243F Control Pictures in the U+0000 to U+FFFF Basic Multilingual Plane.

This character is an Other Symbol and belongs to the Common script, that is, it is not specific to any one script.

The character has no decomposition. Its East Asian Width is Neutral, so it has no designated width in East Asian texts. In bidirectional text it acts as Other Neutral, and it is not mirrored when the text direction changes. Its line-break class is Alphabetic, so no line break is allowed between U+2420 and adjacent characters of the same class.
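
The properties described above can be checked against a local copy of the Unicode Character Database. The following is a minimal sketch using Python's standard `unicodedata` module. The helper name `describe` is hypothetical, and the values reflect whichever Unicode version the Python build ships with.

```python
import unicodedata

def describe(ch: str) -> dict:
    """Collect a few Unicode properties of a single character."""
    return {
        "name": unicodedata.name(ch),
        "category": unicodedata.category(ch),                  # 'So' = Other Symbol
        "bidi": unicodedata.bidirectional(ch),                 # 'ON' = Other Neutral
        "east_asian_width": unicodedata.east_asian_width(ch),  # 'N' = Neutral
        "mirrored": bool(unicodedata.mirrored(ch)),
        "decomposition": unicodedata.decomposition(ch),        # '' when there is none
        "combining": unicodedata.combining(ch),                # 0 = Not Reordered
    }

info = describe("\u2420")
for key, value in info.items():
    print(f"{key}: {value!r}")
```

The standard module does not expose every property listed on this page (line-break class and script, for example, are not available), so a fuller check would need a third-party library or the UCD data files.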

Wikipedia has the following information about this codepoint:

In writing, a space ( ) is a blank area that separates words, sentences, syllables (in syllabification) and other written or printed glyphs (characters). Conventions for spacing vary among languages, and in some languages the spacing rules are complex. Inter-word spaces ease the reader's task of identifying words, and avoid outright ambiguities such as "now here" vs. "nowhere". They also provide convenient guides for where a human or program may start new lines.

Typesetting can use spaces of varying widths, just as it can use graphic characters of varying widths. Unlike graphic characters, typeset spaces are commonly stretched in order to align text. The typewriter, on the other hand, typically has only one width for all characters, including spaces. Following widespread acceptance of the typewriter, some typewriter conventions influenced typography and the design of printed works.

Computer representation of text facilitates getting around mechanical and physical limitations such as character widths in at least two ways:

  • Character encodings such as Unicode provide spaces of several widths, which are encoded using distinct numeric code points. For example, Unicode U+0020 is the "normal" space character, but U+00A0 adds the meaning that a new line should not be started there, while U+2003 represents a space with a fixed width of one em. Collectively, such characters are called whitespace characters.
  • Formatting and drawing languages and software commonly provide much more flexibility in spacing. For example, SVG, PostScript, and countless other languages enable drawing characters at specific (x,y) coordinates on a screen or page. By drawing each word at a specific starting coordinate, such programs need not "draw" spaces at all (this can lead to difficulties in extracting the correct text back out). Similarly, word processors can "fully justify" text, stretching inter-word spaces to make all lines the same length (as can mechanical Linotype machines). Precision is limited by physical capabilities of output devices.
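
To illustrate the first point, here is a small sketch that lists the three space characters mentioned above and makes them visible by substituting U+2420. The function name `show_spaces` is hypothetical and not part of any library. Using U+2420 as the stand-in is a display convention assumed for this example, not something the encoding requires.

```python
import unicodedata

SPACES = ["\u0020", "\u00a0", "\u2003"]  # space, no-break space, em space
SYMBOL_FOR_SPACE = "\u2420"

def show_spaces(text: str) -> str:
    """Replace every space separator (general category Zs) with U+2420."""
    return "".join(
        SYMBOL_FOR_SPACE if unicodedata.category(ch) == "Zs" else ch
        for ch in text
    )

for ch in SPACES:
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)} isspace={ch.isspace()}")

# U+2420 itself is a visible symbol, so it does not count as whitespace.
print(SYMBOL_FOR_SPACE.isspace())
print(show_spaces("now here"))
```

Matching on category Zs deliberately leaves tabs and line breaks alone, since those have their own control pictures in the same block.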

Representations

System Representation
Decimal 9248
UTF-8 E2 90 A0
UTF-16 24 20
UTF-32 00 00 24 20
URL-Quoted %E2%90%A0
HTML hex reference &#x2420;
Wrong windows-1252 Mojibake â followed by U+0090 and U+00A0 (both invisible)
Encoding: GB18030 (hex bytes) 81 37 86 32
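
Most of these representations can be reproduced with Python's standard library. This is a sketch rather than the method used to build the table. The helper `hex_bytes` is hypothetical, and big-endian byte order without a byte order mark is assumed for UTF-16 and UTF-32 because the table lists the bytes in that order.

```python
from urllib.parse import quote

ch = "\u2420"

def hex_bytes(data: bytes) -> str:
    """Format bytes as space-separated uppercase hex pairs."""
    return data.hex(" ").upper()

representations = {
    "Decimal": str(ord(ch)),
    "UTF-8": hex_bytes(ch.encode("utf-8")),
    "UTF-16 (big-endian)": hex_bytes(ch.encode("utf-16-be")),
    "UTF-32 (big-endian)": hex_bytes(ch.encode("utf-32-be")),
    "URL-quoted": quote(ch),                   # percent-encodes the UTF-8 bytes
    "HTML hex reference": f"&#x{ord(ch):X};",
    "GB18030": hex_bytes(ch.encode("gb18030")),
}

for label, value in representations.items():
    print(f"{label}: {value}")
```

The mojibake row is left out on purpose: Python's `cp1252` codec rejects byte 0x90, so reproducing that row would need a more lenient decoder.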

Complete Record

Property Value
Age (age) 1.1 (1993)
Unicode Name (na) SYMBOL FOR SPACE
Unicode 1 Name (na1) GRAPHIC FOR SPACE
Block (blk) Control Pictures
General Category (gc) Other Symbol
Script (sc) Common
Bidirectional Category (bc) Other Neutral
Combining Class (ccc) Not Reordered
Decomposition Type (dt) none
Decomposition Mapping (dm) U+2420 Symbol for Space
Lowercase (Lower)
Simple Lowercase Mapping (slc) U+2420 Symbol for Space
Lowercase Mapping (lc) U+2420 Symbol for Space
Uppercase (Upper)
Simple Uppercase Mapping (suc) U+2420 Symbol for Space
Uppercase Mapping (uc) U+2420 Symbol for Space
Simple Titlecase Mapping (stc) U+2420 Symbol for Space
Titlecase Mapping (tc) U+2420 Symbol for Space
Case Folding (cf) U+2420 Symbol for Space
ASCII Hex Digit (AHex)
Alphabetic (Alpha)
Bidi Control (Bidi_C)
Bidi Mirrored (Bidi_M)
Composition Exclusion (CE)
Case Ignorable (CI)
Changes When Casefolded (CWCF)
Changes When Casemapped (CWCM)
Changes When NFKC Casefolded (CWKCF)
Changes When Lowercased (CWL)
Changes When Titlecased (CWT)
Changes When Uppercased (CWU)
Cased (Cased)
Full Composition Exclusion (Comp_Ex)
Default Ignorable Code Point (DI)
Dash (Dash)
Deprecated (Dep)
Diacritic (Dia)
Emoji Modifier Base (EBase)
Emoji Component (EComp)
Emoji Modifier (EMod)
Emoji Presentation (EPres)
Emoji (Emoji)
Extender (Ext)
Extended Pictographic (ExtPict)
FC NFKC Closure (FC_NFKC) U+2420 Symbol for Space
Grapheme Cluster Break (GCB) Any
Grapheme Base (Gr_Base)
Grapheme Extend (Gr_Ext)
Grapheme Link (Gr_Link)
Hex Digit (Hex)
Hyphen (Hyphen)
ID Continue (IDC)
ID Start (IDS)
IDS Binary Operator (IDSB)
IDS Trinary Operator (IDST)
IDS Unary Operator (IDSU) 0
ID_Compat_Math_Continue (ID_Compat_Math_Continue) 0
ID_Compat_Math_Start (ID_Compat_Math_Start) 0
Ideographic (Ideo)
Indic Conjunct Break (InCB) None
Indic Matra Category (InMC)
Indic Positional Category (InPC) NA
Indic Syllabic Category (InSC) Other
Jamo Short Name (JSN)
Join Control (Join_C)
Logical Order Exception (LOE)
Modifier Combining Mark (MCM)
Math (Math)
Noncharacter Code Point (NChar)
NFC Quick Check (NFC_QC) Yes
NFD Quick Check (NFD_QC) Yes
NFKC Casefold (NFKC_CF) U+2420 Symbol for Space
NFKC Quick Check (NFKC_QC) Yes
NFKC Simple Casefold (NFKC_SCF) U+2420 Symbol for Space
NFKD Quick Check (NFKD_QC) Yes
Other Alphabetic (OAlpha)
Other Default Ignorable Code Point (ODI)
Other Grapheme Extend (OGr_Ext)
Other ID Continue (OIDC)
Other ID Start (OIDS)
Other Lowercase (OLower)
Other Math (OMath)
Other Uppercase (OUpper)
Prepended Concatenation Mark (PCM)
Pattern Syntax (Pat_Syn)
Pattern White Space (Pat_WS)
Quotation Mark (QMark)
Regional Indicator (RI)
Radical (Radical)
Sentence Break (SB) Other
Soft Dotted (SD)
Sentence Terminal (STerm)
Terminal Punctuation (Term)
Unified Ideograph (UIdeo)
Variation Selector (VS)
Word Break (WB) Other
White Space (WSpace)
XID Continue (XIDC)
XID Start (XIDS)
Expands On NFC (XO_NFC)
Expands On NFD (XO_NFD)
Expands On NFKC (XO_NFKC)
Expands On NFKD (XO_NFKD)
Bidi Paired Bracket (bpb) U+2420 Symbol for Space
Bidi Paired Bracket Type (bpt) None
East Asian Width (ea) neutral
Hangul Syllable Type (hst) Not Applicable
ISO 10646 Comment (isc)
Joining Group (jg) No_Joining_Group
Joining Type (jt) Non Joining
Line Break (lb) Alphabetic
Numeric Type (nt) none
Numeric Value (nv) not a number
Simple Case Folding (scf) U+2420 Symbol for Space
Script Extension (scx)
Vertical Orientation (vo) U
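
The record above lists U+2420 itself as the value of every case mapping and normalization-related mapping, which means those operations leave the character unchanged. A short sketch to confirm this with Python's standard library follows. The variable names are illustrative only, and the result depends on the Unicode data bundled with the interpreter.

```python
import unicodedata

ch = "\u2420"

# All four normalization forms should return the character unchanged.
normal_forms = {
    form: unicodedata.normalize(form, ch)
    for form in ("NFC", "NFD", "NFKC", "NFKD")
}

# Case mappings and case folding should also be the identity here.
case_results = {
    "lower": ch.lower(),
    "upper": ch.upper(),
    "title": ch.title(),
    "casefold": ch.casefold(),
}

for label, result in {**normal_forms, **case_results}.items():
    print(f"{label}: unchanged={result == ch}")
```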