Home: go to the homepage U+0000 to U+007F Basic Latin
Glyph for U+0020
Source: Noto Sans Symbols 2

U+0020 Space

U+0020 was added in Unicode version 1.1 in 1993. It belongs to the block U+0000 to U+007F Basic Latin in the U+0000 to U+FFFF Basic Multilingual Plane.

This character is a Space Separator and is commonly used, that is, in no specific script.

The glyph is not a composition. Its East Asian Width is narrow. In bidirectional text it acts as White Space. When changing direction it is not mirrored. U+0020 allows line breaks at its position. The glyph can be confused with 19 other glyphs.

The Wikipedia has the following information about this codepoint:

In writing, a space ( ) is a blank area that separates words, sentences, syllables (in syllabification) and other written or printed glyphs (characters). Conventions for spacing vary among languages, and in some languages the spacing rules are complex. Inter-word spaces ease the reader's task of identifying words, and avoid outright ambiguities such as "now here" vs. "nowhere". They also provide convenient guides for where a human or program may start new lines.

Typesetting can use spaces of varying widths, just as it can use graphic characters of varying widths. Unlike graphic characters, typeset spaces are commonly stretched in order to align text. The typewriter, on the other hand, typically has only one width for all characters, including spaces. Following widespread acceptance of the typewriter, some typewriter conventions influenced typography and the design of printed works.

Computer representation of text facilitates getting around mechanical and physical limitations such as character widths in at least two ways:

  • Character encodings such as Unicode provide spaces of several widths, which are encoded using distinct numeric code points. For example, Unicode U+0020 is the "normal" space character, but U+00A0 adds the meaning that a new line should not be started there, while U+2003 represents a space with a fixed width of one em. Collectively, such characters are called Whitespace characters.
  • Formatting and drawing languages and software commonly provide much more flexibility in spacing. For example, SVG, PostScript, and countless other languages enable drawing characters at specific (x,y) coordinates on a screen or page. By drawing each word at a specific starting coordinate, such programs need not "draw" spaces at all (this can lead to difficulties in extracting the correct text back out). Similarly, word processors can "fully justify" text, stretching inter-word spaces to make all lines the same length (as can mechanical Linotype machines). Precision is limited by physical capabilities of output devices.

Representations

System Representation
32
UTF-8 20
UTF-16 00 20
UTF-32 00 00 00 20
URL-Quoted %20
HTML hex reference  
Wrong windows-1252 Mojibake ␠
abbreviation SP
Encoding: ASCII (hex bytes) 20
Encoding: BIG5 (hex bytes) 20
Encoding: BIG5HKSCS (hex bytes) 20
Encoding: CP037 (hex bytes) 40
Encoding: CP273 (hex bytes) 40
Encoding: CP424 (hex bytes) 40
Encoding: CP437 (hex bytes) 20
Encoding: CP500 (hex bytes) 40
Encoding: CP720 (hex bytes) 20
Encoding: CP737 (hex bytes) 20
Encoding: CP775 (hex bytes) 20
Encoding: CP850 (hex bytes) 20
Encoding: CP852 (hex bytes) 20
Encoding: CP855 (hex bytes) 20
Encoding: CP856 (hex bytes) 20
Encoding: CP857 (hex bytes) 20
Encoding: CP858 (hex bytes) 20
Encoding: CP860 (hex bytes) 20
Encoding: CP861 (hex bytes) 20
Encoding: CP862 (hex bytes) 20
Encoding: CP863 (hex bytes) 20
Encoding: CP864 (hex bytes) 20
Encoding: CP865 (hex bytes) 20
Encoding: CP866 (hex bytes) 20
Encoding: CP869 (hex bytes) 20
Encoding: CP874 (hex bytes) 20
Encoding: CP875 (hex bytes) 40
Encoding: CP932 (hex bytes) 20
Encoding: CP949 (hex bytes) 20
Encoding: CP950 (hex bytes) 20
Encoding: CP1006 (hex bytes) 20
Encoding: CP1026 (hex bytes) 40
Encoding: CP1125 (hex bytes) 20
Encoding: CP1140 (hex bytes) 40
Encoding: CP1250 (hex bytes) 20
Encoding: CP1251 (hex bytes) 20
Encoding: CP1252 (hex bytes) 20
Encoding: CP1253 (hex bytes) 20
Encoding: CP1254 (hex bytes) 20
Encoding: CP1255 (hex bytes) 20
Encoding: CP1256 (hex bytes) 20
Encoding: CP1257 (hex bytes) 20
Encoding: CP1258 (hex bytes) 20
Encoding: EUC_JP (hex bytes) 20
Encoding: EUC_JIS_2004 (hex bytes) 20
Encoding: EUC_JISX0213 (hex bytes) 20
Encoding: EUC_KR (hex bytes) 20
Encoding: GB2312 (hex bytes) 20
Encoding: GBK (hex bytes) 20
Encoding: GB18030 (hex bytes) 20
Encoding: HZ (hex bytes) 20
Encoding: ISO2022_JP (hex bytes) 20
Encoding: ISO2022_JP_1 (hex bytes) 20
Encoding: ISO2022_JP_2 (hex bytes) 20
Encoding: ISO2022_JP_2004 (hex bytes) 20
Encoding: ISO2022_JP_3 (hex bytes) 20
Encoding: ISO2022_JP_EXT (hex bytes) 20
Encoding: ISO2022_KR (hex bytes) 20
Encoding: LATIN_1 (hex bytes) 20
Encoding: ISO8859_2 (hex bytes) 20
Encoding: ISO8859_3 (hex bytes) 20
Encoding: ISO8859_4 (hex bytes) 20
Encoding: ISO8859_5 (hex bytes) 20
Encoding: ISO8859_6 (hex bytes) 20
Encoding: ISO8859_7 (hex bytes) 20
Encoding: ISO8859_8 (hex bytes) 20
Encoding: ISO8859_9 (hex bytes) 20
Encoding: ISO8859_10 (hex bytes) 20
Encoding: ISO8859_11 (hex bytes) 20
Encoding: ISO8859_13 (hex bytes) 20
Encoding: ISO8859_14 (hex bytes) 20
Encoding: ISO8859_15 (hex bytes) 20
Encoding: ISO8859_16 (hex bytes) 20
Encoding: JOHAB (hex bytes) 20
Encoding: KOI8_R (hex bytes) 20
Encoding: KOI8_T (hex bytes) 20
Encoding: KOI8_U (hex bytes) 20
Encoding: KZ1048 (hex bytes) 20
Encoding: MAC_CYRILLIC (hex bytes) 20
Encoding: MAC_GREEK (hex bytes) 20
Encoding: MAC_ICELAND (hex bytes) 20
Encoding: MAC_LATIN2 (hex bytes) 20
Encoding: MAC_ROMAN (hex bytes) 20
Encoding: MAC_TURKISH (hex bytes) 20
Encoding: PTCP154 (hex bytes) 20
Encoding: SHIFT_JIS (hex bytes) 20
Encoding: SHIFT_JIS_2004 (hex bytes) 20
Encoding: SHIFT_JISX0213 (hex bytes) 20
Encoding: CP037 (hex bytes) 40
Encoding: CP1025 (hex bytes) 40
Encoding: CP1047 (hex bytes) 40
Encoding: CP1097 (hex bytes) 40
Encoding: CP1112 (hex bytes) 40
Encoding: CP1122 (hex bytes) 40
Encoding: CP1123 (hex bytes) 40
Encoding: CP1140 (hex bytes) 40
Encoding: CP1141 (hex bytes) 40
Encoding: CP1142 (hex bytes) 40
Encoding: CP1143 (hex bytes) 40
Encoding: CP1144 (hex bytes) 40
Encoding: CP1145 (hex bytes) 40
Encoding: CP1146 (hex bytes) 40
Encoding: CP1147 (hex bytes) 40
Encoding: CP1148 (hex bytes) 40
Encoding: CP1148MS (hex bytes) 40
Encoding: CP1149 (hex bytes) 40
Encoding: CP273 (hex bytes) 40
Encoding: CP277 (hex bytes) 40
Encoding: CP278 (hex bytes) 40
Encoding: CP280 (hex bytes) 40
Encoding: CP284 (hex bytes) 40
Encoding: CP285 (hex bytes) 40
Encoding: CP290 (hex bytes) 40
Encoding: CP297 (hex bytes) 40
Encoding: CP420 (hex bytes) 40
Encoding: CP424 (hex bytes) 40
Encoding: CP500 (hex bytes) 40
Encoding: CP500MS (hex bytes) 40
Encoding: CP833 (hex bytes) 40
Encoding: CP838 (hex bytes) 40
Encoding: CP870 (hex bytes) 40
Encoding: CP871 (hex bytes) 40
Encoding: CP875 (hex bytes) 40
LATEX \space
AGL: Latin-1 space
AGL: Latin-2 space
AGL: Latin-3 space
AGL: Latin-4 space
AGL: Latin-5 space
Adobe Glyph List space
Adobe Glyph List spacehackarabic
digraph SP

Related Characters

Confusables

Elsewhere

Complete Record

Property Value
Age (age) 1.1 (1993)
Unicode Name (na) SPACE
Unicode 1 Name (na1)
Block (blk) Basic Latin
General Category (gc) Space Separator
Script (sc) Common
Bidirectional Category (bc) White Space
Combining Class (ccc) Not Reordered
Decomposition Type (dt) none
Decomposition Mapping (dm) Glyph for U+0020 Space
Lowercase (Lower)
Simple Lowercase Mapping (slc) Glyph for U+0020 Space
Lowercase Mapping (lc) Glyph for U+0020 Space
Uppercase (Upper)
Simple Uppercase Mapping (suc) Glyph for U+0020 Space
Uppercase Mapping (uc) Glyph for U+0020 Space
Simple Titlecase Mapping (stc) Glyph for U+0020 Space
Titlecase Mapping (tc) Glyph for U+0020 Space
Case Folding (cf) Glyph for U+0020 Space
ASCII Hex Digit (AHex)
Alphabetic (Alpha)
Bidi Control (Bidi_C)
Bidi Mirrored (Bidi_M)
Composition Exclusion (CE)
Case Ignorable (CI)
Changes When Casefolded (CWCF)
Changes When Casemapped (CWCM)
Changes When NFKC Casefolded (CWKCF)
Changes When Lowercased (CWL)
Changes When Titlecased (CWT)
Changes When Uppercased (CWU)
Cased (Cased)
Full Composition Exclusion (Comp_Ex)
Default Ignorable Code Point (DI)
Dash (Dash)
Deprecated (Dep)
Diacritic (Dia)
Emoji Modifier Base (EBase)
Emoji Component (EComp)
Emoji Modifier (EMod)
Emoji Presentation (EPres)
Emoji (Emoji)
Extender (Ext)
Extended Pictographic (ExtPict)
FC NFKC Closure (FC_NFKC) Glyph for U+0020 Space
Grapheme Cluster Break (GCB) Any
Grapheme Base (Gr_Base)
Grapheme Extend (Gr_Ext)
Grapheme Link (Gr_Link)
Hex Digit (Hex)
Hyphen (Hyphen)
ID Continue (IDC)
ID Start (IDS)
IDS Binary Operator (IDSB)
IDS Trinary Operator and (IDST)
IDSU (IDSU) 0
ID_Compat_Math_Continue (ID_Compat_Math_Continue) 0
ID_Compat_Math_Start (ID_Compat_Math_Start) 0
Ideographic (Ideo)
InCB (InCB) None
Indic Mantra Category (InMC)
Indic Positional Category (InPC) NA
Indic Syllabic Category (InSC) Other
Jamo Short Name (JSN)
Join Control (Join_C)
Logical Order Exception (LOE)
Modifier Combining Mark (MCM)
Math (Math)
Noncharacter Code Point (NChar)
NFC Quick Check (NFC_QC) Yes
NFD Quick Check (NFD_QC) Yes
NFKC Casefold (NFKC_CF) Glyph for U+0020 Space
NFKC Quick Check (NFKC_QC) Yes
NFKC_SCF (NFKC_SCF) Glyph for U+0020 Space
NFKD Quick Check (NFKD_QC) Yes
Other Alphabetic (OAlpha)
Other Default Ignorable Code Point (ODI)
Other Grapheme Extend (OGr_Ext)
Other ID Continue (OIDC)
Other ID Start (OIDS)
Other Lowercase (OLower)
Other Math (OMath)
Other Uppercase (OUpper)
Prepended Concatenation Mark (PCM)
Pattern Syntax (Pat_Syn)
Pattern White Space (Pat_WS)
Quotation Mark (QMark)
Regional Indicator (RI)
Radical (Radical)
Sentence Break (SB) Space
Soft Dotted (SD)
Sentence Terminal (STerm)
Terminal Punctuation (Term)
Unified Ideograph (UIdeo)
Variation Selector (VS)
Word Break (WB) WSegSpace
White Space (WSpace)
XID Continue (XIDC)
XID Start (XIDS)
Expands On NFC (XO_NFC)
Expands On NFD (XO_NFD)
Expands On NFKC (XO_NFKC)
Expands On NFKD (XO_NFKD)
Bidi Paired Bracket (bpb) Glyph for U+0020 Space
Bidi Paired Bracket Type (bpt) None
East Asian Width (ea) narrow
Hangul Syllable Type (hst) Not Applicable
ISO 10646 Comment (isc)
Joining Group (jg) No_Joining_Group
Joining Type (jt) Non Joining
Line Break (lb) Space
Numeric Type (nt) none
Numeric Value (nv) not a number
Simple Case Folding (scf) Glyph for U+0020 Space
Script Extension (scx)
Vertical Orientation (vo) R