U+0020 Space
U+0020 was added in Unicode version 1.1 in 1993. It belongs to the block
This character is a Space Separator and is commonly used, that is, in no specific script.
The glyph is not a composition. Its East Asian Width is narrow. In bidirectional text it acts as White Space. When changing direction it is not mirrored. U+0020 allows line breaks at its position. The glyph can be confused with 19 other glyphs.
The Wikipedia has the following information about this codepoint:
In writing, a space ( ) is a blank area that separates words, sentences, syllables (in syllabification) and other written or printed glyphs (characters). Conventions for spacing vary among languages, and in some languages the spacing rules are complex. Inter-word spaces ease the reader's task of identifying words, and avoid outright ambiguities such as "now here" vs. "nowhere". They also provide convenient guides for where a human or program may start new lines.
Typesetting can use spaces of varying widths, just as it can use graphic characters of varying widths. Unlike graphic characters, typeset spaces are commonly stretched in order to align text. The typewriter, on the other hand, typically has only one width for all characters, including spaces. Following widespread acceptance of the typewriter, some typewriter conventions influenced typography and the design of printed works.
Computer representation of text facilitates getting around mechanical and physical limitations such as character widths in at least two ways:
- Character encodings such as Unicode provide spaces of several widths, which are encoded using distinct numeric code points. For example, Unicode U+0020 is the "normal" space character, but U+00A0 adds the meaning that a new line should not be started there, while U+2003 represents a space with a fixed width of one em. Collectively, such characters are called Whitespace characters.
- Formatting and drawing languages and software commonly provide much more flexibility in spacing. For example, SVG, PostScript, and countless other languages enable drawing characters at specific (x,y) coordinates on a screen or page. By drawing each word at a specific starting coordinate, such programs need not "draw" spaces at all (this can lead to difficulties in extracting the correct text back out). Similarly, word processors can "fully justify" text, stretching inter-word spaces to make all lines the same length (as can mechanical Linotype machines). Precision is limited by physical capabilities of output devices.
Representations
System | Representation |
---|---|
Nº | 32 |
UTF-8 | 20 |
UTF-16 | 00 20 |
UTF-32 | 00 00 00 20 |
URL-Quoted | %20 |
HTML hex reference |   |
Wrong windows-1252 Mojibake | â |
abbreviation | SP |
Encoding: ASCII (hex bytes) | 20 |
Encoding: BIG5 (hex bytes) | 20 |
Encoding: BIG5HKSCS (hex bytes) | 20 |
Encoding: CP037 (hex bytes) | 40 |
Encoding: CP273 (hex bytes) | 40 |
Encoding: CP424 (hex bytes) | 40 |
Encoding: CP437 (hex bytes) | 20 |
Encoding: CP500 (hex bytes) | 40 |
Encoding: CP720 (hex bytes) | 20 |
Encoding: CP737 (hex bytes) | 20 |
Encoding: CP775 (hex bytes) | 20 |
Encoding: CP850 (hex bytes) | 20 |
Encoding: CP852 (hex bytes) | 20 |
Encoding: CP855 (hex bytes) | 20 |
Encoding: CP856 (hex bytes) | 20 |
Encoding: CP857 (hex bytes) | 20 |
Encoding: CP858 (hex bytes) | 20 |
Encoding: CP860 (hex bytes) | 20 |
Encoding: CP861 (hex bytes) | 20 |
Encoding: CP862 (hex bytes) | 20 |
Encoding: CP863 (hex bytes) | 20 |
Encoding: CP864 (hex bytes) | 20 |
Encoding: CP865 (hex bytes) | 20 |
Encoding: CP866 (hex bytes) | 20 |
Encoding: CP869 (hex bytes) | 20 |
Encoding: CP874 (hex bytes) | 20 |
Encoding: CP875 (hex bytes) | 40 |
Encoding: CP932 (hex bytes) | 20 |
Encoding: CP949 (hex bytes) | 20 |
Encoding: CP950 (hex bytes) | 20 |
Encoding: CP1006 (hex bytes) | 20 |
Encoding: CP1026 (hex bytes) | 40 |
Encoding: CP1125 (hex bytes) | 20 |
Encoding: CP1140 (hex bytes) | 40 |
Encoding: CP1250 (hex bytes) | 20 |
Encoding: CP1251 (hex bytes) | 20 |
Encoding: CP1252 (hex bytes) | 20 |
Encoding: CP1253 (hex bytes) | 20 |
Encoding: CP1254 (hex bytes) | 20 |
Encoding: CP1255 (hex bytes) | 20 |
Encoding: CP1256 (hex bytes) | 20 |
Encoding: CP1257 (hex bytes) | 20 |
Encoding: CP1258 (hex bytes) | 20 |
Encoding: EUC_JP (hex bytes) | 20 |
Encoding: EUC_JIS_2004 (hex bytes) | 20 |
Encoding: EUC_JISX0213 (hex bytes) | 20 |
Encoding: EUC_KR (hex bytes) | 20 |
Encoding: GB2312 (hex bytes) | 20 |
Encoding: GBK (hex bytes) | 20 |
Encoding: GB18030 (hex bytes) | 20 |
Encoding: HZ (hex bytes) | 20 |
Encoding: ISO2022_JP (hex bytes) | 20 |
Encoding: ISO2022_JP_1 (hex bytes) | 20 |
Encoding: ISO2022_JP_2 (hex bytes) | 20 |
Encoding: ISO2022_JP_2004 (hex bytes) | 20 |
Encoding: ISO2022_JP_3 (hex bytes) | 20 |
Encoding: ISO2022_JP_EXT (hex bytes) | 20 |
Encoding: ISO2022_KR (hex bytes) | 20 |
Encoding: LATIN_1 (hex bytes) | 20 |
Encoding: ISO8859_2 (hex bytes) | 20 |
Encoding: ISO8859_3 (hex bytes) | 20 |
Encoding: ISO8859_4 (hex bytes) | 20 |
Encoding: ISO8859_5 (hex bytes) | 20 |
Encoding: ISO8859_6 (hex bytes) | 20 |
Encoding: ISO8859_7 (hex bytes) | 20 |
Encoding: ISO8859_8 (hex bytes) | 20 |
Encoding: ISO8859_9 (hex bytes) | 20 |
Encoding: ISO8859_10 (hex bytes) | 20 |
Encoding: ISO8859_11 (hex bytes) | 20 |
Encoding: ISO8859_13 (hex bytes) | 20 |
Encoding: ISO8859_14 (hex bytes) | 20 |
Encoding: ISO8859_15 (hex bytes) | 20 |
Encoding: ISO8859_16 (hex bytes) | 20 |
Encoding: JOHAB (hex bytes) | 20 |
Encoding: KOI8_R (hex bytes) | 20 |
Encoding: KOI8_T (hex bytes) | 20 |
Encoding: KOI8_U (hex bytes) | 20 |
Encoding: KZ1048 (hex bytes) | 20 |
Encoding: MAC_CYRILLIC (hex bytes) | 20 |
Encoding: MAC_GREEK (hex bytes) | 20 |
Encoding: MAC_ICELAND (hex bytes) | 20 |
Encoding: MAC_LATIN2 (hex bytes) | 20 |
Encoding: MAC_ROMAN (hex bytes) | 20 |
Encoding: MAC_TURKISH (hex bytes) | 20 |
Encoding: PTCP154 (hex bytes) | 20 |
Encoding: SHIFT_JIS (hex bytes) | 20 |
Encoding: SHIFT_JIS_2004 (hex bytes) | 20 |
Encoding: SHIFT_JISX0213 (hex bytes) | 20 |
Encoding: CP037 (hex bytes) | 40 |
Encoding: CP1025 (hex bytes) | 40 |
Encoding: CP1047 (hex bytes) | 40 |
Encoding: CP1097 (hex bytes) | 40 |
Encoding: CP1112 (hex bytes) | 40 |
Encoding: CP1122 (hex bytes) | 40 |
Encoding: CP1123 (hex bytes) | 40 |
Encoding: CP1140 (hex bytes) | 40 |
Encoding: CP1141 (hex bytes) | 40 |
Encoding: CP1142 (hex bytes) | 40 |
Encoding: CP1143 (hex bytes) | 40 |
Encoding: CP1144 (hex bytes) | 40 |
Encoding: CP1145 (hex bytes) | 40 |
Encoding: CP1146 (hex bytes) | 40 |
Encoding: CP1147 (hex bytes) | 40 |
Encoding: CP1148 (hex bytes) | 40 |
Encoding: CP1148MS (hex bytes) | 40 |
Encoding: CP1149 (hex bytes) | 40 |
Encoding: CP273 (hex bytes) | 40 |
Encoding: CP277 (hex bytes) | 40 |
Encoding: CP278 (hex bytes) | 40 |
Encoding: CP280 (hex bytes) | 40 |
Encoding: CP284 (hex bytes) | 40 |
Encoding: CP285 (hex bytes) | 40 |
Encoding: CP290 (hex bytes) | 40 |
Encoding: CP297 (hex bytes) | 40 |
Encoding: CP420 (hex bytes) | 40 |
Encoding: CP424 (hex bytes) | 40 |
Encoding: CP500 (hex bytes) | 40 |
Encoding: CP500MS (hex bytes) | 40 |
Encoding: CP833 (hex bytes) | 40 |
Encoding: CP838 (hex bytes) | 40 |
Encoding: CP870 (hex bytes) | 40 |
Encoding: CP871 (hex bytes) | 40 |
Encoding: CP875 (hex bytes) | 40 |
LATEX | \space |
AGL: Latin-1 | space |
AGL: Latin-2 | space |
AGL: Latin-3 | space |
AGL: Latin-4 | space |
AGL: Latin-5 | space |
Adobe Glyph List | space |
Adobe Glyph List | spacehackarabic |
digraph | SP |
Related Characters
No-Break SpaceGlyph for U+00A0 DiaeresisGlyph for U+00A8 MacronGlyph for U+00AF Acute AccentGlyph for U+00B4 CedillaGlyph for U+00B8 BreveGlyph for U+02D8 Dot AboveGlyph for U+02D9 Ring AboveGlyph for U+02DA OgonekGlyph for U+02DB Small TildeGlyph for U+02DC Double Acute AccentGlyph for U+02DD Greek YpogegrammeniGlyph for U+037A Greek TonosGlyph for U+0384 Greek Dialytika TonosGlyph for U+0385 Greek KoronisGlyph for U+1FBD Greek PsiliGlyph for U+1FBF Greek PerispomeniGlyph for U+1FC0 Greek Dialytika and PerispomeniGlyph for U+1FC1 Greek Psili and VariaGlyph for U+1FCD Greek Psili and OxiaGlyph for U+1FCE Greek Psili and PerispomeniGlyph for U+1FCF Greek Dasia and VariaGlyph for U+1FDD Greek Dasia and OxiaGlyph for U+1FDE Greek Dasia and PerispomeniGlyph for U+1FDF Greek Dialytika and VariaGlyph for U+1FED Greek Dialytika and OxiaGlyph for U+1FEE Greek OxiaGlyph for U+1FFD Greek DasiaGlyph for U+1FFE En QuadGlyph for U+2000 Em QuadGlyph for U+2001 En SpaceGlyph for U+2002 Em SpaceGlyph for U+2003 Three-Per-Em SpaceGlyph for U+2004 Four-Per-Em SpaceGlyph for U+2005 Six-Per-Em SpaceGlyph for U+2006 Figure SpaceGlyph for U+2007 Punctuation SpaceGlyph for U+2008 Thin SpaceGlyph for U+2009 Hair SpaceGlyph for U+200A Double Low LineGlyph for U+2017 Narrow No-Break SpaceGlyph for U+202F OverlineGlyph for U+203E Medium Mathematical SpaceGlyph for U+205F Ideographic SpaceGlyph for U+3000 Katakana-Hiragana Voiced Sound MarkGlyph for U+309B Katakana-Hiragana Semi-Voiced Sound MarkGlyph for U+309C Arabic Ligature Shadda with Dammatan Isolated FormGlyph for U+FC5E Arabic Ligature Shadda with Kasratan Isolated FormGlyph for U+FC5F Arabic Ligature Shadda with Fatha Isolated FormGlyph for U+FC60 Arabic Ligature Shadda with Damma Isolated FormGlyph for U+FC61 Arabic Ligature Shadda with Kasra Isolated FormGlyph for U+FC62 Arabic Ligature Shadda with Superscript Alef Isolated FormGlyph for U+FC63 Arabic Ligature Sallallahou Alayhe WasallamGlyph for U+FDFA Arabic Ligature JallajalalouhouGlyph for U+FDFB Dashed OverlineGlyph for U+FE49 Centreline OverlineGlyph for U+FE4A Wavy OverlineGlyph for U+FE4B Double Wavy OverlineGlyph for U+FE4C Arabic Fathatan Isolated FormGlyph for U+FE70 Arabic Dammatan Isolated FormGlyph for U+FE72 Arabic Kasratan Isolated FormGlyph for U+FE74 Arabic Fatha Isolated FormGlyph for U+FE76 Arabic Damma Isolated FormGlyph for U+FE78 Arabic Kasra Isolated FormGlyph for U+FE7A Arabic Shadda Isolated FormGlyph for U+FE7C Arabic Sukun Isolated FormGlyph for U+FE7E Fullwidth MacronGlyph for U+FFE3
Confusables
Elsewhere
Complete Record
Property | Value |
---|---|
1.1 (1993) | |
SPACE | |
— | |
Basic Latin | |
Space Separator | |
Common | |
White Space | |
Not Reordered | |
none | |
|
|
✘ | |
|
|
|
|
✘ | |
|
|
|
|
|
|
|
|
|
|
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
|
|
Any | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
0 | |
0 | |
0 | |
✘ | |
None | |
— | |
NA | |
Other | |
— | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
Yes | |
Yes | |
|
|
Yes | |
|
|
Yes | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✔ | |
✘ | |
✘ | |
✘ | |
Space | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
WSegSpace | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
|
|
None | |
narrow | |
Not Applicable | |
— | |
No_Joining_Group | |
Non Joining | |
Space | |
none | |
not a number | |
|
|
R |