Home: go to the homepage U+2000 to U+206F General Punctuation
Glyph for U+2010
Source: Noto Sans

U+2010 Hyphen

U+2010 was added to Unicode in version 1.1 (1993). It belongs to the block U+2000 to U+206F General Punctuation in the U+0000 to U+FFFF Basic Multilingual Plane.

This character is a Dash Punctuation and is commonly used, that is, in no specific script.

The glyph is not a composition. It has a Ambiguous East Asian Width. In bidirectional context it acts as Other Neutral and is not mirrored. The glyph can, under circumstances, be confused with 1 other glyphs. In text U+2010 behaves as Break After regarding line breaks. It has type Other for sentence and Other for word breaks. The Grapheme Cluster Break is Any.

The CLDR project labels this character “hyphen” for use in screen reading software. It assigns additional tags, e.g. for search in emoji pickers: dash.

The Wikipedia has the following information about this codepoint:

The hyphen is a punctuation mark used to join words and to separate syllables of a single word. The use of hyphens is called hyphenation. Son-in-law is an example of a hyphenated word.

The hyphen is sometimes confused with dashes (en dash – and em dash — and others), which are longer, or with the minus sign −, which is also longer and usually higher up to match the crossbar in the plus sign +.

As an orthographic concept, the hyphen is a single entity. In character encoding it is represented by any of several characters and glyphs, including the Unicode hyphen (shown at the top of the infobox on this page), the hyphen-minus, the soft hyphen, and the nonbreaking hyphen. The character most often used to represent a hyphen (and the one produced by the key on a keyboard) is called the "hyphen-minus" by Unicode, deriving from the original ASCII standard, where it was called "hyphen (minus)".

Representations

System Representation
8208
UTF-8 E2 80 90
UTF-16 20 10
UTF-32 00 00 20 10
URL-Quoted %E2%80%90
HTML hex reference ‐
Wrong windows-1252 Mojibake ‐
HTML named entity ‐
HTML named entity ‐
Encoding: JIS0208 (hex bytes) A1 BE
LATEX -
AGL: Latin-4 uni2010
AGL: Latin-5 uni2010
Adobe Glyph List hyphentwo
digraph -1

Related Characters

Confusables

Elsewhere

Complete Record

Property Value
Age 1.1 (1993)
Unicode Name HYPHEN
Unicode 1 Name
Block General Punctuation
General Category Dash Punctuation
Script Common
Bidirectional Category Other Neutral
Combining Class Not Reordered
Decomposition Type None
Decomposition Mapping Glyph for U+2010 Hyphen
Lowercase
Simple Lowercase Mapping Glyph for U+2010 Hyphen
Lowercase Mapping Glyph for U+2010 Hyphen
Uppercase
Simple Uppercase Mapping Glyph for U+2010 Hyphen
Uppercase Mapping Glyph for U+2010 Hyphen
Simple Titlecase Mapping Glyph for U+2010 Hyphen
Titlecase Mapping Glyph for U+2010 Hyphen
Case Folding Glyph for U+2010 Hyphen
ASCII Hex Digit
Alphabetic
Bidi Control
Bidi Mirrored
Composition Exclusion
Case Ignorable
Changes When Casefolded
Changes When Casemapped
Changes When NFKC Casefolded
Changes When Lowercased
Changes When Titlecased
Changes When Uppercased
Cased
Full Composition Exclusion
Default Ignorable Code Point
Dash
Deprecated
Diacritic
Emoji Modifier Base
Emoji Component
Emoji Modifier
Emoji Presentation
Emoji
Extender
Extended Pictographic
FC NFKC Closure Glyph for U+2010 Hyphen
Grapheme Cluster Break Any
Grapheme Base
Grapheme Extend
Grapheme Link
Hex Digit
Hyphen
ID Continue
ID Start
IDS Binary Operator
IDS Trinary Operator and
IDSU 0
ID_Compat_Math_Continue 0
ID_Compat_Math_Start 0
Ideographic
InCB None
Indic Mantra Category
Indic Positional Category NA
Indic Syllabic Category Consonant_Placeholder
Jamo Short Name
Join Control
Logical Order Exception
Math
Noncharacter Code Point
NFC Quick Check Yes
NFD Quick Check Yes
NFKC Casefold Glyph for U+2010 Hyphen
NFKC Quick Check Yes
NFKC_SCF Glyph for U+2010 Hyphen
NFKD Quick Check Yes
Other Alphabetic
Other Default Ignorable Code Point
Other Grapheme Extend
Other ID Continue
Other ID Start
Other Lowercase
Other Math
Other Uppercase
Prepended Concatenation Mark
Pattern Syntax
Pattern White Space
Quotation Mark
Regional Indicator
Radical
Sentence Break Other
Soft Dotted
Sentence Terminal
Terminal Punctuation
Unified Ideograph
Variation Selector
Word Break Other
White Space
XID Continue
XID Start
Expands On NFC
Expands On NFD
Expands On NFKC
Expands On NFKD
Bidi Paired Bracket Glyph for U+2010 Hyphen
Bidi Paired Bracket Type None
East Asian Width Ambiguous
Hangul Syllable Type Not Applicable
ISO 10646 Comment
Joining Group No_Joining_Group
Joining Type Non Joining
Line Break Break After
Numeric Type None
Numeric Value not a number
Simple Case Folding Glyph for U+2010 Hyphen
Script Extension
Vertical Orientation R