U+2010 Hyphen
U+2010 was added to Unicode in version 1.1 (1993). It belongs to the block
This character is a Dash Punctuation and is commonly used, that is, in no specific script.
The glyph is not a composition. It has a Ambiguous East Asian Width. In bidirectional context it acts as Other Neutral and is not mirrored. The glyph can, under circumstances, be confused with 1 other glyphs. In text U+2010 behaves as Break After regarding line breaks. It has type Other for sentence and Other for word breaks. The Grapheme Cluster Break is Any.
The CLDR project labels this character “hyphen” for use in screen reading software. It assigns additional tags, e.g. for search in emoji pickers: dash.
The Wikipedia has the following information about this codepoint:
The hyphen ‐ is a punctuation mark used to join words and to separate syllables of a single word. The use of hyphens is called hyphenation. Son-in-law is an example of a hyphenated word.
The hyphen is sometimes confused with dashes (en dash – and em dash — and others), which are longer, or with the minus sign −, which is also longer and usually higher up to match the crossbar in the plus sign +.
As an orthographic concept, the hyphen is a single entity. In character encoding it is represented by any of several characters and glyphs, including the Unicode hyphen (shown at the top of the infobox on this page), the hyphen-minus, the soft hyphen, and the nonbreaking hyphen. The character most often used to represent a hyphen (and the one produced by the key on a keyboard) is called the "hyphen-minus" by Unicode, deriving from the original ASCII standard, where it was called "hyphen (minus)".
Representations
System | Representation |
---|---|
Nº | 8208 |
UTF-8 | E2 80 90 |
UTF-16 | 20 10 |
UTF-32 | 00 00 20 10 |
URL-Quoted | %E2%80%90 |
HTML hex reference | ‐ |
Wrong windows-1252 Mojibake | †|
HTML named entity | ‐ |
HTML named entity | ‐ |
Encoding: JIS0208 (hex bytes) | A1 BE |
LATEX | - |
AGL: Latin-4 | uni2010 |
AGL: Latin-5 | uni2010 |
Adobe Glyph List | hyphentwo |
digraph | -1 |
Related Characters
Confusables
Elsewhere
Complete Record
Property | Value |
---|---|
1.1 (1993) | |
HYPHEN | |
— | |
General Punctuation | |
Dash Punctuation | |
Common | |
Other Neutral | |
Not Reordered | |
None | |
|
|
✘ | |
|
|
|
|
✘ | |
|
|
|
|
|
|
|
|
|
|
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
|
|
Any | |
✔ | |
✘ | |
✘ | |
✘ | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
0 | |
0 | |
0 | |
✘ | |
None | |
— | |
NA | |
Consonant_Placeholder | |
— | |
✘ | |
✘ | |
✘ | |
✘ | |
Yes | |
Yes | |
|
|
Yes | |
|
|
Yes | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
Other | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
Other | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
|
|
None | |
Ambiguous | |
Not Applicable | |
— | |
No_Joining_Group | |
Non Joining | |
Break After | |
None | |
not a number | |
|
|
R |