U+00AD Soft Hyphen
U+00AD was added to Unicode in version 1.1 (1993). It belongs to the Latin-1 Supplement block.
This character is a Format character in the Common script, that is, it is not specific to any one script. It is also known as the discretionary hyphen.
The glyph is not a composition. Its East Asian Width is Ambiguous. In bidirectional context it acts as Boundary Neutral and is not mirrored. In text, U+00AD behaves as Break After with regard to line breaks. Its Sentence Break and Word Break types are both Format, and its Grapheme Cluster Break is Control.
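Some of the properties above can be checked from a script. The following is a minimal sketch using Python's standard `unicodedata` module; the `properties` dictionary and its key names are chosen here for illustration only. The module does not expose the line, word, sentence, or grapheme break properties, so those are not covered.

```python
import unicodedata

# U+00AD SOFT HYPHEN, written as an escape because the character is invisible.
SHY = "\u00ad"

# Key names are illustrative; the values come from the Unicode Character Database
# version bundled with the running Python interpreter.
properties = {
    "name": unicodedata.name(SHY),
    "general_category": unicodedata.category(SHY),        # "Cf" means Format
    "bidi_class": unicodedata.bidirectional(SHY),          # "BN" means Boundary Neutral
    "east_asian_width": unicodedata.east_asian_width(SHY), # "A" means Ambiguous
    "mirrored": bool(unicodedata.mirrored(SHY)),
    "decomposition": unicodedata.decomposition(SHY),       # empty string: no decomposition
}

for key, value in properties.items():
    print(f"{key}: {value!r}")
```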
Wikipedia has the following information about this code point:
In computing and typesetting, a soft hyphen (ISO 8859: 0xAD, Unicode U+00AD SOFT HYPHEN, HTML: &#xAD; or &#173; or &shy;) or syllable hyphen (EBCDIC: 0xCA), abbreviated SHY, is a code point reserved in some coded character sets for the purpose of breaking words across lines by inserting visible hyphens. Two alternative ways of using the soft hyphen character for this purpose have emerged, depending on whether the encoded text will be broken into lines by its recipient, or has already been preformatted by its originator.
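As an illustration of the first usage described above (text that the recipient breaks into lines), the sketch below marks permitted break points in a word with U+00AD and removes them again, for example before comparing or searching text. The helper names `add_soft_hyphens` and `strip_soft_hyphens` are hypothetical, and the break positions are supplied by hand rather than computed by a hyphenation algorithm.

```python
SHY = "\u00ad"  # U+00AD SOFT HYPHEN


def add_soft_hyphens(word: str, break_points: list[int]) -> str:
    """Insert U+00AD before each index in break_points (indices into the original word)."""
    pieces = []
    previous = 0
    for index in sorted(break_points):
        pieces.append(word[previous:index])
        previous = index
    pieces.append(word[previous:])
    return SHY.join(pieces)


def strip_soft_hyphens(text: str) -> str:
    """Remove every U+00AD so that the text compares equal to its unhyphenated form."""
    return text.replace(SHY, "")


# Hand-picked break points: type|set|ting
marked = add_soft_hyphens("typesetting", [4, 7])

print(len("typesetting"), len(marked))               # the marked word is two code points longer
print(strip_soft_hyphens(marked) == "typesetting")
```

Because the character is normally invisible, two strings that look identical on screen can differ in length and fail an equality check, which is why stripping it before comparison is a common precaution.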
Representations
System | Representation |
---|---|
Nº | 173 |
UTF-8 | C2 AD |
UTF-16 | 00 AD |
UTF-32 | 00 00 00 AD |
URL-Quoted | %C2%AD |
HTML hex reference | &#xAD; |
Wrong windows-1252 Mojibake | Â |
HTML named entity | &shy; |
abbreviation | SHY |
alias | discretionary hyphen |
Encoding: EUC-KR (hex bytes) | A1 A9 |
Encoding: ISO-8859-10 (hex bytes) | AD |
Encoding: ISO-8859-13 (hex bytes) | AD |
Encoding: ISO-8859-14 (hex bytes) | AD |
Encoding: ISO-8859-15 (hex bytes) | AD |
Encoding: ISO-8859-16 (hex bytes) | AD |
Encoding: ISO-8859-2 (hex bytes) | AD |
Encoding: ISO-8859-3 (hex bytes) | AD |
Encoding: ISO-8859-4 (hex bytes) | AD |
Encoding: ISO-8859-5 (hex bytes) | AD |
Encoding: ISO-8859-6 (hex bytes) | AD |
Encoding: ISO-8859-7 (hex bytes) | AD |
Encoding: ISO-8859-8 (hex bytes) | AD |
Encoding: WINDOWS-1250 (hex bytes) | AD |
Encoding: WINDOWS-1251 (hex bytes) | AD |
Encoding: WINDOWS-1252 (hex bytes) | AD |
Encoding: WINDOWS-1253 (hex bytes) | AD |
Encoding: WINDOWS-1254 (hex bytes) | AD |
Encoding: WINDOWS-1255 (hex bytes) | AD |
Encoding: WINDOWS-1256 (hex bytes) | AD |
Encoding: WINDOWS-1257 (hex bytes) | AD |
Encoding: WINDOWS-1258 (hex bytes) | AD |
LATEX | \- |
AGL: Latin-2 | uni00AD |
AGL: Latin-3 | uni00AD |
AGL: Latin-4 | uni00AD |
AGL: Latin-5 | uni00AD |
Adobe Glyph List | sfthyphen |
Adobe Glyph List | softhyphen |
digraph | -- |
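Several rows of the table above can be reproduced with Python's standard library. This is a sketch rather than an exhaustive check: it covers the Unicode encodings, the URL-quoted form, the HTML references, two of the single-byte encodings, and the windows-1252 mojibake. All variable names are illustrative.

```python
import html
from urllib.parse import quote

SHY = "\u00ad"  # U+00AD SOFT HYPHEN

code_point = ord(SHY)               # decimal code point number
utf8 = SHY.encode("utf-8")          # two bytes: C2 AD
utf16 = SHY.encode("utf-16-be")     # big-endian, no byte order mark: 00 AD
utf32 = SHY.encode("utf-32-be")     # big-endian, no byte order mark: 00 00 00 AD
url_quoted = quote(SHY)             # percent-encoded UTF-8 bytes

# Single-byte encodings from the table place the character at byte AD.
latin2 = SHY.encode("iso-8859-2")
cp1252 = SHY.encode("cp1252")

# Decimal, hexadecimal, and named HTML references all decode to U+00AD.
from_entities = [html.unescape(ref) for ref in ("&#173;", "&#xAD;", "&shy;")]

# Mojibake: the UTF-8 bytes misread as windows-1252 yield "Â" plus an invisible U+00AD.
mojibake = utf8.decode("cp1252")

print(code_point, utf8.hex(" "), utf16.hex(" "), utf32.hex(" "), url_quoted)
print(latin2.hex(), cp1252.hex(), [hex(ord(c)) for c in mojibake])
```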
Complete Record
Property | Value |
---|---|
Age | 1.1 (1993) |
Unicode Name | SOFT HYPHEN |
Block | Latin-1 Supplement |
General Category | Format |
Script | Common |
Bidirectional Category | Boundary Neutral |
Combining Class | Not Reordered |
Decomposition Type | None |
Bidirectional Mirrored | ✘ |
Grapheme Cluster Break | Control |
Sentence Break | Format |
Word Break | Format |
East Asian Width | Ambiguous |
Joining Group | No_Joining_Group |
Joining Type | Transparent |
Line Break | Break After |
Numeric Type | None |
Numeric Value | not a number |