U+00A8 Diaeresis
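U+00A8 was added to Unicode in version 1.1 (1993). It belongs to the block Latin-1 Supplement in the Basic Multilingual Plane.
This character is a Modifier Symbol. Its script is Common, that is, it belongs to no specific script.
The glyph is a Compat composition of the glyphs U+0020 SPACE and U+0308 COMBINING DIAERESIS.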
The CLDR project labels this character “diaeresis” for use in screen reading software. It assigns additional tags, e.g. for search in emoji pickers: diaeresis, tréma, umlaut.
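The properties described above can be checked against the Unicode Character Database copy that ships with Python. The following is a minimal sketch using the standard `unicodedata` module; the variable names (`ch`, `properties`, `nfkc`, `nfc`) are illustrative only, and the values returned depend on the Unicode version bundled with the interpreter.

```python
import unicodedata

ch = "\u00a8"  # DIAERESIS

# Look up a few Unicode Character Database properties for the code point.
properties = {
    "name": unicodedata.name(ch),
    "category": unicodedata.category(ch),            # "Sk" is Modifier Symbol
    "bidirectional": unicodedata.bidirectional(ch),  # "ON" is Other Neutral
    "combining": unicodedata.combining(ch),          # 0 is Not Reordered
    "east_asian_width": unicodedata.east_asian_width(ch),
    "decomposition": unicodedata.decomposition(ch),
}

# NFKC applies the compatibility decomposition; NFC leaves the character alone.
nfkc = unicodedata.normalize("NFKC", ch)
nfc = unicodedata.normalize("NFC", ch)

for key, value in properties.items():
    print(f"{key}: {value!r}")
print("NFKC code points:", [f"U+{ord(c):04X}" for c in nfkc])
```

Because the decomposition is of type Compat, only the compatibility forms (NFKC, NFKD) replace the character with a space followed by the combining mark; the canonical forms (NFC, NFD) keep U+00A8 unchanged.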
The Wikipedia has the following information about this codepoint:
The diaeresis (dy-ERR-ə-sis, -EER-; also known as the trema) and the umlaut are two different diacritical marks that (in modern usage) look alike. They both consist of two dots ¨ placed over a letter, usually a vowel; when that letter is an i or a j, the diacritic replaces the tittle: ï. In computer systems, both forms have the same code point (binary code). Their appearance in print or on screen may vary between typefaces but rarely within the same typeface.
The "diaeresis" and the "umlaut" are diacritics marking two distinct phonological phenomena.
- The "diaeresis" diacritic is used to mark the separation of two distinct vowels in adjacent syllables when an instance of diaeresis (or hiatus) occurs, so as to distinguish from a digraph or diphthong.
- The "umlaut" diacritic, in contrast, indicates a sound shift phenomenon – also known as umlaut – in which a back vowel becomes a front vowel.
Neither of these phenomena occurs in English, except in loanwords (like naïve) or for stylistic reasons (as in the Brontë family or Mötley Crüe).
These two diacritics have different origins, the diaeresis being considerably older. Nevertheless, in modern computer systems using Unicode, the umlaut and diaeresis diacritics are encoded identically. For example, U+00E4 ä LATIN SMALL LETTER A WITH DIAERESIS represents both a-umlaut and a-diaeresis.
The same mark, placed above or below the letter, is used in other contexts and for different purposes and meanings. For example, in Albanian, ë represents a schwa.
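The shared encoding of umlaut and diaeresis can be made visible by decomposing precomposed letters. This is a small sketch, again assuming Python's standard `unicodedata` module; the helper `decompose` is a hypothetical name introduced here for illustration.

```python
import unicodedata


def decompose(text: str) -> list[str]:
    """Return the code points of text after canonical decomposition (NFD)."""
    return [f"U+{ord(c):04X}" for c in unicodedata.normalize("NFD", text)]


a_with_dots = "\u00e4"  # ä, read as a-umlaut or a-diaeresis depending on language
i_with_dots = "\u00ef"  # ï, as in "naïve"

print(decompose(a_with_dots))  # base letter a followed by the combining mark
print(decompose(i_with_dots))  # base letter i followed by the same combining mark

# Both letters carry the same combining character, whatever the linguistic role.
mark = unicodedata.normalize("NFD", a_with_dots)[1]
print(unicodedata.name(mark))
```

Both letters decompose to a base letter plus U+0308, so software cannot tell from the code points alone whether an umlaut or a diaeresis was intended.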
Representations
System | Representation |
---|---|
Nº | 168 |
UTF-8 | C2 A8 |
UTF-16 | 00 A8 |
UTF-32 | 00 00 00 A8 |
URL-Quoted | %C2%A8 |
HTML hex reference | `&#x00A8;` |
Wrong windows-1252 Mojibake | Â¨ |
HTML named entity | `&die;` |
HTML named entity | `&Dot;` |
HTML named entity | `&DoubleDot;` |
HTML named entity | `&uml` |
HTML named entity | `&uml;` |
Encoding: EUC-KR (hex bytes) | A1 A7 |
Encoding: ISO-8859-2 (hex bytes) | A8 |
Encoding: ISO-8859-3 (hex bytes) | A8 |
Encoding: ISO-8859-4 (hex bytes) | A8 |
Encoding: ISO-8859-7 (hex bytes) | A8 |
Encoding: ISO-8859-8 (hex bytes) | A8 |
Encoding: JIS0208 (hex bytes) | A1 AF |
Encoding: MACINTOSH (hex bytes) | AC |
Encoding: WINDOWS-1250 (hex bytes) | A8 |
Encoding: WINDOWS-1252 (hex bytes) | A8 |
Encoding: WINDOWS-1253 (hex bytes) | A8 |
Encoding: WINDOWS-1254 (hex bytes) | A8 |
Encoding: WINDOWS-1255 (hex bytes) | A8 |
Encoding: WINDOWS-1256 (hex bytes) | A8 |
Encoding: WINDOWS-1257 (hex bytes) | 8D |
Encoding: WINDOWS-1258 (hex bytes) | A8 |
LATEX | \textasciidieresis |
AGL: Latin-1 | dieresis |
AGL: Latin-2 | dieresis |
AGL: Latin-3 | dieresis |
AGL: Latin-4 | dieresis |
AGL: Latin-5 | dieresis |
Adobe Glyph List | dieresis |
digraph | ': |
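
Several rows of the table above can be reproduced with Python's built-in codecs. The snippet below is a sketch, not the method used to generate the table: the helper `hex_bytes` and the dictionary `rows` are hypothetical names, and the codec names (`iso8859_2`, `mac_roman`, `cp1252`, `cp1257`) are Python's identifiers for the encodings listed.

```python
import html
from urllib.parse import quote

ch = "\u00a8"  # DIAERESIS


def hex_bytes(data: bytes) -> str:
    """Format a byte string as space-separated uppercase hex pairs."""
    return " ".join(f"{b:02X}" for b in data)


rows = {
    "Nº": str(ord(ch)),
    "UTF-8": hex_bytes(ch.encode("utf-8")),
    "UTF-16": hex_bytes(ch.encode("utf-16-be")),   # big-endian, no byte order mark
    "UTF-32": hex_bytes(ch.encode("utf-32-be")),
    "URL-Quoted": quote(ch),                        # percent-encodes the UTF-8 bytes
    "HTML hex reference": f"&#x{ord(ch):04X};",
    # Mojibake: UTF-8 bytes misread as windows-1252.
    "Wrong windows-1252 Mojibake": ch.encode("utf-8").decode("cp1252"),
    "ISO-8859-2": hex_bytes(ch.encode("iso8859_2")),
    "MACINTOSH": hex_bytes(ch.encode("mac_roman")),
    "WINDOWS-1252": hex_bytes(ch.encode("cp1252")),
    "WINDOWS-1257": hex_bytes(ch.encode("cp1257")),
}

# Named character references that resolve to this code point.
named = [ref for ref in ("&die;", "&Dot;", "&DoubleDot;", "&uml;")
         if html.unescape(ref) == ch]

for system, representation in rows.items():
    print(f"{system} | {representation} |")
print("Named entities:", ", ".join(named))
```

The mojibake row arises because the UTF-8 lead byte C2 is itself a printable character in windows-1252, while the second byte A8 happens to map back to the diaeresis.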
Complete Record
Property | Value |
---|---|
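Age | 1.1 (1993) |
Unicode Name | DIAERESIS |
Unicode 1 Name | SPACING DIAERESIS |
Block | Latin-1 Supplement |
General Category | Modifier Symbol |
Script | Common |
Bidirectional Category | Other Neutral |
Combining Class | Not Reordered |
Decomposition Type | Compat |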
Case Ignorable | ✔ |
Changes When NFKC Casefolded | ✔ |
Diacritic | ✔ |
Grapheme Cluster Break | Any |
Grapheme Base | ✔ |
Indic Positional Category | NA |
Indic Syllabic Category | Other |
NFC Quick Check | Yes |
NFD Quick Check | Yes |
NFKC Quick Check | No |
NFKD Quick Check | No |
Sentence Break | Other |
Word Break | Other |
Expands On NFKC | ✔ |
Expands On NFKD | ✔ |
Bidi Paired Bracket Type | None |
East Asian Width | Ambiguous |
Hangul Syllable Type | Not Applicable |
Joining Group | No_Joining_Group |
Joining Type | Non Joining |
Line Break | Ambiguous (Alphabetic or Ideographic) |
Numeric Type | None |
Numeric Value | not a number |
Vertical Orientation | R |