Home U+1E00 to U+1EFF Latin Extended Additional
Glyph for U+1E26
Source: Noto Sans

U+1E26 LATIN CAPITAL LETTER H WITH DIAERESIS

U+1E26 was added to Unicode in version 1.1 (1993). It belongs to the block U+1E00 to U+1EFF Latin Extended Additional in the U+0000 to U+FFFF Basic Multilingual Plane.

This character is a Uppercase Letter and is mainly used in the Latin script. It is related to its lowercase variant Glyph for U+1E27 Latin Small Letter H with Diaeresis.

The glyph is a Canonical composition of the glyphs Glyph for U+0048 Latin Capital Letter H, Glyph for U+0308 Combining Diaeresis. It has a Neutral East Asian Width. In bidirectional context it acts as Left To Right and is not mirrored. In text U+1E26 behaves as Alphabetic regarding line breaks. It has type Upper for sentence and Alphabetic Letter for word breaks. The Grapheme Cluster Break is Any.

The Wikipedia has the following information about this codepoint:

The diaeresis ( dy-ERR-ə-sis, -⁠EER-; also known as the trema) and the umlaut () are two different diacritical marks that (in modern usage) look alike. They both consist of two dots ¨ placed over a letter, usually a vowel; when that letter is an i or a j, the diacritic replaces the tittle: ï. In computer systems, both forms have the same code point (binary code). Their appearance in print or on screen may vary between typefaces but rarely within the same typeface.

The "diaeresis" and the "umlaut" are diacritics marking two distinct phonological phenomena.

  • The "diaeresis" diacritic is used to mark the separation of two distinct vowels in adjacent syllables when an instance of diaeresis (or hiatus) occurs, so as to distinguish from a digraph or diphthong.
  • The "umlaut" diacritic, in contrast, indicates a sound shift phenomenon – also known as umlaut – in which a back vowel becomes a front vowel.

Neither of these phenomena occur in English, except in loanwords (like naïve) or for stylistic reasons (as in the Brontë family or Mötley Crüe).

These two diacritics have different origins, the diaeresis being considerably older. Nevertheless, in modern computer systems using Unicode, the umlaut and diaeresis diacritics are encoded identically. For example, U+00E4 ä LATIN SMALL LETTER A WITH DIAERESIS represents both a-umlaut and a-diaeresis.

The same mark, placed above or below the letter, is used in other contexts and for different purposes and meanings. For example, in Albanian, ë represents a schwa.

Representations

System Representation
7718
UTF-8 E1 B8 A6
UTF-16 1E 26
UTF-32 00 00 1E 26
URL-Quoted %E1%B8%A6
HTML-Escape Ḧ
Wrong windows-1252 Mojibake Ḧ
AGL: Latin-5 uni1E26
Adobe Glyph List Hdieresis
digraph H:

Related Characters

Elsewhere

Complete Record

Property Value
Age 1.1 (1993)
Unicode Name LATIN CAPITAL LETTER H WITH DIAERESIS
Unicode 1 Name
Block Latin Extended Additional
General Category Uppercase Letter
Script Latin
Bidirectional Category Left To Right
Combining Class Not Reordered
Decomposition Type Canonical
Decomposition Mapping Glyph for U+0048 Latin Capital Letter H Glyph for U+0308 Combining Diaeresis
Lowercase
Simple Lowercase Mapping Glyph for U+1E27 Latin Small Letter H with Diaeresis
Lowercase Mapping Glyph for U+1E27 Latin Small Letter H with Diaeresis
Uppercase
Simple Uppercase Mapping Glyph for U+1E26 Latin Capital Letter H with Diaeresis
Uppercase Mapping Glyph for U+1E26 Latin Capital Letter H with Diaeresis
Simple Titlecase Mapping Glyph for U+1E26 Latin Capital Letter H with Diaeresis
Titlecase Mapping Glyph for U+1E26 Latin Capital Letter H with Diaeresis
Case Folding Glyph for U+1E27 Latin Small Letter H with Diaeresis
ASCII Hex Digit
Alphabetic
Bidi Control
Bidi Mirrored
Bidi Paired Bracket Glyph for U+1E26 Latin Capital Letter H with Diaeresis
Bidi Paired Bracket Type None
Cased
Composition Exclusion
Case Ignorable
Full Composition Exclusion
Changes When Casefolded
Changes When Casemapped
Changes When NFKC Casefolded
Changes When Lowercased
Changes When Titlecased
Changes When Uppercased
Dash
Deprecated
Default Ignorable Code Point
Diacritic
East Asian Width Neutral
Emoji Modifier Base
Emoji Component
Emoji Modifier
Emoji
Emoji Presentation
Extender
Extended Pictographic
FC NFKC Closure Glyph for U+1E26 Latin Capital Letter H with Diaeresis
Grapheme Cluster Break Any
Grapheme Base
Grapheme Extend
Grapheme Link
Hex Digit
Hangul Syllable Type Not Applicable
Hyphen
ID Continue
Ideographic
ID Start
IDS Binary Operator
IDS Trinary Operator and
Indic Mantra Category
Indic Positional Category NA
Indic Syllabic Category Other
ISO 10646 Comment
Joining Group No_Joining_Group
Join Control
Jamo Short Name
Joining Type Non Joining
Line Break Alphabetic
Logical Order Exception
Math
Noncharacter Code Point
NFC Quick Check Yes
NFD Quick Check No
NFKC Casefold Glyph for U+1E27 Latin Small Letter H with Diaeresis
NFKC Quick Check Yes
NFKD Quick Check No
Numeric Type None
Numeric Value not a number
Other Alphabetic
Other Default Ignorable Code Point
Other Grapheme Extend
Other ID Continue
Other ID Start
Other Lowercase
Other Math
Other Uppercase
Pattern Syntax
Pattern White Space
Prepended Concatenation Mark
Quotation Mark
Radical
Regional Indicator
Sentence Break Upper
Simple Case Folding Glyph for U+1E27 Latin Small Letter H with Diaeresis
Script Extension
Soft Dotted
Sentence Terminal
Terminal Punctuation
Unified Ideograph
Vertical Orientation R
Variation Selector
Word Break Alphabetic Letter
White Space
XID Continue
XID Start
Expands On NFC
Expands On NFD
Expands On NFKC
Expands On NFKD