Home: go to the homepage U+0080 to U+00FF Latin-1 Supplement
Glyph for U+00DF
Source: Noto Sans

U+00DF Latin Small Letter Sharp S

U+00DF was added to Unicode in version 1.1 (1993). It belongs to the block U+0080 to U+00FF Latin-1 Supplement in the U+0000 to U+FFFF Basic Multilingual Plane.

This character is a Lowercase Letter and is mainly used in the Latin script. It is related to its uppercase variant Glyph for U+0053 Latin Capital Letter SGlyph for U+0053 Latin Capital Letter S and its titlecase variant Glyph for U+0053 Latin Capital Letter SGlyph for U+0073 Latin Small Letter S. The character is also known as Eszett.

The glyph is not a composition. It has a Ambiguous East Asian Width. In bidirectional context it acts as Left To Right and is not mirrored. The glyph can, under circumstances, be confused with 9 other glyphs. In text U+00DF behaves as Alphabetic regarding line breaks. It has type Lower for sentence and Alphabetic Letter for word breaks. The Grapheme Cluster Break is Any.

The Wikipedia has the following information about this codepoint:

In German orthography, the letter (uppercase) and ß (lowercase), called Eszett (IPA: [ɛsˈtsɛt]) and scharfes S (IPA: [ˌʃaʁfəs ˈʔɛs], "sharp S"), represents the /s/ phoneme in Standard German when following long vowels and diphthongs. The letter-name Eszett combines the names of the letters of ⟨s⟩ (Es) and ⟨z⟩ (Zett) in German. The character's Unicode names in English are sharp s and eszett. The Eszett letter is used only in German, and can be typographically replaced with the double-s digraph ⟨ss⟩, if the ß-character is unavailable. In the 20th century, the ß-character was replaced with ss in the spelling of Swiss Standard German (Switzerland and Liechtenstein), while remaining Standard German spelling in other varieties of the German language.

The letter originates as the ⟨sz⟩ digraph as used in late medieval and early modern German orthography, represented as a ligature of ⟨ſ⟩ (long s) and ⟨ʒ⟩ (tailed z) in blackletter typefaces, yielding ⟨ſʒ⟩. This developed from an earlier usage of ⟨z⟩ in Old and Middle High German to represent a separate sibilant sound from ⟨s⟩; when the difference between the two sounds was lost in the 13th century, the two symbols came to be combined as ⟨sz⟩ in some situations.

Traditionally, ⟨ß⟩ did not have a capital form, although some type designers introduced de facto capitalized variants. In 2017, the Council for German Orthography officially adopted a capital, ⟨ẞ⟩, into German orthography, ending a long orthographic debate.

⟨ß⟩ was encoded by ECMA-94 (1985) at position 223 (hexadecimal DF), inherited by Latin-1 and Unicode (U+00DF ß LATIN SMALL LETTER SHARP S). The HTML entity ß was introduced with HTML 2.0 (1995). The capital (U+1E9E ẞ LATIN CAPITAL LETTER SHARP S) was encoded by ISO 10646 in 2008.

Representations

System Representation
223
UTF-8 C3 9F
UTF-16 00 DF
UTF-32 00 00 00 DF
URL-Quoted %C3%9F
HTML hex reference ß
Wrong windows-1252 Mojibake ß
HTML named entity &szlig
HTML named entity ß
alias Eszett
Encoding: EUC-KR (hex bytes) A9 AC
Encoding: ISO-8859-10 (hex bytes) DF
Encoding: ISO-8859-13 (hex bytes) DF
Encoding: ISO-8859-14 (hex bytes) DF
Encoding: ISO-8859-15 (hex bytes) DF
Encoding: ISO-8859-16 (hex bytes) DF
Encoding: ISO-8859-2 (hex bytes) DF
Encoding: ISO-8859-3 (hex bytes) DF
Encoding: ISO-8859-4 (hex bytes) DF
Encoding: MACINTOSH (hex bytes) A7
Encoding: WINDOWS-1250 (hex bytes) DF
Encoding: WINDOWS-1252 (hex bytes) DF
Encoding: WINDOWS-1254 (hex bytes) DF
Encoding: WINDOWS-1257 (hex bytes) DF
Encoding: WINDOWS-1258 (hex bytes) DF
LATEX \ss
AGL: Latin-1 germandbls
AGL: Latin-2 germandbls
AGL: Latin-3 germandbls
AGL: Latin-4 germandbls
AGL: Latin-5 germandbls
Adobe Glyph List germandbls
digraph ss

Related Characters

Confusables

Elsewhere

Complete Record

Property Value
Age 1.1 (1993)
Unicode Name LATIN SMALL LETTER SHARP S
Unicode 1 Name
Block Latin-1 Supplement
General Category Lowercase Letter
Script Latin
Bidirectional Category Left To Right
Combining Class Not Reordered
Decomposition Type None
Decomposition Mapping Glyph for U+00DF Latin Small Letter Sharp S
Lowercase
Simple Lowercase Mapping Glyph for U+00DF Latin Small Letter Sharp S
Lowercase Mapping Glyph for U+00DF Latin Small Letter Sharp S
Uppercase
Simple Uppercase Mapping Glyph for U+00DF Latin Small Letter Sharp S
Uppercase Mapping Glyph for U+0053 Latin Capital Letter S Glyph for U+0053 Latin Capital Letter S
Simple Titlecase Mapping Glyph for U+00DF Latin Small Letter Sharp S
Titlecase Mapping Glyph for U+0053 Latin Capital Letter S Glyph for U+0073 Latin Small Letter S
Case Folding Glyph for U+0073 Latin Small Letter S Glyph for U+0073 Latin Small Letter S
ASCII Hex Digit
Alphabetic
Bidi Control
Bidi Mirrored
Composition Exclusion
Case Ignorable
Changes When Casefolded
Changes When Casemapped
Changes When NFKC Casefolded
Changes When Lowercased
Changes When Titlecased
Changes When Uppercased
Cased
Full Composition Exclusion
Default Ignorable Code Point
Dash
Deprecated
Diacritic
Emoji Modifier Base
Emoji Component
Emoji Modifier
Emoji Presentation
Emoji
Extender
Extended Pictographic
FC NFKC Closure Glyph for U+00DF Latin Small Letter Sharp S
Grapheme Cluster Break Any
Grapheme Base
Grapheme Extend
Grapheme Link
Hex Digit
Hyphen
ID Continue
ID Start
IDS Binary Operator
IDS Trinary Operator and
IDSU 0
ID_Compat_Math_Continue 0
ID_Compat_Math_Start 0
Ideographic
InCB None
Indic Mantra Category
Indic Positional Category NA
Indic Syllabic Category Other
Jamo Short Name
Join Control
Logical Order Exception
Math
Noncharacter Code Point
NFC Quick Check Yes
NFD Quick Check Yes
NFKC Casefold Glyph for U+0073 Latin Small Letter S Glyph for U+0073 Latin Small Letter S
NFKC Quick Check Yes
NFKC_SCF Glyph for U+00DF Latin Small Letter Sharp S
NFKD Quick Check Yes
Other Alphabetic
Other Default Ignorable Code Point
Other Grapheme Extend
Other ID Continue
Other ID Start
Other Lowercase
Other Math
Other Uppercase
Prepended Concatenation Mark
Pattern Syntax
Pattern White Space
Quotation Mark
Regional Indicator
Radical
Sentence Break Lower
Soft Dotted
Sentence Terminal
Terminal Punctuation
Unified Ideograph
Variation Selector
Word Break Alphabetic Letter
White Space
XID Continue
XID Start
Expands On NFC
Expands On NFD
Expands On NFKC
Expands On NFKD
Bidi Paired Bracket Glyph for U+00DF Latin Small Letter Sharp S
Bidi Paired Bracket Type None
East Asian Width Ambiguous
Hangul Syllable Type Not Applicable
ISO 10646 Comment
Joining Group No_Joining_Group
Joining Type Non Joining
Line Break Alphabetic
Numeric Type None
Numeric Value not a number
Simple Case Folding Glyph for U+00DF Latin Small Letter Sharp S
Script Extension
Vertical Orientation R