U+00DE Latin Capital Letter Thorn
U+00DE was added in Unicode version 1.1 in 1993. It belongs to the block
This character is a Uppercase Letter and is mainly used in the Latin script. Its lowercase variant is
The glyph is not a composition. Its width in East Asian texts is determined by its context. It can be displayed wide or narrow. In bidirectional text it is written from left to right. When changing direction it is not mirrored. The word that U+00DE forms with similar adjacent characters prevents a line break inside it. The glyph can be confused with 2 other glyphs.
The Wikipedia has the following information about this codepoint:
Thorn or þorn (Þ, þ) is a letter in the Old English, Old Norse, Old Swedish and modern Icelandic alphabets, as well as modern transliterations of the Gothic alphabet, Middle Scots, and some dialects of Middle English. It was also used in medieval Scandinavia but was later replaced with the digraph th, except in Iceland, where it survives. The letter originated from the rune ᚦ in the Elder Futhark and was called thorn in the Anglo-Saxon and thorn or thurs in the Scandinavian rune poems. It is similar in appearance to the archaic Greek letter sho (ϸ), although the two are historically unrelated. The only language in which þ is currently in use is Icelandic.
It is pronounced as either a voiceless dental fricative [θ] or its voiced counterpart [ð]. However, in modern Icelandic it is pronounced as a laminal voiceless alveolar non-sibilant fricative [θ̠], similar to th as in the English word thick, or a (usually apical) voiced alveolar non-sibilant fricative [ð̠], similar to th as in the English word the. Modern Icelandic usage generally excludes the latter, which is instead represented with the letter eth ⟨Ð, ð⟩; however, [ð̠] may occur as an allophone of /θ̠/, and written ⟨þ⟩, when it appears in an unstressed pronoun or adverb after a voiced sound.
In typography the lowercase thorn character is unusual in that it has both an ascender and a descender (other examples are the lowercase Cyrillic ф, and, in some [especially italic] fonts, the Latin letters f and ſ ).
Representations
System | Representation |
---|---|
Nº | 222 |
UTF-8 | C3 9E |
UTF-16 | 00 DE |
UTF-32 | 00 00 00 DE |
URL-Quoted | %C3%9E |
HTML hex reference | Þ |
Wrong windows-1252 Mojibake | Þ |
HTML named entity | Þ |
HTML named entity | Þ |
Encoding: CP037 (hex bytes) | AE |
Encoding: CP273 (hex bytes) | AE |
Encoding: CP500 (hex bytes) | AE |
Encoding: CP850 (hex bytes) | E8 |
Encoding: CP858 (hex bytes) | E8 |
Encoding: CP861 (hex bytes) | 8D |
Encoding: CP949 (hex bytes) | A8 AD |
Encoding: CP1140 (hex bytes) | AE |
Encoding: CP1252 (hex bytes) | DE |
Encoding: EUC_JP (hex bytes) | 8F A9 B0 |
Encoding: EUC_JIS_2004 (hex bytes) | A9 D4 |
Encoding: EUC_JISX0213 (hex bytes) | A9 D4 |
Encoding: EUC_KR (hex bytes) | A8 AD |
Encoding: GB18030 (hex bytes) | 81 30 89 37 |
Encoding: ISO2022_JP_1 (hex bytes) | 1B 24 28 44 29 30 1B 28 42 |
Encoding: ISO2022_JP_2 (hex bytes) | 1B 24 28 44 29 30 1B 28 42 |
Encoding: ISO2022_JP_2004 (hex bytes) | 1B 24 28 51 29 54 1B 28 42 |
Encoding: ISO2022_JP_3 (hex bytes) | 1B 24 28 4F 29 54 1B 28 42 |
Encoding: ISO2022_JP_EXT (hex bytes) | 1B 24 28 44 29 30 1B 28 42 |
Encoding: ISO2022_KR (hex bytes) | 1B 24 29 43 0E 28 2D 0F |
Encoding: LATIN_1 (hex bytes) | DE |
Encoding: ISO8859_10 (hex bytes) | DE |
Encoding: ISO8859_15 (hex bytes) | DE |
Encoding: JOHAB (hex bytes) | DC AD |
Encoding: MAC_ICELAND (hex bytes) | DE |
Encoding: SHIFT_JIS_2004 (hex bytes) | 85 73 |
Encoding: SHIFT_JISX0213 (hex bytes) | 85 73 |
Encoding: CP037 (hex bytes) | AE |
Encoding: CP1047 (hex bytes) | AE |
Encoding: CP1140 (hex bytes) | AE |
Encoding: CP1141 (hex bytes) | AE |
Encoding: CP1142 (hex bytes) | AE |
Encoding: CP1143 (hex bytes) | AE |
Encoding: CP1144 (hex bytes) | AE |
Encoding: CP1145 (hex bytes) | AE |
Encoding: CP1146 (hex bytes) | AE |
Encoding: CP1147 (hex bytes) | AE |
Encoding: CP1148 (hex bytes) | AE |
Encoding: CP1148MS (hex bytes) | AE |
Encoding: CP1149 (hex bytes) | 4A |
Encoding: CP273 (hex bytes) | AE |
Encoding: CP277 (hex bytes) | AE |
Encoding: CP278 (hex bytes) | AE |
Encoding: CP280 (hex bytes) | AE |
Encoding: CP284 (hex bytes) | AE |
Encoding: CP285 (hex bytes) | AE |
Encoding: CP297 (hex bytes) | AE |
Encoding: CP500 (hex bytes) | AE |
Encoding: CP500MS (hex bytes) | AE |
Encoding: CP871 (hex bytes) | 4A |
LATEX | \TH |
AGL: Latin-1 | Thorn |
AGL: Latin-2 | Thorn |
AGL: Latin-3 | Thorn |
AGL: Latin-4 | Thorn |
AGL: Latin-5 | Thorn |
Adobe Glyph List | Thorn |
digraph | TH |
Related Characters
Confusables
Elsewhere
Complete Record
Property | Value |
---|---|
1.1 (1993) | |
LATIN CAPITAL LETTER THORN | |
— | |
Latin-1 Supplement | |
Uppercase Letter | |
Latin | |
Left To Right | |
Not Reordered | |
none | |
|
|
✘ | |
|
|
|
|
✔ | |
|
|
|
|
|
|
|
|
|
|
✘ | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
✔ | |
✔ | |
✔ | |
✔ | |
✘ | |
✘ | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
|
|
Any | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
✔ | |
✔ | |
✘ | |
✘ | |
0 | |
0 | |
0 | |
✘ | |
None | |
— | |
NA | |
Other | |
— | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
Yes | |
Yes | |
|
|
Yes | |
|
|
Yes | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
Upper | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
Alphabetic Letter | |
✘ | |
✔ | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
|
|
None | |
ambiguous | |
Not Applicable | |
— | |
No_Joining_Group | |
Non Joining | |
Alphabetic | |
none | |
not a number | |
|
|
R |