U+00EF Latin Small Letter I with Diaeresis
U+00EF was added in Unicode version 1.1 in 1993. It belongs to the block
This character is a Lowercase Letter and is mainly used in the Latin script. Its uppercase variant is
The glyph is a canonical composition of the glyphs
The Wikipedia has the following information about this codepoint:
Ï, lowercase ï, is a symbol used in various languages written with the Latin alphabet; it can be read as the letter I with diaeresis, I-umlaut or I-trema.
Initially in French and also in Afrikaans, Catalan, Dutch, Galician, Southern Sami, Welsh, and occasionally English, ⟨ï⟩ is used when ⟨i⟩ follows another vowel and indicates hiatus in the pronunciation of such a word. It indicates that the two vowels are pronounced in separate syllables, rather than together as a diphthong or digraph. For example, French maïs (IPA: [ma.is] ; "maize"); without the diaeresis, the ⟨i⟩ is part of the digraph ⟨ai⟩: mais (IPA: [mɛ] ; *but"). The letter is also used in the same context in Dutch, as in Oekraïne (pronounced [ukraːˈ(j)inə] *and not [uˈkrɑinə]; "Ukraine"), and English naïve ( nah-EEV or ny-EEV).
In scholarly writing on Turkic languages, ⟨ï⟩ is sometimes used to write the close back unrounded vowel /ɯ/, which, in the standard modern Turkish alphabet, is written as the dotless i ⟨ı⟩. The back neutral vowel reconstructed in Proto-Mongolic is sometimes written ⟨ï⟩.
In the transcription of Amazonian languages, ⟨ï⟩ is used to represent the high central vowel [ɨ].
It is also a transliteration of the rune ᛇ.
Representations
System | Representation |
---|---|
Nº | 239 |
UTF-8 | C3 AF |
UTF-16 | 00 EF |
UTF-32 | 00 00 00 EF |
URL-Quoted | %C3%AF |
HTML hex reference | ï |
Wrong windows-1252 Mojibake | ï |
HTML named entity | ï |
HTML named entity | ï |
Encoding: CP037 (hex bytes) | 57 |
Encoding: CP273 (hex bytes) | 57 |
Encoding: CP437 (hex bytes) | 8B |
Encoding: CP500 (hex bytes) | 57 |
Encoding: CP720 (hex bytes) | 8B |
Encoding: CP850 (hex bytes) | 8B |
Encoding: CP857 (hex bytes) | 8B |
Encoding: CP858 (hex bytes) | 8B |
Encoding: CP863 (hex bytes) | 8B |
Encoding: CP865 (hex bytes) | 8B |
Encoding: CP1026 (hex bytes) | 57 |
Encoding: CP1140 (hex bytes) | 57 |
Encoding: CP1252 (hex bytes) | EF |
Encoding: CP1254 (hex bytes) | EF |
Encoding: CP1256 (hex bytes) | EF |
Encoding: CP1258 (hex bytes) | EF |
Encoding: EUC_JP (hex bytes) | 8F AB C1 |
Encoding: EUC_JIS_2004 (hex bytes) | A9 E5 |
Encoding: EUC_JISX0213 (hex bytes) | A9 E5 |
Encoding: GB18030 (hex bytes) | 81 30 8A 37 |
Encoding: ISO2022_JP_1 (hex bytes) | 1B 24 28 44 2B 41 1B 28 42 |
Encoding: ISO2022_JP_2 (hex bytes) | 1B 24 28 44 2B 41 1B 28 42 |
Encoding: ISO2022_JP_2004 (hex bytes) | 1B 24 28 51 29 65 1B 28 42 |
Encoding: ISO2022_JP_3 (hex bytes) | 1B 24 28 4F 29 65 1B 28 42 |
Encoding: ISO2022_JP_EXT (hex bytes) | 1B 24 28 44 2B 41 1B 28 42 |
Encoding: LATIN_1 (hex bytes) | EF |
Encoding: ISO8859_3 (hex bytes) | EF |
Encoding: ISO8859_9 (hex bytes) | EF |
Encoding: ISO8859_10 (hex bytes) | EF |
Encoding: ISO8859_14 (hex bytes) | EF |
Encoding: ISO8859_15 (hex bytes) | EF |
Encoding: ISO8859_16 (hex bytes) | EF |
Encoding: MAC_GREEK (hex bytes) | 95 |
Encoding: MAC_ICELAND (hex bytes) | 95 |
Encoding: MAC_ROMAN (hex bytes) | 95 |
Encoding: MAC_TURKISH (hex bytes) | 95 |
Encoding: SHIFT_JIS_2004 (hex bytes) | 85 85 |
Encoding: SHIFT_JISX0213 (hex bytes) | 85 85 |
Encoding: CP037 (hex bytes) | 57 |
Encoding: CP1047 (hex bytes) | 57 |
Encoding: CP1122 (hex bytes) | 57 |
Encoding: CP1140 (hex bytes) | 57 |
Encoding: CP1141 (hex bytes) | 57 |
Encoding: CP1142 (hex bytes) | 57 |
Encoding: CP1143 (hex bytes) | 57 |
Encoding: CP1144 (hex bytes) | 57 |
Encoding: CP1145 (hex bytes) | 57 |
Encoding: CP1146 (hex bytes) | 57 |
Encoding: CP1147 (hex bytes) | 57 |
Encoding: CP1148 (hex bytes) | 57 |
Encoding: CP1148MS (hex bytes) | 57 |
Encoding: CP1149 (hex bytes) | 57 |
Encoding: CP273 (hex bytes) | 57 |
Encoding: CP277 (hex bytes) | 57 |
Encoding: CP278 (hex bytes) | 57 |
Encoding: CP280 (hex bytes) | 57 |
Encoding: CP284 (hex bytes) | 57 |
Encoding: CP285 (hex bytes) | 57 |
Encoding: CP297 (hex bytes) | 57 |
Encoding: CP500 (hex bytes) | 57 |
Encoding: CP500MS (hex bytes) | 57 |
Encoding: CP871 (hex bytes) | 57 |
LATEX | \"{\i} |
AGL: Latin-1 | idieresis |
AGL: Latin-2 | idieresis |
AGL: Latin-3 | idieresis |
AGL: Latin-4 | idieresis |
AGL: Latin-5 | idieresis |
Adobe Glyph List | idieresis |
digraph | i: |
Related Characters
Elsewhere
Complete Record
Property | Value |
---|---|
1.1 (1993) | |
LATIN SMALL LETTER I WITH DIAERESIS | |
LATIN SMALL LETTER I DIAERESIS | |
Latin-1 Supplement | |
Lowercase Letter | |
Latin | |
Left To Right | |
Not Reordered | |
canonical | |
|
|
✔ | |
|
|
|
|
✘ | |
|
|
|
|
|
|
|
|
|
|
✘ | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✔ | |
✘ | |
✘ | |
✔ | |
✔ | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
|
|
Any | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
✔ | |
✔ | |
✘ | |
✘ | |
0 | |
0 | |
0 | |
✘ | |
None | |
— | |
NA | |
Other | |
— | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
Yes | |
No | |
|
|
Yes | |
|
|
No | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
Lower | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
Alphabetic Letter | |
✘ | |
✔ | |
✔ | |
✘ | |
✔ | |
✘ | |
✔ | |
|
|
None | |
neutral | |
Not Applicable | |
— | |
No_Joining_Group | |
Non Joining | |
Alphabetic | |
none | |
not a number | |
|
|
R |