Home: go to the homepage U+0100 to U+017F Latin Extended-A
Glyph for U+0133
Source: Noto Sans

U+0133 Latin Small Ligature Ij

U+0133 was added in Unicode version 1.1 in 1993. It belongs to the block U+0100 to U+017F Latin Extended-A in the U+0000 to U+FFFF Basic Multilingual Plane.

This character is a Lowercase Letter and is mainly used in the Latin script. Its uppercase variant is Glyph for U+0132 Latin Capital Ligature Ij and its titlecase variant is Glyph for U+0132 Latin Capital Ligature Ij.

The glyph is a compatibility composition of the glyphs Glyph for U+0069 Latin Small Letter I, Glyph for U+006A Latin Small Letter J. Its width in East Asian texts is determined by its context. It can be displayed wide or narrow. In bidirectional text it is written from left to right. When changing direction it is not mirrored. The word that U+0133 forms with similar adjacent characters prevents a line break inside it. The glyph can be confused with one other glyph.

The Wikipedia has the following information about this codepoint:

IJ (lowercase ij; Dutch pronunciation: [ɛi] ; also encountered as Unicode compatibility characters IJ and ij) is a digraph of the letters i and j. Occurring in the Dutch language, it is sometimes considered a ligature, or a letter in itself. In most fonts that have a separate character for ij, the two composing parts are not connected but are separate glyphs, which are sometimes slightly kerned.

An ij in written Dutch usually represents the diphthong [ɛi], similar to the pronunciation of ⟨ay⟩ in "pay". In standard Dutch and most Dutch dialects, there are two possible spellings for the diphthong [ɛi]: ij and ei. That causes confusion for school children, who need to learn which words to write with ei and which with ij. To distinguish between the two, the ij is referred to as the lange ij ("long ij"), the ei as korte ei ("short ei") or simply E – I. In certain Dutch dialects (notably West Flemish and Zeelandic) and the Dutch Low Saxon dialects of Low German, a difference in the pronunciation of ei and ij is maintained. Whether it is pronounced identically to ei or not, the pronunciation of ij is often perceived as being difficult by people who do not have either sound in their native language.

The ij originally represented a 'long i'. It used to be written as ii, as in Finnish and Estonian, but for orthographic purposes, the second i was eventually elongated, which is a reason why it is called lange ij. This can still be seen in the pronunciation of some words like bijzonder (bi.zɔn.dər), and the etymology of some words in the Dutch form of several foreign placenames: Berlin and Paris are spelled Berlijn and Parijs. Nowadays, the pronunciation mostly follows the spelling, and they are pronounced with [ɛi]. The ij is distinct from the letter y. Particularly when writing capitals, Y used to be common instead of IJ in the past. That practice has long been deprecated, since 1804, and the standard Dutch alphabet has the letter ij in the alphabet between the "x" and the "z". The letter "y" is not part of the Dutch alphabet, but commonly used in loan words and archaic names. In scientific disciplines such as mathematics and physics, the symbol y is usually pronounced ij.

To distinguish the Y from IJ in common speech, however, Y is often called Griekse ij (meaning "Greek Y"), a literal translation of i-grec (from French, with the stress on grec: [iˈɡrɛk]) or alternatively called Ypsilon. In modern Dutch, the letter Y occurs only in loanwords, proper nouns, or when deliberately spelled as Early Modern Dutch. The spelling of Afrikaans (a daughter language of early modern Dutch) has evolved in the exact opposite direction and IJ has been completely replaced by Y.

However, the ancient use of Y in Dutch has survived in some personal names, particularly that of Dutch immigrants in the United States, Canada, Australia and New Zealand where as a result of anglicization, the IJ became a Y. For example, the surname Spijker was often changed into Spyker and Snijder into Snyder.

Representations

System Representation
307
UTF-8 C4 B3
UTF-16 01 33
UTF-32 00 00 01 33
URL-Quoted %C4%B3
HTML hex reference ij
Wrong windows-1252 Mojibake ij
HTML named entity ij
Encoding: CP949 (hex bytes) A9 A6
Encoding: EUC_JP (hex bytes) 8F A9 C6
Encoding: EUC_JIS_2004 (hex bytes) 8F A9 C6
Encoding: EUC_JISX0213 (hex bytes) 8F A9 C6
Encoding: EUC_KR (hex bytes) A9 A6
Encoding: GB18030 (hex bytes) 81 30 90 35
Encoding: ISO2022_JP_1 (hex bytes) 1B 24 28 44 29 46 1B 28 42
Encoding: ISO2022_JP_2 (hex bytes) 1B 24 28 44 29 46 1B 28 42
Encoding: ISO2022_JP_EXT (hex bytes) 1B 24 28 44 29 46 1B 28 42
Encoding: ISO2022_KR (hex bytes) 1B 24 29 43 0E 29 26 0F
Encoding: JOHAB (hex bytes) DD 36
LATEX ij
AGL: Latin-5 ij
Adobe Glyph List ij
digraph ij

Related Characters

Confusables

Elsewhere

Complete Record

Property Value
Age (age) 1.1 (1993)
Unicode Name (na) LATIN SMALL LIGATURE IJ
Unicode 1 Name (na1) LATIN SMALL LETTER I J
Block (blk) Latin Extended-A
General Category (gc) Lowercase Letter
Script (sc) Latin
Bidirectional Category (bc) Left To Right
Combining Class (ccc) Not Reordered
Decomposition Type (dt) compatibility
Decomposition Mapping (dm) Glyph for U+0069 Latin Small Letter I Glyph for U+006A Latin Small Letter J
Lowercase (Lower)
Simple Lowercase Mapping (slc) Glyph for U+0133 Latin Small Ligature Ij
Lowercase Mapping (lc) Glyph for U+0133 Latin Small Ligature Ij
Uppercase (Upper)
Simple Uppercase Mapping (suc) Glyph for U+0132 Latin Capital Ligature Ij
Uppercase Mapping (uc) Glyph for U+0132 Latin Capital Ligature Ij
Simple Titlecase Mapping (stc) Glyph for U+0132 Latin Capital Ligature Ij
Titlecase Mapping (tc) Glyph for U+0132 Latin Capital Ligature Ij
Case Folding (cf) Glyph for U+0133 Latin Small Ligature Ij
ASCII Hex Digit (AHex)
Alphabetic (Alpha)
Bidi Control (Bidi_C)
Bidi Mirrored (Bidi_M)
Composition Exclusion (CE)
Case Ignorable (CI)
Changes When Casefolded (CWCF)
Changes When Casemapped (CWCM)
Changes When NFKC Casefolded (CWKCF)
Changes When Lowercased (CWL)
Changes When Titlecased (CWT)
Changes When Uppercased (CWU)
Cased (Cased)
Full Composition Exclusion (Comp_Ex)
Default Ignorable Code Point (DI)
Dash (Dash)
Deprecated (Dep)
Diacritic (Dia)
Emoji Modifier Base (EBase)
Emoji Component (EComp)
Emoji Modifier (EMod)
Emoji Presentation (EPres)
Emoji (Emoji)
Extender (Ext)
Extended Pictographic (ExtPict)
FC NFKC Closure (FC_NFKC) Glyph for U+0133 Latin Small Ligature Ij
Grapheme Cluster Break (GCB) Any
Grapheme Base (Gr_Base)
Grapheme Extend (Gr_Ext)
Grapheme Link (Gr_Link)
Hex Digit (Hex)
Hyphen (Hyphen)
ID Continue (IDC)
ID Start (IDS)
IDS Binary Operator (IDSB)
IDS Trinary Operator and (IDST)
IDSU (IDSU) 0
ID_Compat_Math_Continue (ID_Compat_Math_Continue) 0
ID_Compat_Math_Start (ID_Compat_Math_Start) 0
Ideographic (Ideo)
InCB (InCB) None
Indic Mantra Category (InMC)
Indic Positional Category (InPC) NA
Indic Syllabic Category (InSC) Other
Jamo Short Name (JSN)
Join Control (Join_C)
Logical Order Exception (LOE)
Modifier Combining Mark (MCM)
Math (Math)
Noncharacter Code Point (NChar)
NFC Quick Check (NFC_QC) Yes
NFD Quick Check (NFD_QC) Yes
NFKC Casefold (NFKC_CF) Glyph for U+0069 Latin Small Letter I Glyph for U+006A Latin Small Letter J
NFKC Quick Check (NFKC_QC) No
NFKC_SCF (NFKC_SCF) Glyph for U+0069 Latin Small Letter I Glyph for U+006A Latin Small Letter J
NFKD Quick Check (NFKD_QC) No
Other Alphabetic (OAlpha)
Other Default Ignorable Code Point (ODI)
Other Grapheme Extend (OGr_Ext)
Other ID Continue (OIDC)
Other ID Start (OIDS)
Other Lowercase (OLower)
Other Math (OMath)
Other Uppercase (OUpper)
Prepended Concatenation Mark (PCM)
Pattern Syntax (Pat_Syn)
Pattern White Space (Pat_WS)
Quotation Mark (QMark)
Regional Indicator (RI)
Radical (Radical)
Sentence Break (SB) Lower
Soft Dotted (SD)
Sentence Terminal (STerm)
Terminal Punctuation (Term)
Unified Ideograph (UIdeo)
Variation Selector (VS)
Word Break (WB) Alphabetic Letter
White Space (WSpace)
XID Continue (XIDC)
XID Start (XIDS)
Expands On NFC (XO_NFC)
Expands On NFD (XO_NFD)
Expands On NFKC (XO_NFKC)
Expands On NFKD (XO_NFKD)
Bidi Paired Bracket (bpb) Glyph for U+0133 Latin Small Ligature Ij
Bidi Paired Bracket Type (bpt) None
East Asian Width (ea) ambiguous
Hangul Syllable Type (hst) Not Applicable
ISO 10646 Comment (isc)
Joining Group (jg) No_Joining_Group
Joining Type (jt) Non Joining
Line Break (lb) Alphabetic
Numeric Type (nt) none
Numeric Value (nv) not a number
Simple Case Folding (scf) Glyph for U+0133 Latin Small Ligature Ij
Script Extension (scx)
Vertical Orientation (vo) R