U+1C73 was added to Unicode in version 5.1 (2008). It belongs to the block Ol Chiki in the Basic Multilingual Plane.

This character is an Other Letter and is mainly used in the Ol Chiki script.

The glyph is not a composition. It has a Neutral East Asian Width. In bidirectional context it acts as Left To Right and is not mirrored. In text U+1C73 behaves as Alphabetic regarding line breaks. It has type OLetter for sentence and ALetter for word breaks. The Grapheme Cluster Break is Any.
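Several of the properties listed above can be checked against the Unicode database bundled with a programming language. The following is a minimal sketch using Python's standard `unicodedata` module; the variable names `ch` and `properties` are chosen here for illustration only. The module does not expose the line, word, sentence, or grapheme cluster break properties, so those are not covered.

```python
import unicodedata

# The codepoint described on this page.
ch = "\u1C73"

# Look up the properties that the standard library exposes.
properties = {
    "category": unicodedata.category(ch),                  # general category
    "bidirectional": unicodedata.bidirectional(ch),        # bidi class
    "mirrored": unicodedata.mirrored(ch),                  # 1 if mirrored, else 0
    "east_asian_width": unicodedata.east_asian_width(ch),  # East Asian Width
    "decomposition": unicodedata.decomposition(ch),        # empty if none
}

for key, value in properties.items():
    print(f"{key}: {value!r}")
```

Here `Lo` corresponds to Other Letter, `L` to Left To Right, `N` to Neutral, and an empty decomposition string to a glyph that is not a composition.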

Wikipedia has the following information about this codepoint:

The Ol Chiki script, also known as Ol Cemetʼ (Santali: ol 'writing', cemetʼ 'learning'), Ol Ciki, Ol, and sometimes as the Santali alphabet, was created in 1925 by Raghunath Murmu for the Santali language.

Previously, Santali had been written with the Latin alphabet. Because Santali, like most languages of southern India, is not an Indo-Aryan language, Indic scripts did not have letters for all of Santali's phonemes, especially its stop consonants and vowels, which made it difficult to write the language accurately in an unmodified Indic script. A detailed analysis was given by Byomkes Chakrabarti in his "Comparative Study of Santali and Bengali". Missionaries (first of all Paul Olaf Bodding, a Norwegian) brought the Latin script, which, with the use of diacritical marks and accents, is better at representing Santali stops, phonemes and nasal sounds. Unlike most Indic scripts, which are derived from Brahmi, Ol Chiki is not an abugida: vowels are given equal representation with consonants. Additionally, although it was designed specifically for the language, one letter could not be assigned to each phoneme because the sixth vowel in Ol Chiki is still problematic.

Ol Chiki has 30 letters, the forms of which are intended to evoke natural shapes. Linguist Norman Zide said "The shapes of the letters are not arbitrary, but reflect the names for the letters, which are words, usually the names of objects or actions representing conventionalized form in the pictorial shape of the characters." It is written from left to right.


System                          Representation
UTF-8                           E1 B1 B3
UTF-16                          1C 73
UTF-32                          00 00 1C 73
URL-Quoted                      %E1%B1%B3
HTML-Escape                     &#x1C73;
Wrong windows-1252 Mojibake     á±³
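
The representations in the table above can be reproduced with a few lines of Python. This is a minimal sketch using only the standard library; the helper `hex_bytes` and the dictionary `representations` are names invented for this example. It assumes big-endian byte order without a byte order mark for UTF-16 and UTF-32, which matches the byte sequences shown in the table.

```python
from urllib.parse import quote

# The codepoint described on this page.
ch = "\u1C73"


def hex_bytes(data: bytes) -> str:
    """Format bytes as space-separated uppercase hex pairs."""
    return " ".join(f"{b:02X}" for b in data)


utf8 = ch.encode("utf-8")

representations = {
    "UTF-8": hex_bytes(utf8),
    # Big-endian, no byte order mark (an assumption matching the table).
    "UTF-16": hex_bytes(ch.encode("utf-16-be")),
    "UTF-32": hex_bytes(ch.encode("utf-32-be")),
    # quote() percent-encodes the UTF-8 bytes by default.
    "URL-Quoted": quote(ch),
    # Hexadecimal numeric character reference.
    "HTML-Escape": f"&#x{ord(ch):X};",
    # Mojibake: the UTF-8 bytes misread as windows-1252.
    "Mojibake": utf8.decode("cp1252"),
}

for system, value in representations.items():
    print(f"{system}: {value}")
```

The mojibake row arises because each of the three UTF-8 bytes (E1, B1, B3) maps to its own character in windows-1252, so one Ol Chiki letter is displayed as three unrelated Latin-1 symbols.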


