Home: go to the homepage Search All Planes Random

Browse Codepoints by Script

See writing scripts and systems how they are used around the world. Click on a script name for additional information.

  • Adlam: 90 code points

    90 characters are encoded in this script.

    The Adlam script is a script used to write Fulani. The name Adlam is an acronym derived from the first four letters of the alphabet, standing for Alkule Dandayɗe Leñol Mulugol, which means "the alphabet that protects the peoples from vanishing". It is one of many indigenous scripts developed for specific languages in West Africa.

  • Afaka: 0 code points

    0 characters are encoded in this script.

    The Afaka script is a syllabary of 56 letters devised in 1910 for the Ndyuka language, an English-based creole of Suriname. The script is named after its inventor, Afáka Atumisi. It continues to be used to write Ndyuka in the 21st century, but the literacy rate in the language for all scripts is under 10%.

  • Caucasian Albanian: 53 code points

    53 characters are encoded in this script.

    The Caucasian Albanian script was an alphabetic writing system used by the Caucasian Albanians, one of the ancient Northeast Caucasian peoples whose territory comprised parts of the present-day Republic of Azerbaijan and Dagestan.

  • Ahom: 65 code points

    65 characters are encoded in this script.

    The Ahom script or Tai Ahom Script is an abugida that is used to write the Ahom language, a dormant Tai language undergoing revival spoken by the Ahom people till the late 18th-century, who established the Ahom kingdom and ruled the eastern part of the Brahmaputra valley between the 13th and the 18th centuries. The old Ahom language today survives in the numerous manuscripts written in this script currently in institutional and private possession.

  • Arabic: 1414 code points

    1414 characters are encoded in this script.

    The Arabic script is the writing system used for Arabic and several other languages of Asia and Africa. It is the second-most widely used alphabetic writing system in the world, the second-most widely used writing system in the world by number of countries using it, and the third-most by number of users.

  • Nastaliq Arabic: 0 code points

    0 characters are encoded in this script.

    Nastaliq, also romanized as Nastaʿlīq or Nastaleeq, is one of the main calligraphic hands used to write the Perso-Arabic script, and it is used for some Indo-Iranian languages, predominantly Classical Persian, Urdu, Kashmiri, and Punjabic languages such as Punjabi and Lahnda varieties (Shahmukhi). It is often used also for Ottoman Turkish poetry, but rarely for Arabic. Nastaliq developed in Iran from naskh beginning in the 13th century and remains very widely used in Iran, Afghanistan, India and Pakistan and other countries for written poetry and as a form of art.

  • Imperial Aramaic: 31 code points

    31 characters are encoded in this script.

    The ancient Aramaic alphabet was used to write the Aramaic languages spoken by ancient Aramean pre-Christian tribes throughout the Fertile Crescent. It was also adopted by other peoples as their own alphabet when empires and their subjects underwent linguistic Aramaization during a language shift for governing purposes—a precursor to Arabization centuries later—including among Assyrians who permanently replaced their Akkadian language and its cuneiform script with Aramaic and its script, and among Jews, who adopted the Aramaic language as their vernacular and started using the Aramaic alphabet even for writing Hebrew, displacing the former Paleo-Hebrew alphabet..

  • Armenian: 96 code points

    96 characters are encoded in this script.

    The Armenian alphabet, or more broadly the Armenian script, is an alphabetic writing system developed for Armenian and occasionally used to write other languages. It was developed around 405 AD by Mesrop Mashtots, an Armenian linguist and ecclesiastical leader. There are several inscriptions in Armenian lettering from Sinai and Nazareth that date to the beginning of the 5th century. The script originally had 36 letters; eventually, two more were adopted. It was in wide use in the Ottoman Empire in the 18th and 19th centuries.

  • Avestan: 61 code points

    61 characters are encoded in this script.

    The Avestan alphabet is a writing system developed during Iran's Sasanian era (226–651 CE) to render the Avestan language.

  • Balinese: 124 code points

    124 characters are encoded in this script.

    The Balinese script, natively known as Aksara Bali and Hanacaraka, is an abugida used in the island of Bali, Indonesia, commonly for writing the Austronesian Balinese language, Old Javanese, and the liturgical language Sanskrit. With some modifications, the script is also used to write the Sasak language, used in the neighboring island of Lombok. The script is a descendant of the Brahmi script, and so has many similarities with the modern scripts of South and Southeast Asia. The Balinese script, along with the Javanese script, is considered the most elaborate and ornate among Brahmic scripts of Southeast Asia.

  • Bamum: 657 code points

    657 characters are encoded in this script.

    The Bamum scripts are an evolutionary series of six scripts created for the Bamum language by Ibrahim Njoya, King of Bamum at the turn of the 19th century. They are notable for evolving from a pictographic system to a semi-syllabary in the space of fourteen years, from 1896 to 1910. Bamum type was cast in 1918, but the script fell into disuse around 1931. A project began around 2007 to revive the Bamum script.

  • Bassa Vah: 36 code points

    36 characters are encoded in this script.

    Bassa Vah, also known as simply Vah is an alphabetic script for writing the Bassa language of Liberia. As an old system nearing extinction in the 1900s, it was rediscovered among Bassa in Brazil and the West Indies, then revived in Liberia, by Thomas Flo Lewis. Type was cast for it, and an association for its promotion was formed in Liberia in 1959. It is not used today and has been classified as a failed script.

  • Batak: 56 code points

    56 characters are encoded in this script.

    The Batak script is a writing system used to write the Austronesian Batak languages spoken by several million people on the Indonesian island of Sumatra. The script may be derived from the Kawi and Pallava script, ultimately derived from the Brahmi script of India, or from the hypothetical Proto-Sumatran script influenced by Pallava.

  • Bengali: 113 code points

    113 characters are encoded in this script.

    The Bengali–Assamese script, also known as Eastern Nagari, is a modern eastern Indic script that emerged from the Brahmi script. Gaudi script is considered the ancestor of the script. It is known as Bengali script among Bengali speakers, as Assamese script among Assamese speakers, and Eastern-Nāgarī is used in academic discourse.

  • Bhaiksuki: 97 code points

    97 characters are encoded in this script.

    Bhaiksuki is a Brahmi-based script that was used around the 11th and 12th centuries CE. It used to be known in English as the "Arrow-Headed Script" or "Point-Headed Script," while an older designation, "Sindhura," had been used in Tibet for at least three centuries. Records showing usage of the script mainly appeared in the present-day states of Bihar and West Bengal in India, and in regions of Bangladesh. Records have also been located in Tibet, Nepal, and Burma.

  • Bliss: 0 code points

    0 characters are encoded in this script.

    Blissymbols or Blissymbolics is a constructed language conceived as an ideographic writing system called Semantography consisting of several hundred basic symbols, each representing a concept, which can be composed together to generate new symbols that represent new concepts. Blissymbols differ from most of the world's major writing systems in that the characters do not correspond at all to the sounds of any spoken language.

  • Bopomofo: 117 code points

    117 characters are encoded in this script.

    Bopomofo, or Mandarin Phonetic Symbols, also named Zhuyin, is a Chinese transliteration and writing system for Mandarin Chinese and other related languages and dialects. More commonly used in Taiwanese Mandarin, it may also be used to transcribe other varieties of Chinese, particularly other varieties of Mandarin Chinese dialects, as well as Taiwanese Hokkien. Consisting of 37 characters and five tone marks, it transcribes all possible sounds in Mandarin.

  • Brahmi: 115 code points

    115 characters are encoded in this script.

    Brahmi is a writing system of ancient South Asia that appeared as a fully developed script in the third century BCE. Its descendants, the 198 Brahmic scripts continue to be used today across South and Southeast Asia, making it one of the world's most influential writing traditions.

  • Braille: 256 code points

    256 characters are encoded in this script.

    Braille is a tactile writing system used by people who are visually impaired, including people who suffer from low vision, blindness, or deafblindness. It can be read either on embossed paper or by using refreshable braille displays that connect to computers and smartphone devices. Braille can be written using a slate and stylus, a braille writer, an electronic braille notetaker or with the use of a computer connected to a braille embosser.

  • Buginese: 31 code points

    31 characters are encoded in this script.

    The Lontara script, also known as the Bugis script, Bugis-Makassar script, or Urupu Sulapa’ Eppa’ "four-cornered letters", is one of Indonesia's traditional scripts developed in the South Sulawesi and West Sulawesi region. The script is primarily used to write the Buginese language, followed by Makassarese and Mandar. Closely related variants of Lontara are also used to write several languages outside of Sulawesi such as Bima, Ende, and Sumbawa. The script was actively used by several South Sulawesi societies for day-to-day and literary texts from at least mid-15th Century CE until the mid-20th Century CE, before its function was gradually supplanted by the Latin alphabet. Today the script is taught in South Sulawesi Province as part of the local curriculum, but with very limited usage in everyday life.

  • Buhid: 22 code points

    22 characters are encoded in this script.

    Surat Buhid is an abugida used to write the Buhid language. As a Brahmic script indigenous to the Philippines, it closely related to Baybayin and Hanunó'o. It is still used today by the Mangyans, found mainly on island of Mindoro, to write their language, Buhid, together with the Filipino latin script.

  • Chakma: 91 code points

    91 characters are encoded in this script.

    The Chakma Script, also called Ajhā pāṭh, Ojhapath, Ojhopath, Aaojhapath, is an abugida used for the Chakma language, and recently for the Pali language.

  • Canadian Aboriginal: 726 code points

    726 characters are encoded in this script.

    Canadian syllabic writing, or simply syllabics, is a family of writing systems used in a number of Indigenous Canadian languages of the Algonquian, Inuit, and (formerly) Athabaskan language families. These languages had no formal writing system previously. They are valued for their distinctiveness from the Latin script and for the ease with which literacy can be achieved; indeed, by the late 19th century the Cree had achieved what may have been one of the highest rates of literacy in the world.

  • Carian: 49 code points

    49 characters are encoded in this script.

    The Carian alphabets are a number of regional scripts used to write the Carian language of western Anatolia. They consisted of some 30 alphabetic letters, with several geographic variants in Caria and a homogeneous variant attested from the Nile delta, where Carian mercenaries fought for the Egyptian pharaohs. They were written left-to-right in Caria and right-to-left in Egypt.

  • Cham: 83 code points

    83 characters are encoded in this script.

    The Cham script is a Brahmic abugida used to write Cham, an Austronesian language spoken by some 245,000 Chams in Vietnam and Cambodia. It is written horizontally left to right, just like other Brahmic abugidas.

  • Cherokee: 172 code points

    172 characters are encoded in this script.

    The Cherokee syllabary is a syllabary invented by Sequoyah in the late 1810s and early 1820s to write the Cherokee language. His creation of the syllabary is particularly noteworthy as he was illiterate until its creation. He first experimented with logograms, but his system later developed into the syllabary. In his system, each symbol represents a syllable rather than a single phoneme; the 85 characters provide a suitable method for writing Cherokee. Although some symbols resemble Latin, Greek, Cyrillic, and Glagolitic letters, they are not used to represent the same sounds.

  • Chorasmian: 28 code points

    28 characters are encoded in this script.

  • Cirth: 0 code points

    0 characters are encoded in this script.

    The Cirth is a semi‑artificial script, based on real‑life runic alphabets, one of several scripts invented by J. R. R. Tolkien for the constructed languages he devised and used in his works. Cirth is written with a capital letter when referring to the writing system; the letters themselves can be called cirth.

  • Coptic: 165 code points

    165 characters are encoded in this script.

    The Coptic script is the script used for writing the Coptic language, the latest stage of Egyptian. The repertoire of glyphs is based on the uncial Greek alphabet, augmented by letters borrowed from the Egyptian Demotic. It was the first alphabetic script used for the Egyptian language. There are several Coptic alphabets, as the script varies greatly among the various dialects and eras of the Coptic language.

  • Cypro Minoan: 101 code points

    101 characters are encoded in this script.

    The Cypro-Minoan syllabary (CM) is an undeciphered syllabary used on the island of Cyprus during the late Bronze Age. The term "Cypro-Minoan" was coined by Arthur Evans in 1909 based on its visual similarity to Linear A on Minoan Crete, from which CM is thought to be derived. Approximately 250 objects—such as clay balls, cylinders, and tablets and votive stands—which bear Cypro-Minoan inscriptions, have been found. Discoveries have been made at various sites around Cyprus, as well as in the ancient city of Ugarit on the Syrian coast.

  • Cypriot: 112 code points

    112 characters are encoded in this script.

    The Cypriot or Cypriote syllabary is a syllabic script used in Iron Age Cyprus, from about the 11th to the 4th centuries BCE, when it was replaced by the Greek alphabet. A pioneer of that change was King Evagoras of Salamis. It is descended from the Cypro-Minoan syllabary, in turn, a variant or derivative of Linear A. Most texts using the script are in the Arcadocypriot dialect of Greek, but also one bilingual inscription was found in Amathus.

  • Cyrillic: 510 code points

    510 characters are encoded in this script.

    The Cyrillic script, Slavonic script or the Slavic script is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Eastern Europe, the Caucasus, Central Asia, North Asia, and East Asia, and used by many other minority languages.

  • Old Cyrillic: 0 code points

    0 characters are encoded in this script.

    The Early Cyrillic alphabet, also called classical Cyrillic or paleo-Cyrillic, is a writing system that was developed in Medieval Bulgaria in the Preslav Literary School during the late 9th century. The modern Cyrillic script is used for some Slavic languages, and for East European and Asian languages that have experienced a great amount of Russian cultural influence.

  • Devanagari: 220 code points

    220 characters are encoded in this script.

    Devanāgarī or Devanagari, also called Nāgarī, is a left-to-right abugida, based on the ancient Brāhmī script, used in the northern Indian subcontinent. It is one of the official scripts of the Republic of India and Nepal. It was developed and in regular use by the 7th century CE. The Devanāgarī script, composed of 48 primary characters, including 14 vowels and 34 consonants, is the fourth most widely adopted writing system in the world, being used for over 120 languages.

  • Dhives Akuru: 72 code points

    72 characters are encoded in this script.

    Dhives Akuru, later called Dhivehi Akuru is a script formerly used for the Maldivian language. The name can be alternatively spelled Dives Akuru or Divehi Akuru using the ISO 15919 Romanization scheme, as the "d" is unaspirated.

  • Dogri: 82 code points

    82 characters are encoded in this script.

    The Dogri script, also known as the Name Dogra Akkhar script is a writing system originally used for writing the Dogri language in Jammu and Kashmir in the northern part of the Indian subcontinent.

  • Deseret: 80 code points

    80 characters are encoded in this script.

    The Deseret alphabet is a phonemic English-language spelling reform developed between 1847 and 1854 by the board of regents of the University of Deseret under the leadership of Brigham Young, the second president of the Church of Jesus Christ of Latter-day Saints. George D. Watt is reported to have been the most actively involved in the development of the script's novel characters, which were used to replace those of Isaac Pitman's English phonotypic alphabet. He was also the "New Alphabet's" first serious user.

  • Duployan: 147 code points

    147 characters are encoded in this script.

    The Duployan shorthand, or Duployan stenography, was created by Father Émile Duployé in 1860 for writing French. Since then, it has been expanded and adapted for writing English, German, Spanish, Romanian, Latin, Danish, and Chinook Jargon. The Duployan stenography is classified as a geometric, alphabetic stenography and is written left-to-right in connected stenographic style. The Duployan shorthands, including Chinook writing, Pernin's Universal Phonography, Perrault's English Shorthand, the Sloan-Duployan Modern Shorthand, and Romanian stenography, were included as a single script in version 7.0 of the Unicode Standard / ISO 10646

  • Demotic: 0 code points

    0 characters are encoded in this script.

    Demotic is the ancient Egyptian script derived from northern forms of hieratic used in the Nile Delta. The term was first used by the Greek historian Herodotus to distinguish it from hieratic and hieroglyphic scripts. By convention, the word "Demotic" is capitalized in order to distinguish it from demotic Greek.

  • Hieratic: 0 code points

    0 characters are encoded in this script.

    Hieratic is the name given to a cursive writing system used for Ancient Egyptian and the principal script used to write that language from its development in the third millennium BC until the rise of Demotic in the mid-first millennium BC. It was primarily written in ink with a reed pen on papyrus.

  • Egyptian Hieroglyphs: 1110 code points

    1110 characters are encoded in this script.

    Egyptian hieroglyphs were the formal writing system used in Ancient Egypt for writing the Egyptian language. Hieroglyphs combined logographic, syllabic and alphabetic elements, with some 1,000 distinct characters. Cursive hieroglyphs were used for religious literature on papyrus and wood. The later hieratic and demotic Egyptian scripts were derived from hieroglyphic writing, as was the Proto-Sinaitic script that later evolved into the Phoenician alphabet. Through the Phoenician alphabet's major child systems, the Egyptian hieroglyphic script is ancestral to the majority of scripts in modern use, most prominently the Latin and Cyrillic scripts and the Arabic script, and possibly the Brahmic family of scripts.

  • Elbasan: 40 code points

    40 characters are encoded in this script.

    The Elbasan alphabet is a mid 18th-century alphabetic script created for the Albanian language Elbasan Gospel Manuscript, also known as the Anonimi i Elbasanit, which is the only document written in it. The document was created at St. Jovan Vladimir's Church in central Albania, but is preserved today at the National Archives of Albania in Tirana. The alphabet, like the manuscript, is named after the city of Elbasan, where it was invented, and although the manuscript isn't the oldest document written in Albanian, Elbasan is the oldest out of seven known original alphabets created for Albanian. Its 59 pages contain Biblical content written in a script of 40 letters, of which 35 frequently recur and 5 are rare.

  • Elymaic: 23 code points

    23 characters are encoded in this script.

    The Elymaic alphabet is a right-to-left, non-joining abjad. It is derived from the Aramaic alphabet. Elymaic was used in the ancient state of Elymais, which was a semi-independent state of the 2nd century BCE to the early 3rd century CE, frequently a vassal under Parthian control, in the present-day region of Khuzestan, Iran (Susiana).

  • Ethiopic: 523 code points

    523 characters are encoded in this script.

    Geʽez is a script used as an abugida (alphasyllabary) for several Afro-Asiatic and Nilo-Saharan languages of Ethiopia and Eritrea. It originated as an abjad and was first used to write the Geʽez language, now the liturgical language of the Ethiopian Orthodox Tewahedo Church, the Eritrean Orthodox Tewahedo Church, the Eritrean Catholic Church, the Ethiopian Catholic Church, and Haymanot Judaism of the Beta Israel Jewish community in Ethiopia. In the languages Amharic and Tigrinya, the script is often called fidäl (ፊደል), meaning “script” or “letter”. Under the Unicode Standard and ISO 15924, it is defined as Ethiopic text.

  • Chutsuri: 0 code points

    0 characters are encoded in this script.

  • Georgian: 174 code points

    174 characters are encoded in this script.

  • Glagolitic: 138 code points

    138 characters are encoded in this script.

    The Glagolitic script is the oldest known Slavic alphabet. It is generally agreed to have been created in the 9th century by Saint Cyril, a monk from Thessalonica. He and his brother Saint Methodius were sent by the Byzantine Emperor Michael III in 863 to Great Moravia to spread Christianity among the West Slavs in the area. The brothers decided to translate liturgical books into the contemporary Slavic language understandable to the general population. As the words of that language could not be easily written by using either the Greek or Latin alphabets, Cyril decided to invent a new script, Glagolitic, which he based on the local dialect of the Slavic tribes from the Byzantine theme of Thessalonica.

  • Gundjala Gondi: 65 code points

    65 characters are encoded in this script.

    The Gunjala Gondi lipi or Gunjala Gondi script is a script used to write the Gondi language, a Dravidian language spoken by the Gond people of northern Telangana, eastern Maharashtra, southeastern Madhya Pradesh, and Chhattisgarh. Approximately a dozen manuscripts in the script were recovered from Gunjala, a Gond village in Adilabad district of Telangana, by a team of researchers from the University of Hyderabad, led by Professor Jayadheer Tirumala Rao. The script and preliminary font were unveiled in early 2014.

  • Gondi: 77 code points

    77 characters are encoded in this script.

    Gondi has typically been written in Devanagari script or Telugu script, but native scripts are in existence. A Gond by the name of Munshi Mangal Singh Masaram designed a Brahmi-based script in 1918, and in 2006, a native script that dates up to 1750 has been discovered by a group of researchers from the University of Hyderabad.

  • Gothic: 27 code points

    27 characters are encoded in this script.

    The Gothic alphabet is an alphabet used for writing the Gothic language. Ulfilas developed it in the 4th century AD for the purpose of translating the Bible.

  • Grantha: 116 code points

    116 characters are encoded in this script.

    The Grantha script was a classical South-Indian script, found particularly in Tamil Nadu and Kerala. Originating from the Pallava script, the Grantha script is related to Tamil and Vatteluttu scripts. The modern Malayalam script of Kerala is a direct descendant of the Grantha script. The Southeast Asian and Indonesian scripts such as Thai and Javanese respectively, as well as South Asian Tigalari and Sinhala scripts, are derived or closely related to Grantha through the early Pallava script. The Pallava script or Pallava Grantha, emerged in the 4th century CE and was used until the 7th century CE, in India. This early Grantha script was used to write Sanskrit texts, inscriptions on copper plates and stones of Hindu temples and monasteries. It was also used for classical Manipravalam – a language that is a blend of Sanskrit and Tamil. From it evolved Middle Grantha by the 7th century, and Transitional Grantha by about the 8th century, which remained in use until about the 14th century. Modern Grantha has been in use since the 14th century and into the modern era, to write classical texts in Sanskrit and Dravidian languages. It is also used to chant hymns and in traditional Vedic schools.

  • Greek: 522 code points

    522 characters are encoded in this script.

    The Greek alphabet has been used to write the Greek language since the late 9th or early 8th century BC. It is derived from the earlier Phoenician alphabet, and was the earliest known alphabetic script to have distinct letters for vowels as well as consonants. In Archaic and early Classical times, the Greek alphabet existed in many local variants, but, by the end of the 4th century BC, the Euclidean alphabet, with 24 letters, ordered from alpha to omega, had become standard and it is this version that is still used for Greek writing today.

  • Gujarati: 105 code points

    105 characters are encoded in this script.

    The Gujarati script is an abugida for the Gujarati language, Kutchi language, and various other languages. It is one of the official scripts of the Indian Republic. It is a variant of the Devanagari script differentiated by the loss of the characteristic horizontal line running above the letters and by a number of modifications to some characters.

  • Gurmukhi: 94 code points

    94 characters are encoded in this script.

    Gurmukhī or Gurumukhī is an Indic script predominantly used in present-day Punjab, India. It is an abugida developed from the Laṇḍā scripts, standardized and used by the second Sikh guru, Guru Angad (1504–1552). It is commonly regarded as a Sikh script, used by Punjabi Sikhs to write the Punjabi language, and is one of the official scripts of the Indian Republic, while the Arabic-based Shahmukhi script is used in Punjab, Pakistan.

  • Han Bopomofo: 0 code points

    0 characters are encoded in this script.

    ISO 15924, Codes for the representation of names of scripts, is an international standard defining codes for writing systems or scripts. Each script is given both a four-letter code and a numeric code.

  • Hangul: 11775 code points

    11775 characters are encoded in this script.

    The Korean alphabet, known as Hangul in South Korea and Chosŏn'gŭl (조선글) in North Korea, is the modern official writing system for the Korean language. The letters for the five basic consonants reflect the shape of the speech organs used to pronounce them, and they are systematically modified to indicate phonetic features; similarly, the vowel letters are systematically modified for related sounds, making Hangul a featural writing system. It has been described as a syllabic alphabet as it combines the features of alphabetic and syllabic writing systems, although it is not necessarily an abugida.

  • Han: 99318 code points

    99318 characters are encoded in this script.

    Chinese characters are logograms developed for the writing of Chinese. Chinese characters are the oldest continuously used system of writing in the world. By virtue of their widespread current use throughout East Asia and Southeast Asia, as well as their profound historic use throughout the Sinosphere, Chinese characters are among the most widely adopted writing systems in the world by number of users.

  • Hanunoo: 23 code points

    23 characters are encoded in this script.

    Hanunoo, also rendered Hanunó'o, is one of the scripts indigenous to the Philippines and is used by the Mangyan peoples of southern Mindoro to write the Hanunó'o language.

  • Han Simplified: 0 code points

    0 characters are encoded in this script.

    Simplified Chinese characters are one type of standard Chinese character sets. Along with traditional Chinese characters, they are one of the two standard character sets of the contemporary Chinese written language. The government of the People's Republic of China in mainland China has promoted them for use in printing since the 1950s and 1960s to encourage literacy. They are officially used in the People's Republic of China, Malaysia and Singapore, while traditional Chinese characters remain in common use in Hong Kong, Macau, Taiwan and Japan, as well as South Korea to a certain extent.

  • Han Traditional: 0 code points

    0 characters are encoded in this script.

    Traditional Chinese characters are one set of standard Chinese characters used for written Chinese. Their modern shapes first appeared with the emergence of the clerical script during the Han dynasty beginning around 200 BC, and were standardized with the introduction of the regular script beginning in the 2nd century AD. They remained the standard form of printed Chinese characters or literary Chinese throughout the Sinosphere until the middle of the 20th century, when various countries that adopted Chinese characters as a writing system started to introduce their own simplified versions of Chinese scripts.

  • Hatran: 26 code points

    26 characters are encoded in this script.

    Aramaic of Hatra, Hatran Aramaic or Ashurian designates a Middle Aramaic dialect, that was used in the region of Hatra and Assur in northeastern parts of Mesopotamia, approximately from the 3rd century BC to the 3rd century CE. Its range extended from the Nineveh Plains in the centre, up to Tur Abdin in the north, Dura-Europos in the west and Tikrit in the south.

  • Hebrew: 134 code points

    134 characters are encoded in this script.

    The Hebrew alphabet, known variously by scholars as the Ktav Ashuri, Jewish script, square script and block script, is traditionally an abjad script used in the writing of the Hebrew language and other Jewish languages, most notably Yiddish, Ladino, Judeo-Arabic, and Judeo-Persian. In modern Hebrew, vowels are increasingly introduced. It is also used informally in Israel to write Levantine Arabic, especially among Druze. It is an offshoot of the Imperial Aramaic alphabet, which flourished during the Achaemenid Empire and which itself derives from the Phoenician alphabet.

  • Hiragana: 433 code points

    433 characters are encoded in this script.

    Hiragana is a Japanese syllabary, part of the Japanese writing system, along with katakana as well as kanji.

  • Anatolian Hieroglyphs: 583 code points

    583 characters are encoded in this script.

    Anatolian hieroglyphs are an indigenous logographic script native to central Anatolia, consisting of some 500 signs. They were once commonly known as Hittite hieroglyphs, but the language they encode proved to be Luwian, not Hittite, and the term Luwian hieroglyphs is used in English publications. They are typologically similar to Egyptian hieroglyphs, but do not derive graphically from that script, and they are not known to have played the sacred role of hieroglyphs in Egypt. There is no demonstrable connection to Hittite cuneiform.

  • Pahawh Hmong: 127 code points

    127 characters are encoded in this script.

    Pahawh Hmong is an indigenous semi-syllabic script, invented in 1959 by Shong Lue Yang, to write two Hmong languages, Hmong Daw (Hmoob Dawb White Miao) and Hmong Njua AKA Hmong Leng (Moob Leeg Green Miao).

  • Nyiakeng Puachue Hmong: 71 code points

    71 characters are encoded in this script.

    Nyiakeng Puachue Hmong is an alphabet script devised for White Hmong and Green Hmong in the 1980s by Reverend Chervang Kong for use within his United Christians Liberty Evangelical Church. The church, which moved around California, Minnesota, Wisconsin, North Carolina, Colorado, and many other states, has used the script in printed material and videos. It is reported to have some use in Laos, Thailand, Vietnam, France, and Australia.

  • Katakana Or Hiragana: 0 code points

    0 characters are encoded in this script.

    Kana are syllabaries used to write Japanese phonological units, morae. Such syllabaries include (1) the original kana, or magana , which were Chinese characters (kanji) used phonetically to transcribe Japanese, the most prominent magana system being man'yōgana (万葉仮名); the two descendants of man'yōgana, (2) hiragana , and (3) katakana . There are also hentaigana , which are historical variants of the now-standard hiragana. In current usage, 'kana' can simply mean hiragana and katakana.

  • Old Hungarian: 108 code points

    108 characters are encoded in this script.

    The Old Hungarian script or Hungarian runes is an alphabetic writing system used for writing the Hungarian language. Modern Hungarian is written using the Latin-based Hungarian alphabet. The term "old" refers to the historical priority of the script compared with the Latin-based one. The Old Hungarian script is a child system of the Old Turkic alphabet.

  • Indus: 0 code points

    0 characters are encoded in this script.

    The Indus script, also known as the Harappan script, is a corpus of symbols produced by the Indus Valley Civilisation. Most inscriptions containing these symbols are extremely short, making it difficult to judge whether or not they constituted a writing system used to record the as-yet unidentified language(s) of the Indus Valley Civilisation. Despite many attempts, the 'script' has not yet been deciphered, but efforts are ongoing. There is no known bilingual inscription to help decipher the script, and the script shows no significant changes over time. However, some of the syntax varies depending upon location.

  • Old Italic: 39 code points

    39 characters are encoded in this script.

    The Old Italic scripts are a family of ancient writing systems used in the Italian Peninsula between about 700 and 100 BC, for various languages spoken in that time and place. The most notable member is the Etruscan alphabet, which was the immediate ancestor of the Latin alphabet used by more than 100 languages today, including English. The runic alphabets used in northern Europe are believed to have been separately derived from one of these alphabets by the 2nd century AD.

  • Javanese: 91 code points

    91 characters are encoded in this script.

    The Javanese script is one of Indonesia's traditional scripts developed on the island of Java. The script is primarily used to write the Javanese language, but in the course of its development has also been used to write several other regional languages such as Sundanese, Madurese, and Sasak; the lingua franca of the region, Malay; as well as the historical languages Kawi and Sanskrit. Javanese script was actively used by the Javanese people for writing day-to-day and literary texts from at least the mid-15th century CE until the mid-20th century CE, before its function was gradually supplanted by the Latin alphabet. Today the script is taught in DI Yogyakarta, Central Java, and the East Java Province as part of the local curriculum, but with very limited function in everyday use.

  • Japanese: 0 code points

    0 characters are encoded in this script.

    The modern Japanese writing system uses a combination of logographic kanji, which are adopted Chinese characters, and syllabic kana. Kana itself consists of a pair of syllabaries: hiragana, used primarily for native or naturalised Japanese words and grammatical elements; and katakana, used primarily for foreign words and names, loanwords, onomatopoeia, scientific names, and sometimes for emphasis. Almost all written Japanese sentences contain a mixture of kanji and kana. Because of this mixture of scripts, in addition to a large inventory of kanji characters, the Japanese writing system is considered to be one of the most complicated currently in use.

  • Jurchen: 0 code points

    0 characters are encoded in this script.

    The Jurchen script was the writing system used to write the Jurchen language, the language of the Jurchen people who created the Jin Empire in northeastern China in the 12th–13th centuries. It was derived from the Khitan script, which in turn was derived from Chinese. The script has only been decoded to a small extent.

  • Kayah Li: 48 code points

    48 characters are encoded in this script.

    The Kayah Li alphabet is used to write the Kayah languages Eastern Kayah Li and Western Kayah Li, which are members of Karenic branch of the Sino-Tibetan language family. They are also known as Red Karen and Karenni. Eastern Kayah Li is spoken by about 26,000 people, and Western Kayah Li by about 100,000 people, mostly in the Kayah and Karen states of Myanmar, but also by people living in Thailand.

  • Katakana: 373 code points

    373 characters are encoded in this script.

    Katakana is a Japanese syllabary, one component of the Japanese writing system along with hiragana, kanji and in some cases the Latin script.

  • Kawi: 86 code points

    86 characters are encoded in this script.

    The Kawi or Old Javanese script is a Brahmic script found primarily in Java and used across much of Maritime Southeast Asia between the 8th century and the 16th century. The script is an abugida meaning that characters are read with an inherent vowel. Diacritics are used, either to suppress the vowel and represent a pure consonant, or to represent other vowels.

  • Kharoshthi: 68 code points

    68 characters are encoded in this script.

    The Kharoṣṭhī script, also known as the Gāndhārī script, was an ancient Indo-Iranian script used by various peoples from the north-western outskirts of the Indian subcontinent to Central Asia via Afghanistan. An abugida, it was introduced at least by the middle of the 3rd century BCE, possibly during the 4th century BCE, and remained in use until it died out in its homeland around the 3rd century CE.

  • Khmer: 146 code points

    146 characters are encoded in this script.

    Khmer script is an abugida (alphasyllabary) script used to write the Khmer language, the official language of Cambodia. It is also used to write Pali in the Buddhist liturgy of Cambodia and Thailand.

  • Khojki: 85 code points

    85 characters are encoded in this script.

    Khojkī, Khojakī, or Khwājā Sindhī, is a script used formerly and almost exclusively by the Khoja community of parts of the Indian subcontinent, including Sindh, Gujarat, and Punjab. However, this script also had a further reach and was used by members of Ismaili communities from Burma to East and South Africa. The name "Khojki" is likely derived from the Persian word khoja, which means "master", or "lord".

  • Khitan Large: 0 code points

    0 characters are encoded in this script.

    The Khitan large script was one of two writing systems used for the now-extinct Khitan language. It was used during the 10th–12th centuries by the Khitan people, who had created the Liao Empire in north-eastern China. In addition to the large script, the Khitans simultaneously also used a functionally independent writing system known as the Khitan small script. Both Khitan scripts continued to be in use to some extent by the Jurchens for several decades after the fall of the Liao dynasty, until the Jurchens fully switched to a script of their own. Examples of the scripts appeared most often on epitaphs and monuments, although other fragments sometimes surface.

  • Khitan Small: 471 code points

    471 characters are encoded in this script.

    The Khitan small script was one of two writing systems used for the now-extinct Khitan language. It was used during the 10th–12th century by the Khitan people, who had created the Liao Empire in present-day northeastern China. In addition to the small script, the Khitans simultaneously also used a functionally independent writing system known as the Khitan large script. Both Khitan scripts continued to be in use to some extent by the Jurchens for several decades after the fall of the Liao dynasty, until the Jurchens fully switched to a script of their own. Examples of the scripts appeared most often on epitaphs and monuments, although other fragments sometimes surface.

  • Kannada: 106 code points

    106 characters are encoded in this script.

    The Kannada script is an abugida of the Brahmic family, used to write Kannada, one of the Dravidian languages of South India especially in the state of Karnataka. It is one of the official scripts of the Indian Republic. Kannada script is also widely used for writing Sanskrit texts in Karnataka. Several minor languages, such as Tulu, Konkani, Kodava, Sanketi and Beary, also use alphabets based on the Kannada script. The Kannada and Telugu scripts share very high mutual intellegibility with each other, and are often considered to be regional variants of single script. Other scripts similar to Kannada script are Sinhala script, and Old Peguan script (used in Burma).

  • Korean Mixed: 0 code points

    0 characters are encoded in this script.

    Korean mixed script is a form of writing the Korean language that uses a mixture of the Korean alphabet or Hangul and Hanja, the Korean name for Chinese characters. The distribution on how to write words usually follows that all native Korean words, including suffixes, particles, and honorific markers are generally written in hangul and never in hanja. Sino-Korean vocabulary or hanja-eo, either words borrowed from Chinese or created from Sino-Korean roots, were generally always written in hanja, although very rare or complex characters were often substituted with hangul. Although the Korean alphabet was introduced and taught to people beginning in 1446, most literature until the early twentieth century was written in literary Chinese known as hanmun.

  • Kpelle: 0 code points

    0 characters are encoded in this script.

    The Kpelle syllabary was invented c. 1935 by Chief Gbili of Sanoyie, Liberia. It was intended for writing the Kpelle language, a member of the Mande group of Niger-Congo languages spoken by about 490,000 people in Liberia and around 300,000 people in Guinea at that time.

  • Kaithi: 88 code points

    88 characters are encoded in this script.

    Kaithi, also called Kayathi or Kayasthi, is a historical Brahmic script that was used widely in parts of Northern and Eastern India, primarily in the present-day states of Uttar Pradesh, Jharkhand and Bihar. In particular, it was used for writing legal, administrative and private records. It was used for a variety of Indo-Aryan languages, including Awadhi, Bhojpuri, Hindustani, Magahi, and Nagpuri.

  • Tai Tham: 127 code points

    127 characters are encoded in this script.

    Tai Tham script is an abugida writing system used mainly for a group of Southwestern Tai languages i.e., Northern Thai, Tai Lü, Khün and Lao; as well as the liturgical languages of Buddhism i.e., Pali and Sanskrit. It is historically known as Tua Tham. In Thailand and Myanmar, the script is often referred to as Lanna script in relation to the historical kingdom of Lan Na situating in the Northern region of modern day Thailand and a part of Shan state in Myanmar. Local people in Northern Thailand also call the script as Tua Mueang in parallel to Kam Mueang, a local name for Northern Thai language. In Laos and Isan region of Thailand, a variation of Tai Tham script, often dubbed Lao Tham, is also known by the locals as To Tham Lao or Yuan script. Tai Tham script is traditionally written on a dried palm leaf as a palm-leaf manuscript.

  • Lao: 83 code points

    83 characters are encoded in this script.

    Lao script or Akson Lao is the primary script used to write the Lao language and other minority languages in Laos. Its earlier form, the Tai Noi script, was also used to write the Isan language, but was replaced by the Thai script. It has 27 consonants, 7 consonantal ligatures, 33 vowels, and 4 tone marks.

  • Latin Fraktur: 0 code points

    0 characters are encoded in this script.

    Fraktur is a calligraphic hand of the Latin alphabet and any of several blackletter typefaces derived from this hand. Letters are designed such that the individual strokes are broken apart; in this way it is often contrasted with the curves of the Antiqua (common) typefaces where the letters are designed to flow and strokes connect together in a continuous fashion. The word "Fraktur" derives from Latin frāctūra, built from frāctus, passive participle of frangere, the same root as the English word "fracture".

  • Latin Gaelic: 0 code points

    0 characters are encoded in this script.

    Gaelic type is a family of Insular script typefaces devised for printing Classical Gaelic. It was widely used from the 16th century until the mid-18th century in Scotland and the mid-20th century in Ireland, but is now rarely used. Sometimes, all Gaelic typefaces are called Celtic or uncial although most Gaelic types are not uncials. The "Anglo-Saxon" types of the 17th century are included in this category because both the Anglo-Saxon types and the Gaelic/Irish types derive from the insular manuscript hand.

  • Latin: 1510 code points

    1510 characters are encoded in this script.

    The Latin script, also known as the Roman script, is an alphabetic writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae, in southern Italy. The Greek alphabet was adopted by the Etruscans, and subsequently their alphabet was adopted by the Romans. Several Latin-script alphabets exist, which differ in graphemes, collation and phonetic values from the classical Latin alphabet.

  • Leke: 0 code points

    0 characters are encoded in this script.

    The Leke script, previously known as Karen Chicken Scratch script, is an abugida used to write the Pwo Karen language and Sgaw language in Burma (Myanmar) and Thailand. It has 25 consonants, 17 vowels and 3 tones. The script also has a unique set of numerals and punctuation, such as a full stop (period) and a comma.

  • Lepcha: 74 code points

    74 characters are encoded in this script.

    The Lepcha script, or Róng script, is an abugida used by the Lepcha people to write the Lepcha language. Unusually for an abugida, syllable-final consonants are written as diacritics.

  • Limbu: 69 code points

    69 characters are encoded in this script.

    The Limbu script is used to write the Limbu language. It is a Brahmic type abugida.

  • Linear A: 386 code points

    386 characters are encoded in this script.

    Linear A is a writing system that was used by the Minoans of Crete from 1800 to 1450 BC to write the hypothesized Minoan language or languages. Linear A was the primary script used in palace and religious writings of the Minoan civilization. It was succeeded by Linear B, which was used by the Mycenaeans to write an early form of Greek. It was discovered by archaeologist Sir Arthur Evans. No texts in Linear A have yet been deciphered.

  • Linear B: 268 code points

    268 characters are encoded in this script.

    Linear B is a syllabic script that was used for writing in Mycenaean Greek, the earliest attested form of the Greek language. The script predates the Greek alphabet by several centuries. The oldest Mycenaean writing dates to about 1400 BC. It is hypothesised to be descended from the older Linear A, an undeciphered earlier script potentially used for writing the Minoan language, as is the later Cypriot syllabary, which also recorded Greek. Linear B, found mainly in the palace archives at Knossos, Kydonia, Pylos, Thebes and Mycenae, disappeared with the fall of Mycenaean civilization during the Late Bronze Age collapse. The succeeding period, known as the Greek Dark Ages, provides no evidence of the use of writing.

  • Lisu: 49 code points

    49 characters are encoded in this script.

    The Fraser or Old Lisu script is an artificial abugida invented around 1915 by Sara Ba Thaw, a Karen preacher from Myanmar and improved by the missionary James O. Fraser, to write the Lisu language. It is a single-case (unicameral) alphabet. It was also used for the Naxi language, e.g. the 1932 Naxi Gospel of Mark and used in the Zaiwa or Atsi language e.g. the 1938 Atsi Gospel of Mark.

  • Lycian: 29 code points

    29 characters are encoded in this script.

    The Lycian alphabet was used to write the Lycian language of the Asia Minor region of Lycia. It was an extension of the Greek alphabet, with half a dozen additional letters for sounds not found in Greek. It was largely similar to the Lydian and the Phrygian alphabets.

  • Lydian: 27 code points

    27 characters are encoded in this script.

    Lydian script was used to write the Lydian language. Like other scripts of Anatolia in the Iron Age, the Lydian alphabet is based on the Phoenician alphabet. It is related to the East Greek alphabet, but it has unique features.

  • Mahajani: 61 code points

    61 characters are encoded in this script.

    Mahajani is a Laṇḍā mercantile script that was historically used in northern India for writing accounts and financial records in Marwari, Hindi and Punjabi. It is a Brahmic script and is written left-to-right. Mahajani refers to the Hindi word for 'bankers', also known as 'sarrafi' or 'kothival' (merchant).

  • Makasar: 25 code points

    25 characters are encoded in this script.

    The Makasar script, also known as Ukiri' Jangang-jangang or Old Makasar script, is a historical Indonesian Writing system that was used in South Sulawesi to write the Makassarese language between the 17th and 19th centuries until it was supplanted by the Lontara Bugis script.

  • Mandaic: 30 code points

    30 characters are encoded in this script.

    The Mandaic alphabet is thought to have evolved between the second and seventh century CE from either a cursive form of Aramaic or from Inscriptional Parthian. The exact roots of the script are difficult to determine. It was developed by members of the Mandaean faith of Lower Mesopotamia to write the Mandaic language for liturgical purposes. Classical Mandaic and its descendant Neo-Mandaic are still in limited use. The script has changed very little over centuries of use.

  • Manichaean: 52 code points

    52 characters are encoded in this script.

    The Manichaean script is an abjad-based writing system rooted in the Semitic family of alphabets and associated with the spread of Manichaeism from southwest to central Asia and beyond, beginning in the third century CE. It bears a sibling relationship to early forms of the Pahlavi scripts, both systems having developed from the Imperial Aramaic alphabet, in which the Achaemenid court rendered its particular, official dialect of Aramaic. Unlike Pahlavi, the Manichaean script reveals influences from the Sogdian alphabet, which in turn descends from the Syriac branch of Aramaic. The Manichaean script is so named because Manichaean texts attribute its design to Mani himself. Middle Persian is written with this alphabet.

  • Marchen: 68 code points

    68 characters are encoded in this script.

    The Marchen script was a Brahmic abugida which was used for writing the extinct Zhangzhung language. It was derived from the Tibetan script.

  • Maya: 0 code points

    0 characters are encoded in this script.

    Maya script, also known as Maya glyphs, is historically the native writing system of the Maya civilization of Mesoamerica and is the only Mesoamerican writing system that has been substantially deciphered. The earliest inscriptions found which are identifiably Maya date to the 3rd century BCE in San Bartolo, Guatemala. Maya writing was in continuous use throughout Mesoamerica until the Spanish conquest of the Maya in the 16th and 17th centuries.

  • Medefaidrin: 91 code points

    91 characters are encoded in this script.

  • Mende Kikakui: 213 code points

    213 characters are encoded in this script.

    The Mende Kikakui script is a syllabary used for writing the Mende language of Sierra Leone.

  • Meroitic Cursive: 90 code points

    90 characters are encoded in this script.

  • Meroitic Hieroglyphs: 32 code points

    32 characters are encoded in this script.

  • Malayalam: 127 code points

    127 characters are encoded in this script.

    Malayalam script is a Brahmic script used commonly to write Malayalam, which is the principal language of Kerala, India, spoken by 45 million people in the world. It is a Dravidian language spoken in the Indian state of Kerala and the union territories of Lakshadweep and Puducherry by the Malayali people. It is one of the official scripts of the Indian Republic. Malayalam script is also widely used for writing Sanskrit texts in Kerala.

  • Modi: 89 code points

    89 characters are encoded in this script.

    Modi is a script used to write the Marathi language, which is the primary language spoken in the state of Maharashtra, India. There are multiple theories concerning its origin. The Modi script was used alongside the Devanagari script to write Marathi until the 20th century when the Balbodh style of the Devanagari script was promoted as the standard writing system for Marathi.

  • Mongolian: 172 code points

    172 characters are encoded in this script.

    The classical or traditional Mongolian script, also known as the Hudum Mongol bichig, was the first writing system created specifically for the Mongolian language, and was the most widespread until the introduction of Cyrillic in 1946. It is traditionally written in vertical lines Top-Down, right across the page. Derived from the Old Uyghur alphabet, it is a true alphabet, with separate letters for consonants and vowels. It has been adapted for such languages as Oirat and Manchu. Alphabets based on this classical vertical script continue to be used in Mongolia and Inner Mongolia to write Mongolian, Xibe and, experimentally, Evenki.

  • Moon System: 0 code points

    0 characters are encoded in this script.

    The Moon System of Embossed Reading is a writing system for the blind, using embossed symbols mostly derived from the Latin script. It is claimed by its supporters to be easier to understand than braille, though it is mainly used by people who have lost their sight as adults, and thus already have knowledge of the shapes of letters.

  • Mro: 43 code points

    43 characters are encoded in this script.

  • Meetei Mayek: 79 code points

    79 characters are encoded in this script.

    The Meitei script, also known as the Meetei script, is an abugida used for the Meitei language, the official language of Manipur state and one of the 22 official languages of India. It is one of the official scripts of the Indian Republic. It is also popularly known as the Kanglei script and the Kok Sam Lai script. Its earliest known evidence of existence dates back to the 6th century AD coins, engraving the Meitei letters, as verified by the various publications of the National Sahitya Akademi. It was used until the 18th century, when it was replaced by the Bengali alphabet. A few manuscripts survive. In the 20th century, the script has experienced a resurgence, and is again being used. Starting from 2021, Meitei script was officially used by the Government of Manipur, along with the Bengali-Assamese script, to write the Meitei language, as per "The Manipur Official Language (Amendment) Act, 2021".

  • Multani: 48 code points

    48 characters are encoded in this script.

    Multani is a Brahmic script originating in the Multan region of Punjab and in northern Sindh, Pakistan. It was used to write Saraiki, often considered a dialect of Lahnda group of languages. The script was used for routine writing and commercial activities. Multani is one of four Landa scripts whose usage was extended beyond the mercantile domain and formalized for literary activity and printing; the others being Gurmukhi, Khojki, and Khudabadi. Although Multani is now obsolete, it is a historical script in which written and printed records exist. It was also known as Karikki and as Sarai.

  • Myanmar: 224 code points

    224 characters are encoded in this script.

    The Mon–Burmese script (မွန်မြန်မာအက္ခရာ) is an abugida that derives from the Pallava Grantha script of southern India and later of Southeast Asia. It is the basis of the alphabets used for modern Burmese, Mon, Shan, Rakhine, Jingpho and Karen.

  • Nag Mundari: 42 code points

    42 characters are encoded in this script.

    Mundari Bani is the writing system created for the Mundari language, spoken in eastern India. Mundari is an Austroasiatic language. Mundari Bani has 27 letters and five diacritics, the forms of which are intended to evoke natural shapes. The script is written from left to right.

  • Nandinagari: 86 code points

    86 characters are encoded in this script.

    Nandināgarī is a Brahmic script derived from the Nāgarī script which appeared in the 7th century AD. This script and its variants were used in the central Deccan region and south India, and an abundance of Sanskrit manuscripts in Nandināgarī have been discovered but remain untransliterated. Some of the discovered manuscripts of Madhvacharya of the Dvaita Vedanta school of Hinduism are in Nandināgarī script.

  • Ancient North Arabian: 32 code points

    32 characters are encoded in this script.

    Ancient North Arabian (ANA) is a collection of scripts and possibly a language or family of languages related to Old Arabic that were used in north and central Arabia and south Syria from the 8th century BCE to the 4th century CE. The term "Ancient North Arabian" is defined negatively. It refers to all of the South Semitic scripts except Ancient South Arabian (ASA) regardless of their genetic relationships.

  • Nabataean: 40 code points

    40 characters are encoded in this script.

    The Nabataean script is an abjad that was used to write Nabataean Aramaic and Nabataean Arabic from the second century BC onwards. Important inscriptions are found in Petra, the Sinai Peninsula, and other archaeological sites including Abdah and Mada'in Saleh in Saudi Arabia.

  • Prachalit: 97 code points

    97 characters are encoded in this script.

    Prachalit is a type of abugida script developed from the Nepalese scripts, which are a part of the family of Brahmic scripts descended from Brahmi script. It is used to write Nepal Bhasa, Sanskrit and Pali. Various publications are still published in this script including the Sikkim Herald the bulletin of the Sikkim government.

  • Dongba: 0 code points

    0 characters are encoded in this script.

    The Dongba, Tomba or Tompa or Mo-so symbols are a system of pictographic glyphs used by the ²dto¹mba of the Naxi people in southern China. In the Naxi language it is called ²ss ³dgyu 'wood records' or ²lv ³dgyu 'stone records'. The first artifacts with this script on them originate from approximately 30 AD.

  • Geba: 0 code points

    0 characters are encoded in this script.

    Geba is a syllabic script for the Naxi language. It is called ¹Ggo¹baw in Naxi, adapted as Geba, 哥巴, in Chinese. Some glyphs resemble the Yi script, and some appear to be adaptations of Chinese characters. Geba is used only to transcribe mantras, and there are few texts, though it is sometimes used to annotate dongba pictographs. Geba's phonetics can vary depending on who is using it. Symbols do not have fixed phonetic values, and they may have the same phonetic values as well.

  • Nko: 67 code points

    67 characters are encoded in this script.

    N'Ko is an alphabetic script devised by Solomana Kanté in 1949, as a modern writing system for the Manding languages of West Africa. The term N'Ko, which means I say in all Manding languages, is also used for the Manding literary standard written in the N'Ko script.

  • Nushu: 397 code points

    397 characters are encoded in this script.

    Nüshu is a syllabic script derived from Chinese characters that was used exclusively among women in Jiangyong County in Hunan province of southern China before going extinct in the 20th century.

  • Ogham: 29 code points

    29 characters are encoded in this script.

    Ogham is an Early Medieval alphabet used primarily to write the early Irish language, and later the Old Irish language. There are roughly 400 surviving orthodox inscriptions on stone monuments throughout Ireland and western Britain, the bulk of which are in southern Munster. The largest number outside Ireland are in Pembrokeshire, Wales.

  • Ol Chiki: 48 code points

    48 characters are encoded in this script.

    The Ol Chiki script, also known as Ol Chemetʼ, Ol Ciki, Ol, and sometimes as the Santali alphabet invented by Pandit Raghunath Murmu in the year 1925, is the official writing system for Santali, an Austroasiatic language recognized as an official regional language in India. It is one of the official scripts of the Indian Republic. It has 30 letters, the design of which is intended to evoke natural shapes. The script is written from left to right, and has two styles. Unicode does not maintain a distinction between these two, as is typical for print and cursive variants of a script. In both styles, the script is unicameral.

    The shapes of the letters are not arbitrary, but reflect the names for the letters, which are words, usually the names of objects or actions representing conventionalized form in the pictorial shape of the characters.

  • Old Turkic: 73 code points

    73 characters are encoded in this script.

    The Old Turkic script was the alphabet used by the Göktürks and other early Turkic khanates from the 8th to 10th centuries to record the Old Turkic language.

  • Oriya: 97 code points

    97 characters are encoded in this script.

    The Odia script is a Brahmic script used to write primarily Odia language and others including Sanskrit and other regional languages. It is one of the official scripts of the Indian Republic. The script has developed over more than 1000 years from a variant of Siddhaṃ script which was used in Eastern India, where the characteristic top line transformed into a distinct round umbrella shape due to the influence of palm leaf manuscripts and also being influenced by the neighbouring scripts from the Western and Southern regions.

  • Osage: 72 code points

    72 characters are encoded in this script.

    The Osage script is a new script promulgated in 2006 and revised 2012–2014 for the Osage language. Because Latin orthographies were subject to interference from English conventions among Osage students who were more familiar with English than with Osage, in 2006 the director of the Osage Language Program, Herman Mongrain Lookout, decided to create a distinct script by modifying or fusing Latin letters. This Osage script has been in regular use on the Osage Nation ever since.

  • Osmanya: 40 code points

    40 characters are encoded in this script.

    The Osmanya alphabet, also known as Far Soomaali and, in Arabic, as al-kitābah al-ʿuthmānīyah, is an alphabetic script created to transcribe the Somali language. It was invented between 1920 and 1922 by Osman Yusuf Kenadid, the son of Sultan Yusuf Ali Kenadid and brother of Sultan Ali Yusuf Kenadid of the Sultanate of Hobyo.

  • Old Uyghur: 28 code points

    28 characters are encoded in this script.

    The Old Uyghur alphabet was a Turkic script used for writing the Old Uyghur, a variety of Old Turkic spoken in Turpan and Gansu that is the ancestor of the modern Western Yugur language. The term "Old Uyghur" used for this alphabet is misleading because Qocho, the Uyghur (Yugur) kingdom created in 843, originally used the Old Turkic alphabet. The Uyghur adopted this "Old Uyghur" script from local inhabitants when they migrated into Turfan after 840. It was an adaptation of the Aramaic alphabet used for texts with Buddhist, Manichaean and Christian content for 700–800 years in Turpan. The last known manuscripts are dated to the 18th century. This was the prototype for the Mongolian and Manchu alphabets. The Old Uyghur alphabet was brought to Mongolia by Tata-tonga.

  • Palmyrene: 32 code points

    32 characters are encoded in this script.

    The Palmyrene alphabet was a historical Semitic alphabet used to write Palmyrene Aramaic. It was used between 100 BCE and 300 CE in Palmyra in the Syrian desert. The oldest surviving Palmyrene inscription dates to 44 BCE. The last surviving inscription dates to 274 CE, two years after Palmyra was sacked by Roman Emperor Aurelian, ending the Palmyrene Empire. Use of the Palmyrene language and script declined, being replaced with Greek and Latin.

  • Pau Cin Hau: 57 code points

    57 characters are encoded in this script.

    The Pau Cin Hau scripts, known as Pau Cin Hau lai, or Zo tual lai in Zomi, are two scripts, a logographic script and an alphabetic script created by Pau Cin Hau, a Zomi religious leader from Chin State, Burma. The logographic script consists of 1,050 characters, which is a traditionally significant number based on the number of characters appearing in a religious text. The alphabetic script is a simplified script of 57 characters, which is divided into 21 consonants, 7 vowels, 9 final consonants, and 20 tone, length, and glottal marks. The original script was produced in 1902, but it is thought to have undergone at least two revisions, of which the first revision produced the logographic script.

  • Protoelamite: 0 code points

    0 characters are encoded in this script.

    The Proto-Elamite script is an Early Bronze Age writing system briefly in use before the introduction of Elamite cuneiform.

  • Old Permic: 44 code points

    44 characters are encoded in this script.

    The Old Permic script, sometimes known by its initial 2 characters as Abur or Anbur, is a "highly idiosyncratic adaptation" of the Cyrillic script once used to write medieval Komi.

  • Phags Pa: 59 code points

    59 characters are encoded in this script.

    The Phagspa script or ʼPhags-pa script is an alphabet designed by the Tibetan monk and State Preceptor Drogön Chögyal Phagpa for Kublai Khan, the founder of the Yuan dynasty, as a unified script for the written languages within the Yuan. The actual use of this script was limited to about a hundred years during the Mongol-led Yuan dynasty, and it fell out of use with the advent of the Ming dynasty.

  • Inscriptional Pahlavi: 27 code points

    27 characters are encoded in this script.

    Inscriptional Pahlavi is the earliest attested form of Pahlavi scripts, and is evident in clay fragments that have been dated to the reign of Mithridates I. Other early evidence includes the Pahlavi inscriptions of Arsacid era coins and rock inscriptions of Sassanid kings and other notables such as Kartir.

  • Psalter Pahlavi: 30 code points

    30 characters are encoded in this script.

    Psalter Pahlavi is a cursive abjad that was used for writing Middle Persian on paper; it is thus described as one of the Pahlavi scripts. It was written right to left, usually with spaces between words.

  • Book Pahlavi: 0 code points

    0 characters are encoded in this script.

  • Phoenician: 29 code points

    29 characters are encoded in this script.

    The Phoenician alphabet is an alphabet known in modern times from the Canaanite and Aramaic inscriptions found across the Mediterranean region. The name comes from the Phoenician civilization.

  • Miao: 149 code points

    149 characters are encoded in this script.

    The Pollard script, also known as Pollard Miao or Miao, is an abugida loosely based on the Latin alphabet and invented by Methodist missionary Sam Pollard. Pollard invented the script for use with A-Hmao, one of several Miao languages spoken in southeast Asia. The script underwent a series of revisions until 1936, when a translation of the New Testament was published using it.

  • Inscriptional Parthian: 30 code points

    30 characters are encoded in this script.

    Inscriptional Parthian is a script used to write Parthian language on coins of Parthia from the time of Arsaces I of Parthia. It was also used for inscriptions of Parthian and later Sassanian periods.

  • Protosinaitic: 0 code points

    0 characters are encoded in this script.

    Proto-Sinaitic is found in a small corpus of c. 40 inscriptions and fragments, the vast majority from Serabit el-Khadim in the Sinai Peninsula, dating to the Middle Bronze Age. They are considered the earliest trace of alphabetic writing and the common ancestor of both the Ancient South Arabian script and the Phoenician alphabet, which led to many modern alphabets including the Greek alphabet. According to common theory, Canaanites or Hyksos who spoke a Semitic language repurposed Egyptian hieroglyphs to construct a different script.

  • Ranjana: 0 code points

    0 characters are encoded in this script.

    The Rañjanā script (Lantsa) is an abugida writing system which developed in the 11th century and until the mid-20th century was used in an area from Nepal to Tibet by the Newar people, the historic inhabitants of the Kathmandu Valley, to write Sanskrit and Newar. Nowadays it is also used in Buddhist monasteries in India; China, especially in the Tibetan Buddhist areas within the Tibet Autonomous Region, Sichuan, Yunnan, Qinghai and Gansu; Mongolia, and Japan. It is normally written from left to right but the Kutakshar form is written from top to bottom. It is also considered to be the standard Nepali calligraphic script.

  • Rejang: 37 code points

    37 characters are encoded in this script.

    The Rejang alphabet is an abugida of the Brahmic family that is related to other scripts of the region, such as the Batak and Lontara scripts. Rejang is also a member of the closely related group of Ulu scripts that include the script variants of South Sumatra, Bengkulu, Lembak, Lintang, Lebong, and Serawai. Other closely related scripts that are sometimes included in the Surat Ulu group include the Ogan, Kerinci, and Lampung scripts. The script was in use prior to the introduction of Islam to the Rejang area; the earliest attested document appears to date from the mid-18th century CE. The Rejang script is sometimes also known as the KaGaNga script following the first three letters of the alphabet. The term KaGaNga was never used by the users of the script community, but it was coined by the British anthropologist Mervyn A. Jaspan (1926–1975) in his book Folk literature of South Sumatra. Redjang Ka-Ga-Nga texts. Canberra, The Australian National University 1964.

  • Hanifi Rohingya: 55 code points

    55 characters are encoded in this script.

    The Hanifi Rohingya script is a unified script for the Rohingya language. Rohingya today is written in three scripts, Hanafi, Arabic, and Latin (Rohingyalish). The Rohingya language was first written in the 19th century with a version of the Perso-Arabic script. In 1975, an orthographic Arabic script was developed and approved by the community leaders, based on the Urdu alphabet but with unique innovations to make the script suitable to Rohingya.

  • Rongorongo: 0 code points

    0 characters are encoded in this script.

    Rongorongo is a system of glyphs discovered in the 19th century on Rapa Nui that appears to be writing or proto-writing. Numerous attempts at decipherment have been made, with none being successful. Although some calendrical and what might prove to be genealogical information has been identified, none of these glyphs can actually be read. If rongorongo does prove to be writing and proves to be an independent invention, it would be one of very few independent inventions of writing in human history.

  • Runic: 86 code points

    86 characters are encoded in this script.

    A rune is a letter in a set of related alphabets known as runic alphabets native to the Germanic peoples. Runes were used to write Germanic languages before they adopted the Latin alphabet, and for specialised purposes thereafter. In addition to representing a sound value, runes can be used to represent the concepts after which they are named (ideographs). Scholars refer to instances of the latter as Begriffsrunen. The Scandinavian variants are also known as futhark or fuþark ; the Anglo-Saxon variant is futhorc or fuþorc.

  • Samaritan: 61 code points

    61 characters are encoded in this script.

    The Samaritan script is used by the Samaritans for religious writings, including the Samaritan Pentateuch, writings in Samaritan Hebrew, and for commentaries and translations in Samaritan Aramaic and occasionally Arabic.

  • Sarati: 0 code points

    0 characters are encoded in this script.

    Sarati is an artificial script, one of several scripts created by J. R. R. Tolkien. According to Tolkien's mythology, the Sarati alphabet was invented by the Elf Rúmil of Tirion.

  • Old South Arabian: 32 code points

    32 characters are encoded in this script.

    The Ancient South Arabian script branched from the Proto-Sinaitic script in about the late 2nd millennium BCE. It was used for writing the Old South Arabian languages Sabaic, Qatabanic, Hadramautic, Minaean, and Hasaitic, and the Ethiopic language Geʽez in Dʿmt. The earliest instances of the Ancient South Arabian script are painted pottery sherds from Raybun in Hadhramaut in Yemen, which are dated to the late 2nd millennium BCE. There are no letters for vowels, which are marked by matres lectionis.

  • Saurashtra: 82 code points

    82 characters are encoded in this script.

    The Saurashtra script is an abugida script that is used by Saurashtrians of Tamil Nadu to write the Saurashtra language. The script is of Brahmic origin, although its exact derivation is not known which was later reformed and standardized by T.M.Ram Rai. Its usage has declined, and the Tamil and Latin scripts are now used more commonly.

  • Sutton SignWriting: 672 code points

    672 characters are encoded in this script.

    Sutton SignWriting, or simply SignWriting, is a system of writing sign languages. It is highly featural and visually iconic, both in the shapes of the characters, which are abstract pictures of the hands, face, and body, and in their spatial arrangement on the page, which does not follow a sequential order like the letters that make up written English words. It was developed in 1974 by Valerie Sutton, a dancer who had, two years earlier, developed DanceWriting. Some newer standardized forms are known as the International Sign Writing Alphabet (ISWA).

  • Shavian: 48 code points

    48 characters are encoded in this script.

    The Shavian alphabet is an alphabet conceived as a way to provide simple, phonemic orthography for the English language to replace the difficulties of conventional spelling using the Latin alphabet. It was posthumously funded by and named after Irish playwright Bernard Shaw.

  • Sharada: 109 code points

    109 characters are encoded in this script.

    The Śāradā, Sarada or Sharada script is an abugida writing system of the Brahmic family of scripts. The script was widespread between the 8th and 12th centuries in the northwestern parts of Indian Subcontinent, for writing Sanskrit and Kashmiri. Originally more widespread, its use became later restricted to Kashmir, and it is now rarely used except by the Kashmiri Pandit community for religious purposes.

  • Sui: 0 code points

    0 characters are encoded in this script.

    The Sui script or Shuishu, is a logographic writing system with some pictographic characters that can be used to write the Sui language. However, traditionally only shamans were familiar with the writing system, and it is not utilized for everyday use by ordinary Sui people. This system is used for geomancy and divination purposes. There are at least 500 different Sui characters, known as le1 sui3 in the Sui language. According to tradition, these characters were created by ljok8 to2 qong5. Some of these characters are pictorial representations, such as of a bird or a fish, and a few are schematic representations of a characteristic quality, such a snail represented by a drawing of an inward curving spiral. Many of these characters appear to be borrowings from Chinese characters and are written backwards, apparently for increased supernatural power.

  • Siddham: 92 code points

    92 characters are encoded in this script.

    Siddhaṃ, also known in its later evolved form as Siddhamātṛkā, is a medieval Brahmic abugida, derived from the Gupta script and ancestral to the Nāgarī, Assamese, Bengali, Tirhuta, Odia and Nepalese scripts.

  • Khudabadi: 81 code points

    81 characters are encoded in this script.

    Khudabadi was a script used to write the Sindhi language, generally used by some Sindhi Hindus even in the present-day. The script originates from Khudabad, a city in Sindh, and is named after it. It is also known as Hathvanki script. Khudabadi is one of the four scripts used for writing Sindhi, the others being Perso-Arabic, Khojki and Devanagari script. It was used by traders and merchants to record their information and rose to importance as the script began to be used to record information kept secret from other non-Sindhi groups.

  • Sinhala: 114 code points

    114 characters are encoded in this script.

    The Sinhala script, also known as Sinhalese script, is a writing system used by the Sinhalese people and most Sri Lankans in Sri Lanka and elsewhere to write the Sinhala language as well as the liturgical languages Pali and Sanskrit. The Sinhalese Akṣara Mālāva, one of the Brahmic scripts, is a descendant of the Ancient Indian Brahmi script. It is also related to the Grantha script.

  • Sogdian: 43 code points

    43 characters are encoded in this script.

    The Sogdian alphabet was originally used for the Sogdian language, a language in the Iranian family used by the people of Sogdia. The alphabet is derived from Syriac, a descendant script of the Aramaic alphabet. The Sogdian alphabet is one of three scripts used to write the Sogdian language, the others being the Manichaean alphabet and the Syriac alphabet. It was used throughout Central Asia, from the edge of Iran in the west, to China in the east, from approximately 100–1200 A.D.

  • Old Sogdian: 40 code points

    40 characters are encoded in this script.

  • Sora Sompeng: 35 code points

    35 characters are encoded in this script.

    Sorang Sompeng script is used to write in Sora, a Munda language with 300,000 speakers in India. The script was created by Mangei Gomango in 1936 and is used in religious contexts. He was familiar with Odia, Telugu and English.

  • Soyombo: 83 code points

    83 characters are encoded in this script.

    The Soyombo script is an abugida developed by the monk and scholar Zanabazar in 1686 to write Mongolian. It can also be used to write Tibetan and Sanskrit.

  • Sundanese: 72 code points

    72 characters are encoded in this script.

    Standard Sundanese script is a writing system which is used by the Sundanese people. It is built based on Old Sundanese script which was used by the ancient Sundanese from the 14th to the 18th centuries.

  • Sunuwar: 0 code points

    0 characters are encoded in this script.

    The Sunuwar Alphabet is an alphabet developed by Krishna Bahadur Jentich in 1942, to write the Sunwar language, a member of the Kiranti language family spoken in Eastern Nepal, as in Sikkim. It is recognised in Sikkim and used as an official writing system. The alphabet has 33 letters, 10 numerals and 1 'auspicious sign'.

  • Syloti Nagri: 57 code points

    57 characters are encoded in this script.

    Sylheti Nagri or Sylheti Nagari, known in classical manuscripts as Sylhet Nagri amongst many other names, was an Indic script of the Brahmic family. The script was historically used in areas of Bengal and Assam that were east of the Padma, primarily in the eastern part of the Sylhet region, to document Muslim religious poetry known as puthis; having no presence in formal documentations. In the course of the 20th century, it has lost much ground to the standardised Eastern Nagari script. Printing presses for Sylheti Nagri existed as late as into the 1970s, and in the 2000s, a Unicode font was created for the script.

  • Syriac: 107 code points

    107 characters are encoded in this script.

    The Syriac alphabet is a writing system primarily used to write the Syriac language since the 1st century AD. It is one of the Semitic abjads descending from the Aramaic alphabet through the Palmyrene alphabet, and shares similarities with the Phoenician, Hebrew, Arabic and Sogdian, the precursor and a direct ancestor of the traditional Mongolian scripts.

  • Tagbanwa: 20 code points

    20 characters are encoded in this script.

    Tagbanwa is one of the scripts indigenous to the Philippines, used by the Tagbanwa and the Palawan people as their ethnic writing system.

  • Takri: 80 code points

    80 characters are encoded in this script.

    The Tākri script is an abugida writing system of the Brahmic family of scripts. It is derived from the Sharada script formerly employed for Kashmiri. It is the sister script of Laṇḍā scripts. It has another variant Dogra Takri employed in Jammu region. There are numerous varieties present throughout Himachal Pradesh. Until the late 1940s, the adapted version of the script was the official script for writing Dogri in the princely state of Jammu and Kashmir. Throughout the history, different kingdoms of what now forms Himachal Pradesh used their own variety to maintain their records. The Takri script used in the Sirmour in Himachal Pradesh and Jaunsar-Bawar region has some distinction.

  • Tai Le: 45 code points

    45 characters are encoded in this script.

    The Tai Le script, or Dehong Dai script, is a Brahmic script used to write the Tai Nüa language spoken by the Tai Nua people of south-central Yunnan, China. It is written in horizontal lines from left to right, with spaces only between clauses and sentences.

  • New Tai Lue: 83 code points

    83 characters are encoded in this script.

    New Tai Lue script, also known as Xishuangbanna Dai and Simplified Tai Lue, is an abugida used to write the Tai Lü language. Developed in China in the 1950s, New Tai Lue is based on the traditional Tai Tham alphabet developed c. 1200. The government of China promoted the alphabet for use as a replacement for the older script; teaching the script was not mandatory, however, and as a result many are illiterate in New Tai Lue. In addition, communities in Burma, Laos, Thailand and Vietnam still use the Tai Tham alphabet.

  • Tamil: 133 code points

    133 characters are encoded in this script.

    The Tamil script is an abugida script that is used by Tamils and Tamil speakers in India, Sri Lanka, Malaysia, Singapore, Indonesia and elsewhere to write the Tamil language. It is one of the official scripts of the Indian Republic. Certain minority languages such as Saurashtra, Badaga, Irula and Paniya are also written in the Tamil script.

  • Tangut: 6914 code points

    6914 characters are encoded in this script.

    The Tangut script was a logographic writing system, used for writing the extinct Tangut language of the Western Xia dynasty. According to the latest count, 5863 Tangut characters are known, excluding variants. The Tangut characters are similar in appearance to Chinese characters, with the same type of strokes, but the methods of forming characters in the Tangut writing system are significantly different from those of forming Chinese characters. As in Chinese calligraphy, regular, running, cursive and seal scripts were used in Tangut writing.

  • Tai Viet: 72 code points

    72 characters are encoded in this script.

    The Tai Viet script is a Brahmic script used by the Tai Dam people and various other Thai people in Vietnam and Thailand.

  • Telugu: 106 code points

    106 characters are encoded in this script.

    Telugu script, an abugida from the Brahmic family of scripts, is used to write the Telugu language, a Dravidian language spoken in the Indian states of Andhra Pradesh and Telangana as well as several other neighbouring states. It is one of the official scripts of the Indian Republic. The Telugu script is also widely used for writing Sanskrit texts and to some extent the Gondi language. It gained prominence during the Eastern Chalukyas also known as Vengi Chalukya era. It shares extensive similarities with the Kannada script, as both of them evolved from the Bhattiprolu and Kadamba scripts of the Brahmi family. In 2008, the Telugu language was given the status of a Classical Language of India, in recognition of its rich history and heritage.

  • Tengwar: 0 code points

    0 characters are encoded in this script.

    The Tengwar script is an artificial script, one of several scripts created by J. R. R. Tolkien, the author of The Lord of the Rings.

  • Tifinagh: 59 code points

    59 characters are encoded in this script.

    Tifinagh is a script used to write the Berber languages. Tifinagh is descended from the ancient Libyco-Berber alphabet. The traditional Tifinagh, sometimes called Tuareg Tifinagh, is still favored by the Tuareg Berbers of the Sahara desert in southern Algeria, northeastern Mali, northern Niger and northern Burkina Faso for writing the Tuareg Berber language. Neo-Tifinagh is an alphabet developed by Berber Academy to adopt Tuareg Tifinagh for use with Kabyle; it has been since modified for use across North Africa.

  • Tagalog: 25 code points

    25 characters are encoded in this script.

    Baybayin is a Philippine script. The script is an abugida belonging to the family of the Brahmic scripts. Geographically, it was widely used in Luzon and other parts of the Philippines prior to and during the 16th and 17th centuries before being replaced by the Latin alphabet during the period of Spanish colonization. It was used in the Tagalog language and, to a lesser extent, Kapampangan-speaking areas; its use spread to the Ilocanos in the early 17th century. In the 19th and 20th centuries, baybayin survived and evolved into multiple forms—the Tagbanwa script of Palawan, and the Hanuno'o and Buhid scripts of Mindoro—and was used to create the constructed modern Kulitan script of the Kapampangan and the Ibalnan script of the Palawan people. Under the Unicode Standard and ISO 15924, the script is encoded as the Tagalog block.

  • Thaana: 66 code points

    66 characters are encoded in this script.

    Thaana, Tãnaa, Taana or Tāna is the present writing system of the Maldivian language spoken in the Maldives. Thaana has characteristics of both an abugida and a true alphabet, with consonants derived from indigenous and Arabic numerals, and vowels derived from the vowel diacritics of the Arabic abjad. Maldivian orthography in Thaana is largely phonemic.

  • Thai: 86 code points

    86 characters are encoded in this script.

    The Thai script is the abugida used to write Thai, Southern Thai and many other languages spoken in Thailand. The Thai alphabet itself has 44 consonant symbols and 16 vowel symbols that combine into at least 32 vowel forms and four tone diacritics to create characters mostly representing syllables.

  • Tibetan: 207 code points

    207 characters are encoded in this script.

    The Tibetan script is a segmental writing system (abugida) of Indic origin used to write certain Tibetic languages, including Tibetan, Dzongkha, Sikkimese, Ladakhi, Jirel and Balti. It has also been used for some non-Tibetic languages in close cultural contact with Tibet, such as Thakali and Old Turkic. The printed form is called uchen script while the hand-written cursive form used in everyday writing is called umê script. This writing system is used across the Himalayas, and Tibet.

  • Tirhuta: 97 code points

    97 characters are encoded in this script.

    The Tirhuta or Maithili script was the primary historical script for the Maithili language, as well as one of the historical scripts for Sanskrit. It is believed to have originated in the 10th century CE. It is very similar to Bengali–Assamese script, with most consonants being effectively identical in appearance. For the most part, writing in Maithili has switched to the Devanagari script, which is used to write neighbouring Central Indic languages to the west and north such as Hindi and Nepali, and the number of people with a working knowledge of Tirhuta has dropped considerably in recent years.

  • Tangsa: 89 code points

    89 characters are encoded in this script.

  • Toto: 31 code points

    31 characters are encoded in this script.

  • Ugaritic: 31 code points

    31 characters are encoded in this script.

    The Ugaritic writing system is a Cuneiform Abjad with syllabic elements used from around either 1400 BCE or 1300 BCE for Ugaritic, an extinct Northwest Semitic language, and discovered in Ugarit, Syria, in 1928. It has 30 letters. Other languages were occasionally written in the Ugaritic script in the area around Ugarit, although not elsewhere.

  • Vai: 300 code points

    300 characters are encoded in this script.

    The Vai syllabary is a syllabic writing system devised for the Vai language by Momolu Duwalu Bukele of Jondu, in what is now Grand Cape Mount County, Liberia. Bukele is regarded within the Vai community, as well as by most scholars, as the syllabary's inventor and chief promoter when it was first documented in the 1830s. It is one of the two most successful indigenous scripts in West Africa in terms of the number of current users and the availability of literature written in the script, the other being N'Ko.

  • Visible Speech: 0 code points

    0 characters are encoded in this script.

    Visible Speech is a system of phonetic symbols developed by British linguist Alexander Melville Bell in 1867 to represent the position of the speech organs in articulating sounds. Bell was known internationally as a teacher of speech and proper elocution and an author of books on the subject. The system is composed of symbols that show the position and movement of the throat, tongue, and lips as they produce the sounds of language, and it is a type of phonetic notation. The system was used to aid the deaf in learning to speak.

  • Vithkuqi: 70 code points

    70 characters are encoded in this script.

    The Vithkuqi alphabet, also called Büthakukye or Beitha Kukju after the appellation applied to it by German Albanologist Johann Georg von Hahn, was an alphabetic script invented for writing the Albanian language between 1825 and 1845 by Albanian scholar Naum Veqilharxhi.

  • Warang Citi: 84 code points

    84 characters are encoded in this script.

    Warang Citi is a writing system invented by Lako Bodra for the Ho language spoken in East India. It is used in primary and adult education and in various publications.

  • Wancho: 59 code points

    59 characters are encoded in this script.

    Wancho script is an alphabet created between 2001 and 2012 by middle school teacher Banwang Losu in Longding district, Arunachal Pradesh for writing the Wancho language. Letters represent consonants and vowels. Conjunct consonants are not used. Tone is indicated with diacritical marks on vowel letters.

  • Woleai: 0 code points

    0 characters are encoded in this script.

    The Woleai or Caroline Island script, thought to have been a syllabary, was a partially Latin-based script indigenous to Woleai Atoll and nearby islands of Micronesia and used to write the Woleaian language until the mid-20th century. At the time the script was first noticed by Europeans, this part of Micronesia was known as the Caroline Islands, hence the name Caroline Island script.

  • Old Persian: 50 code points

    50 characters are encoded in this script.

    Old Persian cuneiform is a semi-alphabetic cuneiform script that was the primary script for Old Persian. Texts written in this cuneiform have been found in Iran, Armenia, Romania (Gherla), Turkey, and along the Suez Canal. They were mostly inscriptions from the time period of Darius I, such as the DNa inscription, as well as his son, Xerxes I. Later kings down to Artaxerxes III used more recent forms of the language classified as "pre-Middle Persian".

  • Cuneiform: 1234 code points

    1234 characters are encoded in this script.

    Cuneiform proper is a logo-syllabic script that was used to write several languages of the Ancient Near East. The script was in active use from the early Bronze Age until the beginning of the Common Era. Cuneiform scripts in general are marked by and named for the characteristic wedge-shaped impressions which form their signs. Cuneiform is the earliest known writing system and was originally developed to write the Sumerian language of southern Mesopotamia.

  • Yezidi: 60 code points

    60 characters are encoded in this script.

  • Yi: 1246 code points

    1246 characters are encoded in this script.

    The Yi scripts are two scripts used to write the Yi languages; Classical Yi, and the later Yi syllabary. The script is historically known in Chinese as Cuan Wen or Wei Shu and various other names (夷字、倮語、倮倮文、畢摩文), among them "tadpole writing" (蝌蚪文).

  • Zanabazar Square: 72 code points

    72 characters are encoded in this script.

    Zanabazar's square script is a horizontal Mongolian square script, an abugida developed by the monk and scholar Zanabazar to write Mongolian. It can also be used to write Tibetan and Sanskrit.

  • Inherited: 657 code points

    657 characters are encoded in this script.

  • Mathematical Notation: 0 code points

    0 characters are encoded in this script.

    Mathematical notation consists of using symbols for representing operations, unspecified numbers, relations, and any other mathematical objects and assembling them into expressions and formulas. Mathematical notation is widely used in mathematics, science, and engineering for representing complex concepts and properties in a concise, unambiguous, and accurate way.

  • Emoji: 0 code points

    0 characters are encoded in this script.

    An emoji is a pictogram, logogram, ideogram, or smiley embedded in text and used in electronic messages and web pages. The primary function of emoji is to fill in emotional cues otherwise missing from typed conversation. Emoji exist in various genres, including facial expressions, common objects, places and types of weather, and animals. They are much like emoticons, except emoji are pictures rather than typographic approximations; the term "emoji" in the strict sense refers to such pictures which can be represented as encoded characters, but it is sometimes applied to messaging stickers by extension. Originally meaning pictograph, the word emoji comes from Japanese e  + moji ; the resemblance to the English words emotion and emoticon is purely coincidental. The ISO 15924 script code for emoji is Zsye.

  • Symbol: 0 code points

    0 characters are encoded in this script.

    A symbol is a mark, sign, or word that indicates, signifies, or is understood as representing an idea, object, or relationship. Symbols allow people to go beyond what is known or seen by creating linkages between otherwise very different concepts and experiences. All communication is achieved through the use of symbols. Symbols take the form of words, sounds, gestures, ideas, or visual images and are used to convey other ideas and beliefs. For example, a red octagon is a common symbol for "STOP"; on maps, blue lines often represent rivers; and a red rose often symbolizes love and compassion. Numerals are symbols for numbers; letters of an alphabet may be symbols for certain phonemes; and personal names are symbols representing individuals.

  • Common: 8306 code points

    8306 characters are encoded in this script.

  • Unknown: 66 code points

    66 characters are encoded in this script.