U+060C Arabic Comma
U+060C was added in Unicode version 1.1 in 1993. It belongs to the block
This character is a Other Punctuation and is commonly used, that is, in no specific script. It is also used in the scripts Arabic, Garay, N’Ko, Hanifi Rohingya, Syriac, Thaana, Yezidi.
The glyph is not a composition. It has no designated width in East Asian texts. In bidirectional text it is written as number separator according to the number it separates. When changing direction it is not mirrored. It will not end a sentence. U+060C prohibits a line break after it, and before it, too, if preceded by a number. The glyph can be confused with 2 other glyphs.
The CLDR project calls this character “arabic comma” for use in screen reading software. It assigns these additional labels, e.g. for search in emoji pickers: arabic, comma.
The Wikipedia has the following information about this codepoint:
The comma , is a punctuation mark that appears in several variants in different languages. Some typefaces render it as a small line, slightly curved or straight, but inclined from the vertical, others give it the appearance of a miniature filled-in figure 9 placed on the baseline. In many typefaces it is the same shape as an apostrophe or single closing quotation mark ’.
The comma is used in many contexts and languages, mainly to separate parts of a sentence such as clauses, and items in lists mainly when there are three or more items listed. The word comma comes from the Greek κόμμα (kómma), which originally meant a cut-off piece, specifically in grammar, a short clause.
A comma-shaped mark is used as a diacritic in several writing systems and is considered distinct from the cedilla. In Byzantine and modern copies of Ancient Greek, the "rough" and "smooth breathings" (ἁ, ἀ) appear above the letter. In Latvian, Romanian, and Livonian, the comma diacritic appears below the letter, as in ș.
In spoken language, a common rule of thumb is that the function of a comma is generally performed by a pause.
In this article, ⟨x⟩ denotes a grapheme (writing) and /x/ denotes a phoneme (sound).
Representations
System | Representation |
---|---|
Nº | 1548 |
UTF-8 | D8 8C |
UTF-16 | 06 0C |
UTF-32 | 00 00 06 0C |
URL-Quoted | %D8%8C |
HTML hex reference | ، |
Wrong windows-1252 Mojibake | ، |
Encoding: CP864 (hex bytes) | AC |
Encoding: CP1006 (hex bytes) | AB |
Encoding: CP1256 (hex bytes) | A1 |
Encoding: GB18030 (hex bytes) | 81 31 81 32 |
Encoding: ISO8859_6 (hex bytes) | AC |
Encoding: CP1097 (hex bytes) | 42 |
Encoding: CP420 (hex bytes) | 79 |
Adobe Glyph List | afii57388 |
Adobe Glyph List | commaarabic |
digraph | ,+ |
Related Characters
Confusables
Elsewhere
Complete Record
Property | Value |
---|---|
1.1 (1993) | |
ARABIC COMMA | |
— | |
Arabic | |
Other Punctuation | |
Common | |
Common Separator | |
Not Reordered | |
none | |
|
|
✘ | |
|
|
|
|
✘ | |
|
|
|
|
|
|
|
|
|
|
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
|
|
Any | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
0 | |
0 | |
0 | |
✘ | |
None | |
— | |
NA | |
Other | |
— | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
Yes | |
Yes | |
|
|
Yes | |
|
|
Yes | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
Sentence Continue | |
✘ | |
✘ | |
✔ | |
✘ | |
✘ | |
MidNum | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
|
|
None | |
neutral | |
Not Applicable | |
— | |
No_Joining_Group | |
Non Joining | |
Infix Numeric Separator | |
none | |
not a number | |
|
|
Arabic Garay N’Ko Hanifi Rohingya Syriac Thaana Yezidi | |
R |