
U+002C Comma

U+002C was added in Unicode version 1.1 in 1993. It belongs to the Basic Latin block (U+0000 to U+007F) in the Basic Multilingual Plane (U+0000 to U+FFFF).

This character is an Other Punctuation mark and belongs to the Common script, that is, it is not specific to any one script. The Unicode character data adds the note that its use as a decimal or thousands separator is locale dependent.
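To illustrate the locale dependence of the separator role, here is a minimal sketch. The helper `format_number` is hypothetical and the two conventions shown are only examples; a real application would normally take the separators from a locale library rather than hard-code them.

```python
def format_number(value: float, thousands: str, decimal: str) -> str:
    """Format value with two decimals using the given separator characters."""
    # Python's "," format option always groups with U+002C and uses "." as decimal point.
    base = f"{value:,.2f}"
    # Swap through a placeholder so the two replacements do not collide.
    return base.replace(",", "\x00").replace(".", decimal).replace("\x00", thousands)


# Comma as thousands separator, full stop as decimal separator.
english_style = format_number(1234567.891, ",", ".")
# Full stop as thousands separator, comma as decimal separator.
german_style = format_number(1234567.891, ".", ",")

print(english_style)  # 1,234,567.89
print(german_style)   # 1.234.567,89
```

The same code point therefore carries two different meanings in numeric text, which is why parsers cannot interpret it without knowing the convention in use.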

The character has no decomposition. Its East Asian Width is narrow. In bidirectional text it acts as a number separator and takes its direction from the numbers it separates. It is not mirrored when the text direction changes. It does not end a sentence. U+002C prohibits a line break after it, and also before it if it is preceded by a number. The glyph can be confused with 16 other glyphs.
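Several of the properties described above can be checked with Python's standard `unicodedata` module. This is only a sketch: the values come from the Unicode database bundled with the Python build in use, and the module does not expose every property (line break and sentence break classes, for example, are not available there).

```python
import unicodedata

comma = "\u002C"

print(unicodedata.name(comma))              # COMMA
print(unicodedata.category(comma))          # Po  (Other Punctuation)
print(unicodedata.bidirectional(comma))     # CS  (Common Separator)
print(unicodedata.east_asian_width(comma))  # Na  (narrow)
print(unicodedata.mirrored(comma))          # 0   (not mirrored)
print(unicodedata.combining(comma))         # 0   (not reordered)
print(repr(unicodedata.decomposition(comma)))  # ''  (no decomposition)
```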

The CLDR project calls this character “comma” for use in screen reading software.

Wikipedia has the following information about this code point:

The comma (,) is a punctuation mark that appears in several variants in different languages. It has the same shape as an apostrophe or single closing quotation mark (’) in many typefaces, but it differs from them in being placed on the baseline of the text. Some typefaces render it as a small line, slightly curved or straight, but inclined from the vertical. Other fonts give it the appearance of a miniature filled-in figure 9 on the baseline.

The comma is used in many contexts and languages, mainly to separate parts of a sentence such as clauses, and to separate items in lists, mainly when three or more items are listed. The word comma comes from the Greek κόμμα (kómma), which originally meant a cut-off piece and, specifically in grammar, a short clause.

A comma-shaped mark is used as a diacritic in several writing systems and is considered distinct from the cedilla. In Byzantine and modern copies of Ancient Greek, the "rough" and "smooth breathings" (ἁ, ἀ) appear above the letter. In Latvian, Romanian, and Livonian, the comma diacritic appears below the letter, as in ș.

In spoken language, a common rule of thumb is that the function of a comma is generally performed by a pause.


Representations

System Representation
Decimal 44
UTF-8 2C
UTF-16 00 2C
UTF-32 00 00 00 2C
URL-Quoted %2C
HTML hex reference &#x002C;
HTML named entity &comma;
note use as decimal or thousands separator is locale dependent
AGL: Latin-1 comma
AGL: Latin-2 comma
AGL: Latin-3 comma
AGL: Latin-4 comma
AGL: Latin-5 comma
Adobe Glyph List comma
digraph ,
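The encoded forms listed above can be reproduced with the Python standard library. This is a sketch for verification only; the helper `hex_bytes` is a hypothetical name, and big-endian byte order without a byte order mark is assumed for UTF-16 and UTF-32 to match the table.

```python
import html
from urllib.parse import quote

comma = "\u002C"


def hex_bytes(data: bytes) -> str:
    """Render bytes as space-separated uppercase hex pairs."""
    return " ".join(f"{byte:02X}" for byte in data)


print(ord(comma))                            # 44
print(hex_bytes(comma.encode("utf-8")))      # 2C
print(hex_bytes(comma.encode("utf-16-be")))  # 00 2C
print(hex_bytes(comma.encode("utf-32-be")))  # 00 00 00 2C
print(quote(comma, safe=""))                 # %2C

# Both HTML references decode back to the same character.
print(html.unescape("&#x002C;") == comma)    # True
print(html.unescape("&comma;") == comma)     # True
```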

Complete Record

Property Value
Age (age) 1.1 (1993)
Unicode Name (na) COMMA
Unicode 1 Name (na1)
Block (blk) Basic Latin
General Category (gc) Other Punctuation
Script (sc) Common
Bidirectional Category (bc) Common Separator
Combining Class (ccc) Not Reordered
Decomposition Type (dt) none
Decomposition Mapping (dm) U+002C Comma
Lowercase (Lower)
Simple Lowercase Mapping (slc) U+002C Comma
Lowercase Mapping (lc) U+002C Comma
Uppercase (Upper)
Simple Uppercase Mapping (suc) U+002C Comma
Uppercase Mapping (uc) U+002C Comma
Simple Titlecase Mapping (stc) U+002C Comma
Titlecase Mapping (tc) U+002C Comma
Case Folding (cf) U+002C Comma
ASCII Hex Digit (AHex)
Alphabetic (Alpha)
Bidi Control (Bidi_C)
Bidi Mirrored (Bidi_M)
Composition Exclusion (CE)
Case Ignorable (CI)
Changes When Casefolded (CWCF)
Changes When Casemapped (CWCM)
Changes When NFKC Casefolded (CWKCF)
Changes When Lowercased (CWL)
Changes When Titlecased (CWT)
Changes When Uppercased (CWU)
Cased (Cased)
Full Composition Exclusion (Comp_Ex)
Default Ignorable Code Point (DI)
Dash (Dash)
Deprecated (Dep)
Diacritic (Dia)
Emoji Modifier Base (EBase)
Emoji Component (EComp)
Emoji Modifier (EMod)
Emoji Presentation (EPres)
Emoji (Emoji)
Extender (Ext)
Extended Pictographic (ExtPict)
FC NFKC Closure (FC_NFKC) U+002C Comma
Grapheme Cluster Break (GCB) Any
Grapheme Base (Gr_Base)
Grapheme Extend (Gr_Ext)
Grapheme Link (Gr_Link)
Hex Digit (Hex)
Hyphen (Hyphen)
ID Continue (IDC)
ID Start (IDS)
IDS Binary Operator (IDSB)
IDS Trinary Operator (IDST)
IDS Unary Operator (IDSU) No
ID_Compat_Math_Continue (ID_Compat_Math_Continue) No
ID_Compat_Math_Start (ID_Compat_Math_Start) No
Ideographic (Ideo)
InCB (InCB) None
Indic Matra Category (InMC)
Indic Positional Category (InPC) NA
Indic Syllabic Category (InSC) Other
Jamo Short Name (JSN)
Join Control (Join_C)
Logical Order Exception (LOE)
Math (Math)
Noncharacter Code Point (NChar)
NFC Quick Check (NFC_QC) Yes
NFD Quick Check (NFD_QC) Yes
NFKC Casefold (NFKC_CF) U+002C Comma
NFKC Quick Check (NFKC_QC) Yes
NFKC_SCF (NFKC_SCF) U+002C Comma
NFKD Quick Check (NFKD_QC) Yes
Other Alphabetic (OAlpha)
Other Default Ignorable Code Point (ODI)
Other Grapheme Extend (OGr_Ext)
Other ID Continue (OIDC)
Other ID Start (OIDS)
Other Lowercase (OLower)
Other Math (OMath)
Other Uppercase (OUpper)
Prepended Concatenation Mark (PCM)
Pattern Syntax (Pat_Syn)
Pattern White Space (Pat_WS)
Quotation Mark (QMark)
Regional Indicator (RI)
Radical (Radical)
Sentence Break (SB) Sentence Continue
Soft Dotted (SD)
Sentence Terminal (STerm)
Terminal Punctuation (Term)
Unified Ideograph (UIdeo)
Variation Selector (VS)
Word Break (WB) MidNum
White Space (WSpace)
XID Continue (XIDC)
XID Start (XIDS)
Expands On NFC (XO_NFC)
Expands On NFD (XO_NFD)
Expands On NFKC (XO_NFKC)
Expands On NFKD (XO_NFKD)
Bidi Paired Bracket (bpb) U+002C Comma
Bidi Paired Bracket Type (bpt) None
East Asian Width (ea) narrow
Hangul Syllable Type (hst) Not Applicable
ISO 10646 Comment (isc)
Joining Group (jg) No_Joining_Group
Joining Type (jt) Non Joining
Line Break (lb) Infix Numeric Separator
Numeric Type (nt) none
Numeric Value (nv) not a number
Simple Case Folding (scf) U+002C Comma
Script Extension (scx)
Vertical Orientation (vo) R
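
The case mapping, normalization and numeric entries in the record above all say that U+002C maps to itself and carries no numeric value. A short sketch using Python's built-in string methods and the `unicodedata` module shows the same behaviour; it assumes Python's bundled Unicode database agrees with the record on these points.

```python
import unicodedata

comma = "\u002C"

# Case mappings and case folding leave the character unchanged.
case_results = [comma.lower(), comma.upper(), comma.title(), comma.casefold()]
case_stable = all(result == comma for result in case_results)
print(case_stable)  # True

# All four normalization forms return the same single code point.
forms = ["NFC", "NFD", "NFKC", "NFKD"]
normalization_stable = all(unicodedata.normalize(form, comma) == comma for form in forms)
print(normalization_stable)  # True

# No numeric value is assigned, so the supplied default is returned.
numeric_value = unicodedata.numeric(comma, None)
print(numeric_value)  # None
```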