U+842C CJK UNIFIED IDEOGRAPH-842C

U+842C was added to Unicode in version 1.1 (1993). It belongs to the block CJK Unified Ideographs in the Basic Multilingual Plane.

This character is a Other Letter and is mainly used in the Han script. The Unihan Database defines it as ten thousand; innumerable. Its Pīnyīn pronunciation is w. The codepoint has the Numeric value 10000.

The glyph is not a composition. It has a Wide East Asian Width. In bidirectional context it acts as Left To Right and is not mirrored. In text U+842C behaves as Ideographic regarding line breaks. It has type OLetter for sentence and Other for word breaks. The Grapheme Cluster Break is Any.

Representations

System Representation
33836
UTF-8 E8 90 AC
UTF-16 84 2C
UTF-32 00 00 84 2C
URL-Quoted %E8%90%AC
HTML-Escape 萬
Wrong windows-1252 Mojibake 萬
Encoding: EUC-KR (hex bytes) D8 BF
Encoding: JIS0208 (hex bytes) E8 DF
Pīnyīn w
IRG_GSource G1-4D72
IRG_HSource HB1-
IRG_JSource J0-685F
IRG_KPSource KP0-DAC
IRG_KSource K0-583F
IRG_TSource T1-655C
IRG_VSource V1-653
BigFive B855
CCCII 214F22
CNS1986 1-655C
CNS1992 1-655C
EACC 214F22
GB1 4582
Jis0 7263
KPS0 DAC6
KSC0 5631
TaiwanTelegraph 5502
Xerox 242:161

Elsewhere

Complete Record

Property Value
Age (age) 1.1
Unicode Name (na) CJK UNIFIED IDEOGRAPH-842C
Unicode 1 Name (na1)
Block (blk) CJK
General Category (gc) Other Letter
Script (sc) Han
Bidirectional Category (bc) Left To Right
Combining Class (ccc) Not Reordered
Decomposition Type (dt) None
Decomposition Mapping (dm) 萬
Lowercase (Lower)
Simple Lowercase Mapping (slc) 萬
Lowercase Mapping (lc) 萬
Uppercase (Upper)
Simple Uppercase Mapping (suc) 萬
Uppercase Mapping (uc) 萬
Simple Titlecase Mapping (stc) 萬
Titlecase Mapping (tc) 萬
Case Folding (cf) 萬
ASCII Hex Digit (AHex)
Alphabetic (Alpha)
Bidi Control (Bidi_C)
Bidi Mirrored (Bidi_M)
Bidi Paired Bracket (bpb) 萬
Bidi Paired Bracket Type (bpt) None
Cased (Cased)
Composition Exclusion (CE)
Case Ignorable (CI)
Full Composition Exclusion (Comp_Ex)
Changes When Casefolded (CWCF)
Changes When Casemapped (CWCM)
Changes When NFKC Casefolded (CWKCF)
Changes When Lowercased (CWL)
Changes When Titlecased (CWT)
Changes When Uppercased (CWU)
Dash (Dash)
Deprecated (Dep)
Default Ignorable Code Point (DI)
Diacritic (Dia)
East Asian Width (ea) Wide
Extender (Ext)
FC NFKC Closure (FC_NFKC) 萬
Grapheme Cluster Break (GCB) Any
Grapheme Base (Gr_Base)
Grapheme Extend (Gr_Ext)
Hex Digit (Hex)
Hangul Syllable Type (hst) Not Applicable
Hyphen (Hyphen)
ID Continue (IDC)
Ideographic (Ideo)
ID Start (IDS)
IDS Binary Operator (IDSB)
IDS Trinary Operator and (IDST)
InMC (InMC)
Indic Positional Category (InPC) NA
Indic Syllabic Category (InSC) Other
ISO 10646 Comment (isc)
Joining Group (jg) No_Joining_Group
Join Control (Join_C)
Jamo Short Name (JSN)
Joining Type (jt) Non Joining
kIICore (kIICore) ATJ
kIRG_GSource (kIRG_GSource) G1-4D72
kIRG_HSource (kIRG_HSource) HB1-
kIRG_JSource (kIRG_JSource) J0-685F
kIRG_KPSource (kIRG_KPSource) KP0-DAC
kIRG_KSource (kIRG_KSource) K0-583F
kIRG_TSource (kIRG_TSource) T1-655C
kIRG_VSource (kIRG_VSource) V1-653
kIRGDaeJaweon (kIRGDaeJaweon) 1501.060
kIRGDaiKanwaZiten (kIRGDaiKanwaZiten) 31339
kIRGHanyuDaZidian (kIRGHanyuDaZidian) 53247.080
kIRGKangXi (kIRGKangXi) 1042.330
Accounting Numeric Value (kAccountingNumeric) 10000
kBigFive (kBigFive) B855
kCangjie (kCangjie) TWLB
kCantonese (kCantonese) maan6
kCCCII (kCCCII) 214F22
kCihaiT (kCihaiT) 1149.402
kCNS1986 (kCNS1986) 1-655C
kCNS1992 (kCNS1992) 1-655C
kCowles (kCowles) 2576
kDaeJaweon (kDaeJaweon) 1501.060
Unihan Definition (kDefinition) ten thousand; innumerable
kEACC (kEACC) 214F22
kFenn (kFenn) 576C
kFennIndex (kFennIndex) 593.03
kFourCornerCode (kFourCornerCode) 4442.7
kFrequency (kFrequency) 2
kGB1 (kGB1) 4582
kGradeLevel (kGradeLevel) 4
kGSR (kGSR) 0267a
kHangul (kHangul)
kHanYu (kHanYu) 53247.080
kHanyuPinlu (kHanyuPinlu) wàn(1335)
kHanyuPinyin (kHanyuPinyin) 53247.080:wàn
kHKGlyph (kHKGlyph) 2889
kJapaneseKun (kJapaneseKun) YOROZU OOKII
kJapaneseOn (kJapaneseOn) MAN
kJis0 (kJis0) 7263
kKangXi (kKangXi) 1042.330
kKorean (kKorean) MAN
kKPS0 (kKPS0) DAC6
kKSC0 (kKSC0) 5631
kLau (kLau) 2058
kMandarin (kMandarin) wàn
kMatthews (kMatthews) 7030
kMeyerWempe (kMeyerWempe) 1744
kMorohashi (kMorohashi) 31339
kNelson (kNelson) 3984
kPhonetic (kPhonetic) 866
Radical Stroke Count (Adobe Japan 1-6) (kRSAdobe_Japan1_6) C+6408+140.3.9
Radical Stroke Count (KangXi) (kRSKangXi) 140.9
Radical Stroke Count (Unicode) (kRSUnicode) 114.8
kSBGY (kSBGY) 397.37
Semantic Variant (kSemanticVariant) 万卍
Simplified Variant (kSimplifiedVariant) 万
kTaiwanTelegraph (kTaiwanTelegraph) 5502
kTang (kTang) *miæ̀n
Stroke Number (kTotalStrokes) 12
kVietnamese (kVietnamese) vạn
kXerox (kXerox) 242:161
kXHC1983 (kXHC1983) 1185.041:wàn
Line Break (lb) Ideographic
Logical Order Exception (LOE)
Math (Math)
Noncharacter Code Point (NChar)
NFC Quick Check (NFC_QC) Yes
NFD Quick Check (NFD_QC) Yes
NFKC Casefold (NFKC_CF) 萬
NFKC Quick Check (NFKC_QC) Yes
NFKD Quick Check (NFKD_QC) Yes
Numeric Type (nt) Numeric
Numeric Value (nv) 10000
Other Alphabetic (OAlpha)
Other Default Ignorable Code Point (ODI)
Other Grapheme Extend (OGr_Ext)
Other ID Continue (OIDC)
Other ID Start (OIDS)
Other Lowercase (OLower)
Other Math (OMath)
Other Uppercase (OUpper)
Pattern Syntax (Pat_Syn)
Pattern White Space (Pat_WS)
Quotation Mark (QMark)
Radical (Radical)
Sentence Break (SB) OLetter
Simple Case Folding (scf) 萬
Script Extension (scx) Han
Soft Dotted (SD)
STerm (STerm)
Terminal Punctuation (Term)
Unified Ideograph (UIdeo)
Variation Selector (VS)
Word Break (WB) Other
White Space (WSpace)
XID Continue (XIDC)
XID Start (XIDS)
Expands On NFC (XO_NFC)
Expands On NFD (XO_NFD)
Expands On NFKC (XO_NFKC)
Expands On NFKD (XO_NFKD)