U+542B CJK UNIFIED IDEOGRAPH-542B

U+542B was added to Unicode in version 1.1 (1993). It belongs to the block CJK Unified Ideographs in the Basic Multilingual Plane.

This character is a Other Letter and is mainly used in the Han script. The Unihan Database defines it as hold in mouth; cherish; contain. Its Pīnyīn pronunciation is h.

The glyph is not a composition. It has a Wide East Asian Width. In bidirectional context it acts as Left To Right and is not mirrored. In text U+542B behaves as Ideographic regarding line breaks. It has type OLetter for sentence and Other for word breaks. The Grapheme Cluster Break is Any.

Representations

System Representation
21547
UTF-8 E5 90 AB
UTF-16 54 2B
UTF-32 00 00 54 2B
URL-Quoted %E5%90%AB
HTML-Escape 含
Wrong windows-1252 Mojibake 含
Encoding: EUC-KR (hex bytes) F9 DF
Encoding: JIS0208 (hex bytes) B4 DE
Pīnyīn h
IRG_GSource G0-3A2C
IRG_HSource HB1-
IRG_JSource J0-345E
IRG_KPSource KP0-F2D
IRG_KSource K0-795F
IRG_TSource T1-4956
IRG_VSource V1-4E5
BigFive A774
CCCII 213565
CNS1986 1-4956
CNS1992 1-4956
EACC 213565
GB0 2612
GB1 2612
Jis0 2062
KPS0 F2DF
KSC0 8963
MainlandTelegraph 0698
TaiwanTelegraph 0698
Xerox 246:335

Elsewhere

Complete Record

Property Value
Age (age) 1.1
Unicode Name (na) CJK UNIFIED IDEOGRAPH-542B
Unicode 1 Name (na1)
Block (blk) CJK
General Category (gc) Other Letter
Script (sc) Han
Bidirectional Category (bc) Left To Right
Combining Class (ccc) Not Reordered
Decomposition Type (dt) None
Decomposition Mapping (dm) 含
Lowercase (Lower)
Simple Lowercase Mapping (slc) 含
Lowercase Mapping (lc) 含
Uppercase (Upper)
Simple Uppercase Mapping (suc) 含
Uppercase Mapping (uc) 含
Simple Titlecase Mapping (stc) 含
Titlecase Mapping (tc) 含
Case Folding (cf) 含
ASCII Hex Digit (AHex)
Alphabetic (Alpha)
Bidi Control (Bidi_C)
Bidi Mirrored (Bidi_M)
Bidi Paired Bracket (bpb) 含
Bidi Paired Bracket Type (bpt) None
Cased (Cased)
Composition Exclusion (CE)
Case Ignorable (CI)
Full Composition Exclusion (Comp_Ex)
Changes When Casefolded (CWCF)
Changes When Casemapped (CWCM)
Changes When NFKC Casefolded (CWKCF)
Changes When Lowercased (CWL)
Changes When Titlecased (CWT)
Changes When Uppercased (CWU)
Dash (Dash)
Deprecated (Dep)
Default Ignorable Code Point (DI)
Diacritic (Dia)
East Asian Width (ea) Wide
Extender (Ext)
FC NFKC Closure (FC_NFKC) 含
Grapheme Cluster Break (GCB) Any
Grapheme Base (Gr_Base)
Grapheme Extend (Gr_Ext)
Hex Digit (Hex)
Hangul Syllable Type (hst) Not Applicable
Hyphen (Hyphen)
ID Continue (IDC)
Ideographic (Ideo)
ID Start (IDS)
IDS Binary Operator (IDSB)
IDS Trinary Operator and (IDST)
InMC (InMC)
Indic Positional Category (InPC) NA
Indic Syllabic Category (InSC) Other
ISO 10646 Comment (isc)
Joining Group (jg) No_Joining_Group
Join Control (Join_C)
Jamo Short Name (JSN)
Joining Type (jt) Non Joining
kIICore (kIICore) AGT
kIRG_GSource (kIRG_GSource) G0-3A2C
kIRG_HSource (kIRG_HSource) HB1-
kIRG_JSource (kIRG_JSource) J0-345E
kIRG_KPSource (kIRG_KPSource) KP0-F2D
kIRG_KSource (kIRG_KSource) K0-795F
kIRG_TSource (kIRG_TSource) T1-4956
kIRG_VSource (kIRG_VSource) V1-4E5
kIRGDaeJaweon (kIRGDaeJaweon) 0395.190
kIRGDaiKanwaZiten (kIRGDaiKanwaZiten) 03350
kIRGHanyuDaZidian (kIRGHanyuDaZidian) 10592.010
kIRGKangXi (kIRGKangXi) 0178.140
kBigFive (kBigFive) A774
kCangjie (kCangjie) OINR
kCantonese (kCantonese) ham4
kCCCII (kCCCII) 213565
kCihaiT (kCihaiT) 258.202
kCNS1986 (kCNS1986) 1-4956
kCNS1992 (kCNS1992) 1-4956
kCowles (kCowles) 1152
kDaeJaweon (kDaeJaweon) 0395.190
Unihan Definition (kDefinition) hold in mouth; cherish; contain
kEACC (kEACC) 213565
kFenn (kFenn) 437D
kFennIndex (kFennIndex) 147.10 152.08
kFourCornerCode (kFourCornerCode) 8060.2
kFrequency (kFrequency) 3
kGB0 (kGB0) 2612
kGB1 (kGB1) 2612
kGradeLevel (kGradeLevel) 4
kGSR (kGSR) 0651l'
kHangul (kHangul)
kHanYu (kHanYu) 10592.010
kHanyuPinlu (kHanyuPinlu) hán(320)
kHanyuPinyin (kHanyuPinyin) 10592.010:hán,hàn
kHKGlyph (kHKGlyph) 0532
kJapaneseKun (kJapaneseKun) FUKUMU FUKUMERU
kJapaneseOn (kJapaneseOn) GAN
kJis0 (kJis0) 2062
kKangXi (kKangXi) 0178.140
kKarlgren (kKarlgren) 62
kKorean (kKorean) HAM
kKPS0 (kKPS0) F2DF
kKSC0 (kKSC0) 8963
kLau (kLau) 1122
kMainlandTelegraph (kMainlandTelegraph) 0698
kMandarin (kMandarin) hán
kMatthews (kMatthews) 2017
kMeyerWempe (kMeyerWempe) 774
kMorohashi (kMorohashi) 03350
kNelson (kNelson) 0402
kPhonetic (kPhonetic) 497 565
Radical Stroke Count (Adobe Japan 1-6) (kRSAdobe_Japan1_6) C+1562+30.3.4
Radical Stroke Count (KangXi) (kRSKangXi) 30.4
Radical Stroke Count (Unicode) (kRSUnicode) 30.4
kSBGY (kSBGY) 222.07
Specialized Semantic Variant (kSpecializedSemanticVariant) 唅
kTaiwanTelegraph (kTaiwanTelegraph) 0698
kTang (kTang) *hom
Stroke Number (kTotalStrokes) 7
kVietnamese (kVietnamese) hàm
kXerox (kXerox) 246:335
kXHC1983 (kXHC1983) 0438.050:hán
Line Break (lb) Ideographic
Logical Order Exception (LOE)
Math (Math)
Noncharacter Code Point (NChar)
NFC Quick Check (NFC_QC) Yes
NFD Quick Check (NFD_QC) Yes
NFKC Casefold (NFKC_CF) 含
NFKC Quick Check (NFKC_QC) Yes
NFKD Quick Check (NFKD_QC) Yes
Numeric Type (nt) None
Numeric Value (nv) NaN
Other Alphabetic (OAlpha)
Other Default Ignorable Code Point (ODI)
Other Grapheme Extend (OGr_Ext)
Other ID Continue (OIDC)
Other ID Start (OIDS)
Other Lowercase (OLower)
Other Math (OMath)
Other Uppercase (OUpper)
Pattern Syntax (Pat_Syn)
Pattern White Space (Pat_WS)
Quotation Mark (QMark)
Radical (Radical)
Sentence Break (SB) OLetter
Simple Case Folding (scf) 含
Script Extension (scx) Han
Soft Dotted (SD)
STerm (STerm)
Terminal Punctuation (Term)
Unified Ideograph (UIdeo)
Variation Selector (VS)
Word Break (WB) Other
White Space (WSpace)
XID Continue (XIDC)
XID Start (XIDS)
Expands On NFC (XO_NFC)
Expands On NFD (XO_NFD)
Expands On NFKC (XO_NFKC)
Expands On NFKD (XO_NFKD)