U+54B1 was added to Unicode in version 1.1 (1993). It belongs to the block CJK Unified Ideographs in the Basic Multilingual Plane.

This character is a Other Letter and is mainly used in the Han script. The Unihan Database defines it as we, us. Its Pīnyīn pronunciation is z.

The glyph is not a composition. It has a Wide East Asian Width. In bidirectional context it acts as Left To Right and is not mirrored. In text U+54B1 behaves as Ideographic regarding line breaks. It has type OLetter for sentence and Other for word breaks. The Grapheme Cluster Break is Any.


System Representation
UTF-8 E5 92 B1
UTF-16 54 B1
UTF-32 00 00 54 B1
URL-Quoted %E5%92%B1
HTML-Escape 咱
Wrong windows-1252 Mojibake 咱
Pīnyīn z
IRG_GSource G0-545B
IRG_HSource HB1-
IRG_JSource J1-3533
IRG_KPSource KP1-39A
IRG_KSource K2-267A
IRG_TSource T1-5047
BigFive ABA5
CCCII 213630
CNS1986 1-5047
CNS1992 1-5047
EACC 213630
GB0 5259
GB1 5259
Jis1 2119
KPS1 39A5
MainlandTelegraph 0749
TaiwanTelegraph 0749
Xerox 251:367


Complete Record

Property Value
Age (age) 1.1
Unicode Name (na) CJK UNIFIED IDEOGRAPH-54B1
Unicode 1 Name (na1)
Block (blk) CJK
General Category (gc) Other Letter
Script (sc) Han
Bidirectional Category (bc) Left To Right
Combining Class (ccc) Not Reordered
Decomposition Type (dt) None
Decomposition Mapping (dm) 咱
Lowercase (Lower)
Simple Lowercase Mapping (slc) 咱
Lowercase Mapping (lc) 咱
Uppercase (Upper)
Simple Uppercase Mapping (suc) 咱
Uppercase Mapping (uc) 咱
Simple Titlecase Mapping (stc) 咱
Titlecase Mapping (tc) 咱
Case Folding (cf) 咱
ASCII Hex Digit (AHex)
Alphabetic (Alpha)
Bidi Control (Bidi_C)
Bidi Mirrored (Bidi_M)
Bidi Paired Bracket (bpb) 咱
Bidi Paired Bracket Type (bpt) None
Cased (Cased)
Composition Exclusion (CE)
Case Ignorable (CI)
Full Composition Exclusion (Comp_Ex)
Changes When Casefolded (CWCF)
Changes When Casemapped (CWCM)
Changes When NFKC Casefolded (CWKCF)
Changes When Lowercased (CWL)
Changes When Titlecased (CWT)
Changes When Uppercased (CWU)
Dash (Dash)
Deprecated (Dep)
Default Ignorable Code Point (DI)
Diacritic (Dia)
East Asian Width (ea) Wide
Extender (Ext)
FC NFKC Closure (FC_NFKC) 咱
Grapheme Cluster Break (GCB) Any
Grapheme Base (Gr_Base)
Grapheme Extend (Gr_Ext)
Hex Digit (Hex)
Hangul Syllable Type (hst) Not Applicable
Hyphen (Hyphen)
ID Continue (IDC)
Ideographic (Ideo)
ID Start (IDS)
IDS Binary Operator (IDSB)
IDS Trinary Operator and (IDST)
Indic Positional Category (InPC) NA
Indic Syllabic Category (InSC) Other
ISO 10646 Comment (isc)
Joining Group (jg) No_Joining_Group
Join Control (Join_C)
Jamo Short Name (JSN)
Joining Type (jt) Non Joining
kIICore (kIICore) AGT
kIRG_GSource (kIRG_GSource) G0-545B
kIRG_HSource (kIRG_HSource) HB1-
kIRG_JSource (kIRG_JSource) J1-3533
kIRG_KPSource (kIRG_KPSource) KP1-39A
kIRG_KSource (kIRG_KSource) K2-267A
kIRG_TSource (kIRG_TSource) T1-5047
kIRGDaeJaweon (kIRGDaeJaweon) 0406.090
kIRGDaiKanwaZiten (kIRGDaiKanwaZiten) 03552
kIRGHanyuDaZidian (kIRGHanyuDaZidian) 10618.060
kIRGKangXi (kIRGKangXi) 0187.070
kBigFive (kBigFive) ABA5
kCangjie (kCangjie) RHBU
kCantonese (kCantonese) zaa1
kCCCII (kCCCII) 213630
kCihaiT (kCihaiT) 273.101
kCNS1986 (kCNS1986) 1-5047
kCNS1992 (kCNS1992) 1-5047
kDaeJaweon (kDaeJaweon) 0406.090
Unihan Definition (kDefinition) we, us
kEACC (kEACC) 213630
kFenn (kFenn) 852A
kFennIndex (kFennIndex) 536.07 541.09
kFourCornerCode (kFourCornerCode) 6600.0
kFrequency (kFrequency) 4
kGB0 (kGB0) 5259
kGB1 (kGB1) 5259
kHanYu (kHanYu) 10618.060
kHanyuPinlu (kHanyuPinlu) zán(741) zan(11)
kHanyuPinyin (kHanyuPinyin) 10618.060:zá,zán,zǎ,zan
kHKGlyph (kHKGlyph) 0574
kJapaneseKun (kJapaneseKun) WARE
kJapaneseOn (kJapaneseOn) SATSU SACHI SA SHA
kJis1 (kJis1) 2119
kKangXi (kKangXi) 0187.070
kKorean (kKorean) CHAL CHA
kKPS1 (kKPS1) 39A5
kMainlandTelegraph (kMainlandTelegraph) 0749
kMandarin (kMandarin) zán
kMatthews (kMatthews) 6645
kMeyerWempe (kMeyerWempe) 3247a
kMorohashi (kMorohashi) 03552
kPhonetic (kPhonetic) 146
Radical Stroke Count (Adobe Japan 1-6) (kRSAdobe_Japan1_6) C+19228+30.3.6 C+19228+132.6.3
Radical Stroke Count (KangXi) (kRSKangXi) 30.6
Radical Stroke Count (Unicode) (kRSUnicode) 30.6
Semantic Variant (kSemanticVariant) 偺
Specialized Semantic Variant (kSpecializedSemanticVariant) 喒
kTaiwanTelegraph (kTaiwanTelegraph) 0749
Stroke Number (kTotalStrokes) 9
kXerox (kXerox) 251:367
kXHC1983 (kXHC1983) 1435.070:zá 1439.040:zán 1440.070:zan
Line Break (lb) Ideographic
Logical Order Exception (LOE)
Math (Math)
Noncharacter Code Point (NChar)
NFC Quick Check (NFC_QC) Yes
NFD Quick Check (NFD_QC) Yes
NFKC Casefold (NFKC_CF) 咱
NFKC Quick Check (NFKC_QC) Yes
NFKD Quick Check (NFKD_QC) Yes
Numeric Type (nt) None
Numeric Value (nv) NaN
Other Alphabetic (OAlpha)
Other Default Ignorable Code Point (ODI)
Other Grapheme Extend (OGr_Ext)
Other ID Continue (OIDC)
Other ID Start (OIDS)
Other Lowercase (OLower)
Other Math (OMath)
Other Uppercase (OUpper)
Pattern Syntax (Pat_Syn)
Pattern White Space (Pat_WS)
Quotation Mark (QMark)
Radical (Radical)
Sentence Break (SB) OLetter
Simple Case Folding (scf) 咱
Script Extension (scx) Han
Soft Dotted (SD)
STerm (STerm)
Terminal Punctuation (Term)
Unified Ideograph (UIdeo)
Variation Selector (VS)
Word Break (WB) Other
White Space (WSpace)
XID Continue (XIDC)
XID Start (XIDS)
Expands On NFC (XO_NFC)
Expands On NFD (XO_NFD)
Expands On NFKC (XO_NFKC)
Expands On NFKD (XO_NFKD)