U+57CF was added to Unicode in version 1.1 (1993). It belongs to the block CJK Unified Ideographs in the Basic Multilingual Plane.

This character is an Other Letter and is mainly used in the Han script. The Unihan Database defines it as "a boundary, a limit". Its Pīnyīn pronunciation is shān.

The glyph has no decomposition. Its East Asian Width is Wide. In bidirectional context it acts as Left To Right and is not mirrored. For line breaking, U+57CF behaves as Ideographic. Its Sentence Break type is OLetter and its Word Break type is Other. Its Grapheme Cluster Break is Any.
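Several of the properties described above can be checked with Python's standard `unicodedata` module. This is a minimal sketch, not part of the original record: the variable names `ch` and `props` are illustrative, and the values reported depend on the Unicode version bundled with the Python interpreter. Break properties (line, word, sentence, grapheme cluster) are not exposed by `unicodedata` and are not covered here.

```python
import unicodedata

# The character under discussion, U+57CF.
ch = "\u57cf"

# Collect the properties that the standard library exposes directly.
props = {
    "name": unicodedata.name(ch),                          # algorithmic CJK name
    "general_category": unicodedata.category(ch),          # "Lo" = Other Letter
    "bidi_class": unicodedata.bidirectional(ch),           # "L" = Left To Right
    "mirrored": unicodedata.mirrored(ch),                  # 0 = not mirrored
    "east_asian_width": unicodedata.east_asian_width(ch),  # "W" = Wide
    "decomposition": unicodedata.decomposition(ch),        # "" = no decomposition
    "combining_class": unicodedata.combining(ch),          # 0 = Not Reordered
}

for key, value in props.items():
    print(f"{key}: {value!r}")
```

The short codes returned by `unicodedata` (`Lo`, `L`, `W`) are the abbreviated forms of the long property values used in the text.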


System Representation
UTF-8 E5 9F 8F
UTF-16 57 CF
UTF-32 00 00 57 CF
URL-Quoted %E5%9F%8F
HTML-Escape &#x57CF;
Wrong windows-1252 Mojibake åŸ (followed by the byte 8F, which has no printable windows-1252 character)
Pīnyīn shān
IRG_GSource G0-5B6F
IRG_HSource HB2-
IRG_JSource J3-2F50
IRG_KPSource KP1-3BB
IRG_KSource K1-6744
IRG_TSource T2-343D
BigFive D4BA
CCCII 21765A
CNS1986 2-343D
CNS1992 2-343D
EACC 21765A
GB0 5979
GB1 5979
JIS0213 1,15,48
Jis1 2374
KSC1 7136
MainlandTelegraph 1007
TaiwanTelegraph 1007
Xerox 302:173
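The Unicode encoding forms listed at the top of this table can be reproduced with a short script. The following is a sketch using only the Python standard library. The variable names are illustrative, big-endian byte order is assumed for UTF-16 and UTF-32 to match the table, and `bytes.hex` with a separator requires Python 3.8 or later. The legacy code points (BigFive, GB0, and so on) are not derived here.

```python
from urllib.parse import quote

# The character under discussion, U+57CF.
ch = "\u57cf"

# Encoding forms, formatted as space-separated uppercase hex bytes.
utf8 = ch.encode("utf-8").hex(" ").upper()
utf16 = ch.encode("utf-16-be").hex(" ").upper()
utf32 = ch.encode("utf-32-be").hex(" ").upper()

# Percent-encoding of the UTF-8 bytes, as used in URLs.
url_quoted = quote(ch)

# Hexadecimal numeric character reference for HTML.
html_escape = f"&#x{ord(ch):04X};"

# Mojibake: UTF-8 bytes misread as windows-1252. Python's cp1252 codec
# treats byte 0x8F as undefined, so errors="replace" is used to
# substitute U+FFFD instead of raising an exception.
mojibake = ch.encode("utf-8").decode("cp1252", errors="replace")

print("UTF-8      ", utf8)
print("UTF-16     ", utf16)
print("UTF-32     ", utf32)
print("URL-Quoted ", url_quoted)
print("HTML-Escape", html_escape)
```

Other decoders may treat the undefined byte 8F differently, so the last character of the mojibake string can vary between tools.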


Complete Record

Property Value
Age (age) 1.1
Unicode 1 Name (na1)
Block (blk) CJK
General Category (gc) Other Letter
Script (sc) Han
Bidirectional Category (bc) Left To Right
Combining Class (ccc) Not Reordered
Decomposition Type (dt) None
Decomposition Mapping (dm) 埏
Lowercase (Lower)
Simple Lowercase Mapping (slc) 埏
Lowercase Mapping (lc) 埏
Uppercase (Upper)
Simple Uppercase Mapping (suc) 埏
Uppercase Mapping (uc) 埏
Simple Titlecase Mapping (stc) 埏
Titlecase Mapping (tc) 埏
Case Folding (cf) 埏
ASCII Hex Digit (AHex)
Alphabetic (Alpha)
Bidi Control (Bidi_C)
Bidi Mirrored (Bidi_M)
Bidi Paired Bracket (bpb) 埏
Bidi Paired Bracket Type (bpt) None
Cased (Cased)
Composition Exclusion (CE)
Case Ignorable (CI)
Full Composition Exclusion (Comp_Ex)
Changes When Casefolded (CWCF)
Changes When Casemapped (CWCM)
Changes When NFKC Casefolded (CWKCF)
Changes When Lowercased (CWL)
Changes When Titlecased (CWT)
Changes When Uppercased (CWU)
Dash (Dash)
Deprecated (Dep)
Default Ignorable Code Point (DI)
Diacritic (Dia)
East Asian Width (ea) Wide
Extender (Ext)
FC NFKC Closure (FC_NFKC) 埏
Grapheme Cluster Break (GCB) Any
Grapheme Base (Gr_Base)
Grapheme Extend (Gr_Ext)
Hex Digit (Hex)
Hangul Syllable Type (hst) Not Applicable
Hyphen (Hyphen)
ID Continue (IDC)
Ideographic (Ideo)
ID Start (IDS)
IDS Binary Operator (IDSB)
IDS Trinary Operator (IDST)
Indic Positional Category (InPC) NA
Indic Syllabic Category (InSC) Other
ISO 10646 Comment (isc)
Joining Group (jg) No_Joining_Group
Join Control (Join_C)
Jamo Short Name (JSN)
Joining Type (jt) Non Joining
kIRG_GSource (kIRG_GSource) G0-5B6F
kIRG_HSource (kIRG_HSource) HB2-
kIRG_JSource (kIRG_JSource) J3-2F50
kIRG_KPSource (kIRG_KPSource) KP1-3BB
kIRG_KSource (kIRG_KSource) K1-6744
kIRG_TSource (kIRG_TSource) T2-343D
kIRGDaeJaweon (kIRGDaeJaweon) 0466.010
kIRGDaiKanwaZiten (kIRGDaiKanwaZiten) 05121
kIRGHanyuDaZidian (kIRGHanyuDaZidian) 10445.100
kIRGKangXi (kIRGKangXi) 0230.010
kBigFive (kBigFive) D4BA
kCangjie (kCangjie) GNKM
kCantonese (kCantonese) jin4
kCCCII (kCCCII) 21765A
kCihaiT (kCihaiT) 320.401
kCNS1986 (kCNS1986) 2-343D
kCNS1992 (kCNS1992) 2-343D
kDaeJaweon (kDaeJaweon) 0466.010
Unihan Definition (kDefinition) a boundary, a limit
kEACC (kEACC) 21765A
kFourCornerCode (kFourCornerCode) 4214.1
kGB0 (kGB0) 5979
kGB1 (kGB1) 5979
kGSR (kGSR) 0203e
kHangul (kHangul)
kHanYu (kHanYu) 10445.100
kHanyuPinyin (kHanyuPinyin) 10445.100:yán,shān
kJapaneseOn (kJapaneseOn) SEN ZEN EN
kJIS0213 (kJIS0213) 1,15,48
kJis1 (kJis1) 2374
kKangXi (kKangXi) 0230.010
kKorean (kKorean) YEN
kKPS1 (kKPS1) 3BB6
kKSC1 (kKSC1) 7136
kMainlandTelegraph (kMainlandTelegraph) 1007
kMandarin (kMandarin) shān
kMatthews (kMatthews) 7343
kMeyerWempe (kMeyerWempe) 896f
kMorohashi (kMorohashi) 05121
kPhonetic (kPhonetic) 1578
Radical Stroke Count (Adobe Japan 1-6) (kRSAdobe_Japan1_6) C+16817+32.3.7
Radical Stroke Count (KangXi) (kRSKangXi) 32.7
Radical Stroke Count (Unicode) (kRSUnicode) 32.7
kSBGY (kSBGY) 137.33 138.20
kTaiwanTelegraph (kTaiwanTelegraph) 1007
Stroke Number (kTotalStrokes) 9
kXerox (kXerox) 302:173
kXHC1983 (kXHC1983) 0997.020:shān
Line Break (lb) Ideographic
Logical Order Exception (LOE)
Math (Math)
Noncharacter Code Point (NChar)
NFC Quick Check (NFC_QC) Yes
NFD Quick Check (NFD_QC) Yes
NFKC Casefold (NFKC_CF) 埏
NFKC Quick Check (NFKC_QC) Yes
NFKD Quick Check (NFKD_QC) Yes
Numeric Type (nt) None
Numeric Value (nv) NaN
Other Alphabetic (OAlpha)
Other Default Ignorable Code Point (ODI)
Other Grapheme Extend (OGr_Ext)
Other ID Continue (OIDC)
Other ID Start (OIDS)
Other Lowercase (OLower)
Other Math (OMath)
Other Uppercase (OUpper)
Pattern Syntax (Pat_Syn)
Pattern White Space (Pat_WS)
Quotation Mark (QMark)
Radical (Radical)
Sentence Break (SB) OLetter
Simple Case Folding (scf) 埏
Script Extension (scx) Han
Soft Dotted (SD)
STerm (STerm)
Terminal Punctuation (Term)
Unified Ideograph (UIdeo)
Variation Selector (VS)
Word Break (WB) Other
White Space (WSpace)
XID Continue (XIDC)
XID Start (XIDS)
Expands On NFC (XO_NFC)
Expands On NFD (XO_NFD)
Expands On NFKC (XO_NFKC)
Expands On NFKD (XO_NFKD)
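The record lists "Yes" for all four normalization quick checks and shows every case mapping as the character itself. A minimal sketch of how this could be verified with the Python standard library follows. The names `normalized`, `all_stable` and `case_stable` are illustrative and not part of the record, and `unicodedata.is_normalized` requires Python 3.8 or later.

```python
import unicodedata

# The character under discussion, U+57CF.
ch = "\u57cf"

FORMS = ("NFC", "NFD", "NFKC", "NFKD")

# Apply each normalization form; a character whose quick-check value is
# "Yes" for a form is left unchanged by that form.
normalized = {form: unicodedata.normalize(form, ch) for form in FORMS}
all_stable = all(result == ch for result in normalized.values())

# Equivalent check without building the normalized strings.
already_normalized = all(unicodedata.is_normalized(form, ch) for form in FORMS)

# Han ideographs are caseless, so every case operation is the identity.
case_stable = all(
    mapped == ch
    for mapped in (ch.lower(), ch.upper(), ch.title(), ch.casefold())
)

print("stable under normalization:", all_stable and already_normalized)
print("stable under case mapping: ", case_stable)
```

Because the character is unchanged by all four forms, it can be compared byte for byte after normalization without special handling.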