U+5140 CJK UNIFIED IDEOGRAPH-5140

U+5140 was added to Unicode in version 1.1 (1993). It belongs to the block CJK Unified Ideographs in the Basic Multilingual Plane.

This character is a Other Letter and is mainly used in the Han script. The Unihan Database defines it as to cut off the feet. Its Pīnyīn pronunciation is .

The glyph is not a composition. It has a Wide East Asian Width. In bidirectional context it acts as Left To Right and is not mirrored. The glyph can, under circumstances, be confused with 2 other glyphs. In text U+5140 behaves as Ideographic regarding line breaks. It has type OLetter for sentence and Other for word breaks. The Grapheme Cluster Break is Any.

Representations

System Representation
20800
UTF-8 E5 85 80
UTF-16 51 40
UTF-32 00 00 51 40
URL-Quoted %E5%85%80
HTML-Escape 兀
Wrong windows-1252 Mojibake 兀
Encoding: EUC-KR (hex bytes) E8 B4
Encoding: JIS0208 (hex bytes) D1 BA
Pīnyīn
IRG_GSource G0-5823
IRG_HSource HB1-
IRG_JSource J0-513A
IRG_KPSource KP0-F9F
IRG_KSource K0-6834
IRG_TSource T1-4442
IRG_VSource V1-4C3
BigFive A461
CCCII 21326C
CNS1986 1-4442
CNS1992 1-4442
EACC 21326C
GB0 5603
GB1 5603
Jis0 4926
KPS0 F9F2
KSC0 7220
MainlandTelegraph 0335
TaiwanTelegraph 0335
Xerox 263:362

Related Characters

  • 兀

Confusables

  • ⺎
  • 兀

Elsewhere

Complete Record

Property Value
Age (age) 1.1
Unicode Name (na) CJK UNIFIED IDEOGRAPH-5140
Unicode 1 Name (na1)
Block (blk) CJK
General Category (gc) Other Letter
Script (sc) Han
Bidirectional Category (bc) Left To Right
Combining Class (ccc) Not Reordered
Decomposition Type (dt) None
Decomposition Mapping (dm) 兀
Lowercase (Lower)
Simple Lowercase Mapping (slc) 兀
Lowercase Mapping (lc) 兀
Uppercase (Upper)
Simple Uppercase Mapping (suc) 兀
Uppercase Mapping (uc) 兀
Simple Titlecase Mapping (stc) 兀
Titlecase Mapping (tc) 兀
Case Folding (cf) 兀
ASCII Hex Digit (AHex)
Alphabetic (Alpha)
Bidi Control (Bidi_C)
Bidi Mirrored (Bidi_M)
Bidi Paired Bracket (bpb) 兀
Bidi Paired Bracket Type (bpt) None
Cased (Cased)
Composition Exclusion (CE)
Case Ignorable (CI)
Full Composition Exclusion (Comp_Ex)
Changes When Casefolded (CWCF)
Changes When Casemapped (CWCM)
Changes When NFKC Casefolded (CWKCF)
Changes When Lowercased (CWL)
Changes When Titlecased (CWT)
Changes When Uppercased (CWU)
Dash (Dash)
Deprecated (Dep)
Default Ignorable Code Point (DI)
Diacritic (Dia)
East Asian Width (ea) Wide
Extender (Ext)
FC NFKC Closure (FC_NFKC) 兀
Grapheme Cluster Break (GCB) Any
Grapheme Base (Gr_Base)
Grapheme Extend (Gr_Ext)
Hex Digit (Hex)
Hangul Syllable Type (hst) Not Applicable
Hyphen (Hyphen)
ID Continue (IDC)
Ideographic (Ideo)
ID Start (IDS)
IDS Binary Operator (IDSB)
IDS Trinary Operator and (IDST)
InMC (InMC)
Indic Positional Category (InPC) NA
Indic Syllabic Category (InSC) Other
ISO 10646 Comment (isc)
Joining Group (jg) No_Joining_Group
Join Control (Join_C)
Jamo Short Name (JSN)
Joining Type (jt) Non Joining
kIICore (kIICore) AGT
kIRG_GSource (kIRG_GSource) G0-5823
kIRG_HSource (kIRG_HSource) HB1-
kIRG_JSource (kIRG_JSource) J0-513A
kIRG_KPSource (kIRG_KPSource) KP0-F9F
kIRG_KSource (kIRG_KSource) K0-6834
kIRG_TSource (kIRG_TSource) T1-4442
kIRG_VSource (kIRG_VSource) V1-4C3
kIRGDaeJaweon (kIRGDaeJaweon) 0257.220
kIRGDaiKanwaZiten (kIRGDaiKanwaZiten) 01337
kIRGHanyuDaZidian (kIRGHanyuDaZidian) 10264.050
kIRGKangXi (kIRGKangXi) 0123.020
kBigFive (kBigFive) A461
kCangjie (kCangjie) MU
kCantonese (kCantonese) ngat6
kCCCII (kCCCII) 21326C
kCihaiT (kCihaiT) 131.204
kCNS1986 (kCNS1986) 1-4442
kCNS1992 (kCNS1992) 1-4442
kCowles (kCowles) 3036
kDaeJaweon (kDaeJaweon) 0257.220
Unihan Definition (kDefinition) to cut off the feet
kEACC (kEACC) 21326C
kFenn (kFenn) 387K
kFennIndex (kFennIndex) 610.10
kFourCornerCode (kFourCornerCode) 1021.0
kFrequency (kFrequency) 5
kGB0 (kGB0) 5603
kGB1 (kGB1) 5603
kGSR (kGSR) 0487a
kHangul (kHangul)
kHanYu (kHanYu) 10264.050
kHanyuPinyin (kHanyuPinyin) 10264.050:wù
kHKGlyph (kHKGlyph) 0260
kJapaneseKun (kJapaneseKun) TAKAI HAGERU ASHIKIRU
kJapaneseOn (kJapaneseOn) KOTSU GOTSU
kJis0 (kJis0) 4926
kKangXi (kKangXi) 0123.020
kKorean (kKorean) OL
kKPS0 (kKPS0) F9F2
kKSC0 (kKSC0) 7220
kLau (kLau) 2342
kMainlandTelegraph (kMainlandTelegraph) 0335
kMandarin (kMandarin)
kMatthews (kMatthews) 7205
kMeyerWempe (kMeyerWempe) 2046
kMorohashi (kMorohashi) 01337
kNelson (kNelson) 0004
kPhonetic (kPhonetic) 963 1455
Radical Stroke Count (Adobe Japan 1-6) (kRSAdobe_Japan1_6) C+4209+1.1.2 C+4209+10.2.1 C+4209+43.3.0
Radical Stroke Count (KangXi) (kRSKangXi) 10.1
Radical Stroke Count (Unicode) (kRSUnicode) 10.1
kSBGY (kSBGY) 481.34
kTaiwanTelegraph (kTaiwanTelegraph) 0335
kTang (kTang) *nguət nguət
Stroke Number (kTotalStrokes) 3
kVietnamese (kVietnamese) ngột
kXerox (kXerox) 263:362
kXHC1983 (kXHC1983) 1211.020:wū 1223.140:wù
z Variant (kZVariant) 兀
Line Break (lb) Ideographic
Logical Order Exception (LOE)
Math (Math)
Noncharacter Code Point (NChar)
NFC Quick Check (NFC_QC) Yes
NFD Quick Check (NFD_QC) Yes
NFKC Casefold (NFKC_CF) 兀
NFKC Quick Check (NFKC_QC) Yes
NFKD Quick Check (NFKD_QC) Yes
Numeric Type (nt) None
Numeric Value (nv) NaN
Other Alphabetic (OAlpha)
Other Default Ignorable Code Point (ODI)
Other Grapheme Extend (OGr_Ext)
Other ID Continue (OIDC)
Other ID Start (OIDS)
Other Lowercase (OLower)
Other Math (OMath)
Other Uppercase (OUpper)
Pattern Syntax (Pat_Syn)
Pattern White Space (Pat_WS)
Quotation Mark (QMark)
Radical (Radical)
Sentence Break (SB) OLetter
Simple Case Folding (scf) 兀
Script Extension (scx) Han
Soft Dotted (SD)
STerm (STerm)
Terminal Punctuation (Term)
Unified Ideograph (UIdeo)
Variation Selector (VS)
Word Break (WB) Other
White Space (WSpace)
XID Continue (XIDC)
XID Start (XIDS)
Expands On NFC (XO_NFC)
Expands On NFD (XO_NFD)
Expands On NFKC (XO_NFKC)
Expands On NFKD (XO_NFKD)