
U+5C4E CJK UNIFIED IDEOGRAPH-5C4E

U+5C4E was added to Unicode in version 1.1 (1993). It belongs to the block CJK Unified Ideographs (U+4E00 to U+9FFF) in the Basic Multilingual Plane (U+0000 to U+FFFF).

This character is an Other Letter and is mainly used in the Han script. The Unihan Database defines it as "excrement, shit, dung". Its Pīnyīn pronunciation is shǐ.

The glyph is not a composition. Its East Asian Width is Wide. In bidirectional text it acts as Left To Right and is not mirrored. For line breaking, U+5C4E behaves as Ideographic. Its Sentence Break type is Other Letter and its Word Break type is Other. The Grapheme Cluster Break is Any.
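Several of the properties above can be checked against the Unicode data that ships with Python. This is a minimal sketch using the standard `unicodedata` module. It covers only the properties that module exposes (not line, word, or sentence break classes), and the values reflect whichever Unicode version the interpreter was built with.

```python
import unicodedata

ch = "\u5c4e"  # U+5C4E

# Collect the properties that the standard library exposes directly.
props = {
    "name": unicodedata.name(ch),
    "category": unicodedata.category(ch),              # "Lo" = Other Letter
    "east_asian_width": unicodedata.east_asian_width(ch),  # "W" = Wide
    "bidirectional": unicodedata.bidirectional(ch),    # "L" = Left To Right
    "mirrored": unicodedata.mirrored(ch),              # 0 = not mirrored
    "combining": unicodedata.combining(ch),            # 0 = Not Reordered
    "decomposition": unicodedata.decomposition(ch),    # "" = no decomposition
}

for key, value in props.items():
    print(f"{key:18} {value!r}")
```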

Representations

System Representation
Decimal 23630
UTF-8 E5 B1 8E
UTF-16 5C 4E
UTF-32 00 00 5C 4E
URL-Quoted %E5%B1%8E
HTML-Escape &#x5C4E;
Wrong windows-1252 Mojibake å±Ž
Encoding: EUC-KR (hex bytes) E3 BA
Encoding: JIS0208 (hex bytes) D5 FD
Pīnyīn shǐ
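The rows of this table can be reproduced from the code point alone. The following is a sketch using only the Python standard library. The codec names `euc_kr` and `euc_jp` are Python's own identifiers, chosen here on the assumption that the "EUC-KR" and "JIS0208" rows list EUC-encoded bytes. The mojibake row is modelled as UTF-8 bytes wrongly decoded as windows-1252.

```python
from urllib.parse import quote

ch = "\u5c4e"  # U+5C4E

code_point = ord(ch)               # decimal value of the code point
utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")     # big-endian, no byte order mark
utf32 = ch.encode("utf-32-be")
url_quoted = quote(ch)             # percent-encodes the UTF-8 bytes
html_escape = f"&#x{code_point:X};"
mojibake = utf8.decode("cp1252")   # UTF-8 bytes misread as windows-1252
euc_kr = ch.encode("euc_kr")
euc_jp = ch.encode("euc_jp")

def hex_bytes(data: bytes) -> str:
    """Format bytes as space-separated uppercase hex pairs."""
    return data.hex(" ").upper()

print("Decimal    ", code_point)
print("UTF-8      ", hex_bytes(utf8))
print("UTF-16     ", hex_bytes(utf16))
print("UTF-32     ", hex_bytes(utf32))
print("URL-Quoted ", url_quoted)
print("HTML-Escape", html_escape)
print("Mojibake   ", mojibake)
print("EUC-KR     ", hex_bytes(euc_kr))
print("EUC-JP     ", hex_bytes(euc_jp))
```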

Complete Record

Property Value
Age 1.1 (1993)
Unicode Name CJK UNIFIED IDEOGRAPH-5C4E
Unicode 1 Name
Block CJK Unified Ideographs
General Category Other Letter
Script Han
Bidirectional Category Left To Right
Combining Class Not Reordered
Decomposition Type None
Decomposition Mapping U+5C4E CJK Unified Ideograph-5C4E
Lowercase
Simple Lowercase Mapping U+5C4E CJK Unified Ideograph-5C4E
Lowercase Mapping U+5C4E CJK Unified Ideograph-5C4E
Uppercase
Simple Uppercase Mapping U+5C4E CJK Unified Ideograph-5C4E
Uppercase Mapping U+5C4E CJK Unified Ideograph-5C4E
Simple Titlecase Mapping U+5C4E CJK Unified Ideograph-5C4E
Titlecase Mapping U+5C4E CJK Unified Ideograph-5C4E
Case Folding U+5C4E CJK Unified Ideograph-5C4E
ASCII Hex Digit
Alphabetic
Bidi Control
Bidi Mirrored
Bidi Paired Bracket U+5C4E CJK Unified Ideograph-5C4E
Bidi Paired Bracket Type None
Cased
Composition Exclusion
Case Ignorable
Full Composition Exclusion
Changes When Casefolded
Changes When Casemapped
Changes When NFKC Casefolded
Changes When Lowercased
Changes When Titlecased
Changes When Uppercased
Dash
Deprecated
Default Ignorable Code Point
Diacritic
East Asian Width Wide
Emoji Modifier Base
Emoji Component
Emoji Modifier
Emoji
Emoji Presentation
Extender
Extended Pictographic
FC NFKC Closure U+5C4E CJK Unified Ideograph-5C4E
Grapheme Cluster Break Any
Grapheme Base
Grapheme Extend
Grapheme Link
Hex Digit
Hangul Syllable Type Not Applicable
Hyphen
ID Continue
Ideographic
ID Start
IDS Binary Operator
IDS Trinary Operator
Indic Matra Category
Indic Positional Category NA
Indic Syllabic Category Other
ISO 10646 Comment
Joining Group No_Joining_Group
Join Control
Jamo Short Name
Joining Type Non Joining
Line Break Ideographic
Logical Order Exception
Math
Noncharacter Code Point
NFC Quick Check Yes
NFD Quick Check Yes
NFKC Casefold U+5C4E CJK Unified Ideograph-5C4E
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value not a number
Other Alphabetic
Other Default Ignorable Code Point
Other Grapheme Extend
Other ID Continue
Other ID Start
Other Lowercase
Other Math
Other Uppercase
Pattern Syntax
Pattern White Space
Prepended Concatenation Mark
Quotation Mark
Radical
Regional Indicator
Sentence Break Other Letter
Simple Case Folding U+5C4E CJK Unified Ideograph-5C4E
Script Extension
Soft Dotted
Sentence Terminal
Terminal Punctuation
Unified Ideograph
Vertical Orientation U
Variation Selector
Word Break Other
White Space
XID Continue
XID Start
Expands On NFC
Expands On NFD
Expands On NFKC
Expands On NFKD
Big Five Mapping ABCB
Cangjie Input Code SFD
kCantonese si2
CCCII Mapping 213B57
kCihaiT 449.302
CNS 11643-1986 Mapping 1-506D
CNS 11643-1992 Mapping 1-506D
kCowles 3902
kDaeJaweon 0598.150
Unihan Definition excrement, shit, dung
kEACC 213B57
kFenn 724H
kFennIndex 453.07
kFourCornerCode 7729.4
kFrequency 5
kGB0 4226
kGB1 4226
kGSR 0561d
kHangul 시:0N 히:N
kHanYu 20973.060
kHanyuPinlu shǐ(14)
kHanyuPinyin 20973.060:shǐ,xī
kHKGlyph 1073
kIICore AGTJHKMP
kIRG_GSource G0-4A3A
kIRG_HSource HB1-ABCB
kIRG_JSource J0-557D
kIRG_KPSource KP0-E5A1
kIRG_KSource K0-633A
kIRG_TSource T1-506D
kIRGDaeJaweon 0598.150
kIRGDaiKanwaZiten 07689
kIRGHanyuDaZidian 20973.060
kIRGKangXi 0301.250
kJapaneseKun KUSO
kJapaneseOn SHI KI
kJis0 5393
kKangXi 0301.250
kKorean SI
kKoreanName 2015
kKPS0 E5A1
kKSC0 6726
kLau 2760
kMainlandTelegraph 1452
kMandarin shǐ
kMatthews 5757
kMeyerWempe 2684
kMorohashi 07689
kNelson 1390
kPhonetic 1176
Radical Stroke Count (Adobe Japan 1-6) C+4652+44.3.6 C+4652+119.6.3
Radical Stroke Count (KangXi) 44.6
Radical Stroke Count (Unicode) 44.6
kSBGY 058.41 248.35
Taiwanese Telegraph Code 1452
kTGH 2013:1725
kTGHZ2013 336.060:shǐ
Stroke Number 9
UnihanCore2020 Set GHJKMPT
Xerox Code 265:122
kXHC1983 1045.040:shǐ
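The legacy-encoding fields in this record appear to relate to the encoded bytes by simple arithmetic. The sketch below assumes that kJis0, kKSC0, and kGB0 are four-digit decimal row-cell codes, and that the corresponding EUC byte pair is each half plus 0xA0. The helper `row_cell_to_euc` is a hypothetical name introduced for illustration, and the Python codecs `euc_jp`, `euc_kr`, and `gb2312` are used only to cross-check the derived bytes.

```python
def row_cell_to_euc(code: str) -> bytes:
    """Convert a 4-digit decimal row-cell code (e.g. '5393') to two EUC bytes.

    Assumption: the first two digits are the row and the last two the cell,
    and EUC places each at an offset of 0xA0.
    """
    row, cell = int(code[:2]), int(code[2:])
    return bytes([row + 0xA0, cell + 0xA0])

CHAR = "\u5c4e"  # U+5C4E

# (Unihan field, value from the record above, Python codec used for comparison)
FIELDS = [
    ("kJis0", "5393", "euc_jp"),
    ("kKSC0", "6726", "euc_kr"),
    ("kGB0", "4226", "gb2312"),
]

results = {}
for field, code, codec in FIELDS:
    derived = row_cell_to_euc(code)
    matches = derived == CHAR.encode(codec)
    results[field] = (derived, matches)
    print(f"{field} {code} -> {derived.hex(' ').upper()} (matches {codec}: {matches})")
```

Under the same assumption, the IRG source values follow the same pattern with an offset of 0x20 instead of 0xA0: row 53, cell 93 gives 0x55 0x7D, which agrees with J0-557D above.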