Home U+4E00 to U+9FFF CJK Unified Ideographs
Glyph for U+9A37
Source: Noto CJK

U+9A37 CJK UNIFIED IDEO­GRAPH-​9A37

U+9A37 was added to Unicode in version 1.1 (1993). It belongs to the block U+4E00 to U+9FFF CJK Unified Ideographs in the U+0000 to U+FFFF Basic Multilingual Plane.

This character is a Other Letter and is mainly used in the Han script. The Unihan Database defines it as harass, bother, annoy, disturb, agitate; sad, grieved. Its Pīnyīn pronunciation is sāo.

The glyph is not a composition. It has a Wide East Asian Width. In bidirectional context it acts as Left To Right and is not mirrored. In text U+9A37 behaves as Ideographic regarding line breaks. It has type Other Letter for sentence and Other for word breaks. The Grapheme Cluster Break is Any.

Representations

System Representation
39479
UTF-8 E9 A8 B7
UTF-16 9A 37
UTF-32 00 00 9A 37
URL-Quoted %E9%A8%B7
HTML-Escape 騷
Wrong windows-1252 Mojibake 騷
Encoding: EUC-KR (hex bytes) E1 D3
Encoding: JIS0208 (hex bytes) F1 DB
Pīnyīn sāo

Elsewhere

Complete Record

Property Value
Age 1.1 (1993)
Unicode Name CJK UNIFIED IDEOGRAPH-9A37
Unicode 1 Name
Block CJK Unified Ideographs
General Category Other Letter
Script Han
Bidirectional Category Left To Right
Combining Class Not Reordered
Decomposition Type None
Decomposition Mapping Glyph for U+9A37 CJK Unified Ideograph-9A37
Lowercase
Simple Lowercase Mapping Glyph for U+9A37 CJK Unified Ideograph-9A37
Lowercase Mapping Glyph for U+9A37 CJK Unified Ideograph-9A37
Uppercase
Simple Uppercase Mapping Glyph for U+9A37 CJK Unified Ideograph-9A37
Uppercase Mapping Glyph for U+9A37 CJK Unified Ideograph-9A37
Simple Titlecase Mapping Glyph for U+9A37 CJK Unified Ideograph-9A37
Titlecase Mapping Glyph for U+9A37 CJK Unified Ideograph-9A37
Case Folding Glyph for U+9A37 CJK Unified Ideograph-9A37
ASCII Hex Digit
Alphabetic
Bidi Control
Bidi Mirrored
Bidi Paired Bracket Glyph for U+9A37 CJK Unified Ideograph-9A37
Bidi Paired Bracket Type None
Cased
Composition Exclusion
Case Ignorable
Full Composition Exclusion
Changes When Casefolded
Changes When Casemapped
Changes When NFKC Casefolded
Changes When Lowercased
Changes When Titlecased
Changes When Uppercased
Dash
Deprecated
Default Ignorable Code Point
Diacritic
East Asian Width Wide
Emoji Modifier Base
Emoji Component
Emoji Modifier
Emoji
Emoji Presentation
Extender
Extended Pictographic
FC NFKC Closure Glyph for U+9A37 CJK Unified Ideograph-9A37
Grapheme Cluster Break Any
Grapheme Base
Grapheme Extend
Grapheme Link
Hex Digit
Hangul Syllable Type Not Applicable
Hyphen
ID Continue
Ideographic
ID Start
IDS Binary Operator
IDS Trinary Operator and
Indic Mantra Category
Indic Positional Category NA
Indic Syllabic Category Other
ISO 10646 Comment
Joining Group No_Joining_Group
Join Control
Jamo Short Name
Joining Type Non Joining
Line Break Ideographic
Logical Order Exception
Math
Noncharacter Code Point
NFC Quick Check Yes
NFD Quick Check Yes
NFKC Casefold Glyph for U+9A37 CJK Unified Ideograph-9A37
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value not a number
Other Alphabetic
Other Default Ignorable Code Point
Other Grapheme Extend
Other ID Continue
Other ID Start
Other Lowercase
Other Math
Other Uppercase
Pattern Syntax
Pattern White Space
Prepended Concatenation Mark
Quotation Mark
Radical
Regional Indicator
Sentence Break Other Letter
Simple Case Folding Glyph for U+9A37 CJK Unified Ideograph-9A37
Script Extension
Soft Dotted
Sentence Terminal
Terminal Punctuation
Unified Ideograph
Vertical Orientation U
Variation Selector
Word Break Other
White Space
XID Continue
XID Start
Expands On NFC
Expands On NFD
Expands On NFKC
Expands On NFKD
Big Five Mapping C4CC
Cangjie Input Code SFEII
kCantonese sou1
CCCII Mapping 216145
kCihaiT 1509.301
CNS 11643-1986 Mapping 1-7A57
CNS 11643-1992 Mapping 1-7A57
kCowles 3682
kDaeJaweon 1967.230
Unihan Definition harass, bother, annoy, disturb, agitate; sad, grieved
kEACC 216145
kFenn 787I
kFennIndex 430.04
kFourCornerCode 7733.6
kFrequency 4
kGB1 4107
kGradeLevel 5
kGSR 1112g
kHangul 소:0E
kHanYu 74567.100
kHanyuPinyin 74567.100:sāo,sǎo,xiāo
kHKGlyph 4599
kIICore ATHKMP
kIRG_GSource G1-4927
kIRG_HSource HB1-C4CC
kIRG_JSource J0-715B
kIRG_KPSource KP0-E3C0
kIRG_KSource K0-6153
kIRG_TSource T1-7A57
kIRG_VSource V1-6C49
kIRGDaeJaweon 1967.230
kIRGDaiKanwaZiten 44935
kIRGHanyuDaZidian 74567.100
kIRGKangXi 1443.190
kJapaneseKun SAWAGU
kJapaneseOn SOU
kJinmeiyoKanji 2010:U+9A12
kJis0 8159
kKangXi 1443.190
kKorean SO
kKoreanEducationHanja 2007
kKPS0 E3C0
kKSC0 6551
kLau 2865
kMandarin sāo
kMatthews 5433
kMeyerWempe 2836a
kMorohashi 44935
kNelson 5224
kPhonetic 224 1223B
Radical Stroke Count (Adobe Japan 1-6) C+7250+187.10.10
Radical Stroke Count (KangXi) 187.10
Radical Stroke Count (Unicode) 187.10
kSBGY 157.02
Simplified Variant Glyph for U+9A9A U+9A9A
Taiwanese Telegraph Code 7510
kTang sɑu
Stroke Number 19
UnihanCore2020 Set HJKMPT
Quốc ngữ Pronunciation tao
Xerox Code 261:046