U+4FFFE Noncharacter *

U+4FFFE was added in Unicode version 2.0 in 1996. It doesn’t belong to a defined block but is located in Plane 4 (unassigned), which spans U+40000 to U+4FFFF.
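The plane of a code point follows directly from its value: each plane spans 0x10000 code points, and planes are conventionally numbered from 0, so the range U+40000 to U+4FFFF has index 4 (the fifth plane when counting from one). A minimal sketch, in which the helper names `plane_of` and `plane_range` are illustrative and not part of any library:

```python
def plane_of(code_point: int) -> int:
    """Return the zero-based plane index of a Unicode code point."""
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError("not a Unicode code point")
    # Each plane holds 0x10000 (65,536) code points.
    return code_point >> 16


def plane_range(plane: int):
    """Return the (first, last) code points of a zero-based plane."""
    first = plane << 16
    return first, first + 0xFFFF


cp = 0x4FFFE
first, last = plane_range(plane_of(cp))
print(f"U+{cp:04X} is in plane {plane_of(cp)}: U+{first:04X} to U+{last:04X}")
```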

This code point has the general category Unassigned (Cn) and has no script assigned.
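The general category can be cross-checked with Python's standard `unicodedata` module. This is only a sketch: the module reflects whichever Unicode version ships with the interpreter, although noncharacters are not expected to change category.

```python
import unicodedata

ch = chr(0x4FFFE)

# "Cn" is the general category "Other, not assigned".
category = unicodedata.category(ch)

# Noncharacters have no character name; the second argument is the
# default returned instead of raising ValueError.
name = unicodedata.name(ch, None)

print(category, name)
```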

The glyph is not a composition. It has no designated width in East Asian texts. In bidirectional text it acts as Boundary Neutral. When changing direction it is not mirrored. U+4FFFE does not participate in line break calculations.

This is a so-called “noncharacter”, one of 66 in Unicode. These codepoints are reserved solely for internal use. For further information, see Unicode’s FAQ on noncharacters.
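The 66 noncharacters are the contiguous range U+FDD0 to U+FDEF plus the last two code points of each of the 17 planes (U+FFFE, U+FFFF, U+1FFFE, U+1FFFF, and so on). A minimal sketch of that test, where the function name `is_noncharacter` is an illustrative choice:

```python
def is_noncharacter(cp: int) -> bool:
    """Return True if cp is one of the 66 Unicode noncharacters."""
    if not 0 <= cp <= 0x10FFFF:
        return False
    # 32 noncharacters in the Arabic Presentation Forms-A area.
    if 0xFDD0 <= cp <= 0xFDEF:
        return True
    # The last two code points of every plane: low 16 bits are FFFE or FFFF.
    return (cp & 0xFFFE) == 0xFFFE


# Count across the whole code space: 32 + 2 * 17 planes.
count = sum(is_noncharacter(cp) for cp in range(0x110000))
print(is_noncharacter(0x4FFFE), count)
```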

Wikipedia has the following information about this code point:

The Unicode Consortium and ISO/IEC JTC 1/SC 2/WG 2 jointly maintain the list of characters in the Universal Coded Character Set. The Universal Coded Character Set, most commonly called the Universal Character Set (abbr. UCS, official designation: ISO/IEC 10646), is an international standard that maps characters (discrete symbols used in natural language, mathematics, music, and other domains) to unique machine-readable data values. By creating this mapping, the UCS enables computer software vendors to interoperate and to interchange UCS-encoded text strings with one another. Because it is a universal map, it can be used to represent multiple languages at the same time. This avoids the confusion of using multiple legacy character encodings, where the same sequence of codes can have multiple interpretations depending on the character encoding in use, resulting in mojibake if the wrong one is chosen.

UCS has a potential capacity of over 1 million characters. Each UCS character is abstractly represented by a code point, an integer between 0 and 1,114,111 (1,114,112 = 2^20 + 2^16 or 17 × 2^16 = 0x110000 code points), used to represent each character within the internal logic of text processing software. As of Unicode 16.0, released in September 2024, 299,056 (27%) of these code points are allocated, 155,063 (14%) have been assigned characters, 137,468 (12%) are reserved for private use, 2,048 are used to enable the mechanism of surrogates, and 66 are designated as noncharacters, leaving the remaining 815,056 (73%) unallocated. The number of encoded characters is made up as follows:

  • 154,826 graphical characters (some of which do not have a visible glyph, but are still counted as graphical)
  • 237 special purpose characters for control and formatting.
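The code space arithmetic quoted above can be checked in a few lines. This sketch only re-derives figures already stated in the text (the total of 0x110000 code points and the 299,056 allocated code points); the variable names are illustrative.

```python
# Total size of the code space: 17 planes of 2**16 code points each.
TOTAL = 0x110000
assert TOTAL == 17 * 2**16 == 2**20 + 2**16 == 1_114_112

# Figure quoted for Unicode 16.0.
allocated = 299_056

# Everything not allocated is unallocated.
unallocated = TOTAL - allocated

# Percentages rounded to whole numbers, as in the text.
allocated_share = round(100 * allocated / TOTAL)
unallocated_share = round(100 * unallocated / TOTAL)

print(unallocated, allocated_share, unallocated_share)
```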

ISO maintains the basic mapping of characters from character name to code point. Often, the terms character and code point will be used interchangeably. However, when a distinction is made, a code point refers to the integer of the character: what one might think of as its address. Meanwhile, a character in ISO/IEC 10646 includes the combination of the code point and its name. Unicode adds many other useful properties to the character set, such as block, category, script, and directionality.

In addition to the UCS, the supplementary Unicode Standard (not a joint project with ISO, but rather a publication of the Unicode Consortium) provides other implementation details such as:

  1. mappings between UCS and other character sets
  2. different collations of characters and character strings for different languages
  3. an algorithm for laying out bidirectional text ("the BiDi algorithm"), where text on the same line may shift between left-to-right ("LTR") and right-to-left ("RTL")
  4. a case-folding algorithm

Computer software end users enter these characters into programs through various input methods, for example, physical keyboards or virtual character palettes.

The UCS can be divided in various ways, such as by plane, block, character category, or character property.

Representations

System                        Representation
Decimal number                327678
UTF-8                         F1 8F BF BE
UTF-16                        D8 FF DF FE
UTF-32                        00 04 FF FE
URL-quoted                    %F1%8F%BF%BE
HTML hex reference            &#x4FFFE;
Wrong windows-1252 mojibake   ñ¿¾ (byte 8F has no windows-1252 mapping)
GB18030 (hex bytes)           A4 38 87 32
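These byte sequences can be reproduced with Python's built-in codecs. The sketch below assumes big-endian byte order without a byte order mark for UTF-16 and UTF-32, matching the listing; the `hex_bytes` helper is illustrative. It also derives the UTF-16 surrogate pair by hand to show where D8FF and DFFE come from.

```python
from urllib.parse import quote

cp = 0x4FFFE
ch = chr(cp)


def hex_bytes(data: bytes) -> str:
    """Format bytes as space-separated uppercase hex pairs."""
    return " ".join(f"{b:02X}" for b in data)


utf8 = hex_bytes(ch.encode("utf-8"))
utf16 = hex_bytes(ch.encode("utf-16-be"))   # big-endian, no BOM
utf32 = hex_bytes(ch.encode("utf-32-be"))   # big-endian, no BOM
gb18030 = hex_bytes(ch.encode("gb18030"))
url_quoted = quote(ch)                      # percent-encodes the UTF-8 bytes
html_ref = f"&#x{cp:X};"

# Manual UTF-16: subtract 0x10000, then split the 20-bit offset
# into a high and a low 10-bit half.
offset = cp - 0x10000
high = 0xD800 + (offset >> 10)
low = 0xDC00 + (offset & 0x3FF)

print(cp, utf8, utf16, utf32, gb18030, url_quoted, html_ref)
print(f"{high:04X} {low:04X}")
```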

Complete Record

Property Value
Age (age) 2.0 (1996)
Unicode Name (na)
Unicode 1 Name (na1)
General Category (gc) Unassigned
Script (sc) Unknown
Bidirectional Category (bc) Boundary Neutral
Combining Class (ccc) Not Reordered
Decomposition Type (dt) none
Decomposition Mapping (dm) U+4FFFE
Lowercase (Lower)
Simple Lowercase Mapping (slc) U+4FFFE
Lowercase Mapping (lc) U+4FFFE
Uppercase (Upper)
Simple Uppercase Mapping (suc) U+4FFFE
Uppercase Mapping (uc) U+4FFFE
Simple Titlecase Mapping (stc) U+4FFFE
Titlecase Mapping (tc) U+4FFFE
Case Folding (cf) U+4FFFE
ASCII Hex Digit (AHex)
Alphabetic (Alpha)
Bidi Control (Bidi_C)
Bidi Mirrored (Bidi_M)
Composition Exclusion (CE)
Case Ignorable (CI)
Changes When Casefolded (CWCF)
Changes When Casemapped (CWCM)
Changes When NFKC Casefolded (CWKCF)
Changes When Lowercased (CWL)
Changes When Titlecased (CWT)
Changes When Uppercased (CWU)
Cased (Cased)
Full Composition Exclusion (Comp_Ex)
Default Ignorable Code Point (DI)
Dash (Dash)
Deprecated (Dep)
Diacritic (Dia)
Emoji Modifier Base (EBase)
Emoji Component (EComp)
Emoji Modifier (EMod)
Emoji Presentation (EPres)
Emoji (Emoji)
Extender (Ext)
Extended Pictographic (ExtPict)
FC NFKC Closure (FC_NFKC)
Grapheme Cluster Break (GCB) Any
Grapheme Base (Gr_Base)
Grapheme Extend (Gr_Ext)
Grapheme Link (Gr_Link)
Hex Digit (Hex)
Hyphen (Hyphen)
ID Continue (IDC)
ID Start (IDS)
IDS Binary Operator (IDSB)
IDS Trinary Operator (IDST)
Ideographic (Ideo)
Indic Positional Category (InPC) NA
Indic Syllabic Category (InSC) Other
Jamo Short Name (JSN)
Join Control (Join_C)
Logical Order Exception (LOE)
Math (Math)
Noncharacter Code Point (NChar)
NFC Quick Check (NFC_QC) Yes
NFD Quick Check (NFD_QC) Yes
NFKC Casefold (NFKC_CF) U+4FFFE
NFKC Quick Check (NFKC_QC) Yes
NFKD Quick Check (NFKD_QC) Yes
Other Alphabetic (OAlpha)
Other Default Ignorable Code Point (ODI)
Other Grapheme Extend (OGr_Ext)
Other ID Continue (OIDC)
Other ID Start (OIDS)
Other Lowercase (OLower)
Other Math (OMath)
Other Uppercase (OUpper)
Prepended Concatenation Mark (PCM)
Pattern Syntax (Pat_Syn)
Pattern White Space (Pat_WS)
Quotation Mark (QMark)
Regional Indicator (RI)
Radical (Radical)
Sentence Break (SB) Other
Soft Dotted (SD)
Sentence Terminal (STerm)
Terminal Punctuation (Term)
Unified Ideograph (UIdeo)
Variation Selector (VS)
Word Break (WB) Other
White Space (WSpace)
XID Continue (XIDC)
XID Start (XIDS)
Expands On NFC (XO_NFC)
Expands On NFD (XO_NFD)
Expands On NFKC (XO_NFKC)
Expands On NFKD (XO_NFKD)
Bidi Mirrored Glyph (bmg)
Bidi Paired Bracket Type (bpt) None
East Asian Width (ea) neutral
Hangul Syllable Type (hst) Not Applicable
ISO 10646 Comment (isc)
Joining Group (jg) No_Joining_Group
Joining Type (jt) Non Joining
Line Break (lb) Unknown
Numeric Type (nt) none
Numeric Value (nv) not a number
Simple Case Folding (scf) U+4FFFE
Script Extension (scx) Unknown
Vertical Orientation (vo) R