In Unicode and the UCS, a compatibility character is a character that is encoded solely to maintain round-trip convertibility with other, often older...
24 KB (3,288 words) - 12:19, 24 November 2024
often included similar or identical characters. Unicode provides two such notions, canonical equivalence and compatibility. Code point sequences that are defined...
16 KB (1,913 words) - 08:57, 16 April 2025
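The difference between the two notions can be illustrated with Python's standard `unicodedata` module. This is a minimal sketch: the helper names and sample strings below are illustrative assumptions, not taken from the article.

```python
import unicodedata

# Sample strings (illustrative choices):
precomposed = "\u00e9"   # "é" as a single code point
decomposed = "e\u0301"   # "e" followed by COMBINING ACUTE ACCENT
ligature = "\ufb01"      # LATIN SMALL LIGATURE FI, a compatibility character


def canonically_equivalent(a: str, b: str) -> bool:
    """Compare two strings after canonical decomposition (NFD)."""
    return unicodedata.normalize("NFD", a) == unicodedata.normalize("NFD", b)


def compatibility_equivalent(a: str, b: str) -> bool:
    """Compare two strings after compatibility decomposition (NFKD)."""
    return unicodedata.normalize("NFKD", a) == unicodedata.normalize("NFKD", b)


# The two spellings of "é" are canonically equivalent.
print(canonically_equivalent(precomposed, decomposed))
# The ligature matches "fi" only under compatibility equivalence.
print(canonically_equivalent(ligature, "fi"))
print(compatibility_equivalent(ligature, "fi"))
```

Compatibility normalization is the looser of the two: it discards formatting distinctions such as ligatures, so it is usually applied for searching and comparison, not for storing text.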
CJK Compatibility is a Unicode block containing square symbols (both CJK and Latin alphanumeric) encoded for compatibility with East Asian character sets...
8 KB (278 words) - 21:26, 3 March 2025
twelve character code points in total. UCS includes thousands of characters that Unicode designates as compatibility characters. These are characters that...
57 KB (7,025 words) - 22:05, 3 June 2025
Unicode has a certain amount of duplication of characters. These are pairs of single Unicode code points that are canonically equivalent. The reason for...
13 KB (2,215 words) - 23:18, 28 December 2024
Number Forms (redirect from Number Forms (Unicode block))
block containing Unicode compatibility characters that have specific meaning as numbers, but are constructed from other characters. They consist primarily...
17 KB (132 words) - 15:59, 14 September 2024
Latin characters in Unicode Dead key Compose key Combining character Unicode equivalence Complex text layout Unicode compatibility characters Alphabetic...
6 KB (669 words) - 01:56, 27 March 2025
(lowercase ij; Dutch pronunciation: [ɛi]; also encountered as Unicode compatibility characters Ĳ and ĳ) is a digraph of the letters i and j. Occurring in...
33 KB (3,372 words) - 18:16, 21 May 2025
characters described below, but only Arabic numerals are available.) Unicode also includes a handful of vulgar fractions as compatibility characters,...
14 KB (1,620 words) - 05:03, 2 November 2024
contains most commonly used characters. The higher planes 1 through 16 are called "supplementary planes". The last code point in Unicode is the last code point...
30 KB (2,383 words) - 17:46, 6 June 2025
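As a rough sketch of how planes relate to code points, a code point's plane is its value divided by 0x10000, since each plane holds 65,536 code points. The helper name `plane_of` is hypothetical and used only for illustration.

```python
import sys

# The highest code point, the final one in plane 16.
MAX_CODE_POINT = 0x10FFFF


def plane_of(ch: str) -> int:
    """Return the plane number (0 to 16) of a single character."""
    # Each plane spans 0x10000 code points, so shift off the low 16 bits.
    return ord(ch) >> 16


print(plane_of("A"))                  # Basic Multilingual Plane (plane 0)
print(plane_of("\U0001F600"))         # a supplementary-plane character
print(plane_of(chr(MAX_CODE_POINT)))  # the last plane
print(sys.maxunicode == MAX_CODE_POINT)
```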
The Unicode Standard assigns various properties to each Unicode character and code point. The properties can be used to handle characters (code points)...
26 KB (3,834 words) - 22:28, 11 June 2025
uncommon Unicode characters. Without proper rendering support, you may see question marks, boxes, or other symbols. Unicode or The Unicode Standard or...
111 KB (11,534 words) - 15:04, 12 June 2025
characters. During the process called Han unification, the common (shared) characters were identified and named CJK Unified Ideographs. As of Unicode...
54 KB (3,017 words) - 11:39, 12 June 2025
Character encoding is the process of assigning numbers to graphical characters, especially the written characters of human language, allowing them to...
32 KB (3,919 words) - 10:47, 12 June 2025
Unicode emoticons or emojis. Without proper rendering support, you may see question marks, boxes, or other symbols instead of the intended characters...
28 KB (648 words) - 13:08, 17 May 2025
(U+4DC0–U+4DFF) Special characters Unicode block Universal Character Set characters "Section 22: Symbols". The Unicode Standard. The Unicode Consortium. September...
8 KB (840 words) - 01:02, 23 May 2025
that it has been agreed that no further Arabic compatibility characters will be encoded. Each Unicode point also has a property called "General Category"...
8 KB (826 words) - 08:04, 6 June 2025
and some additional related characters. HTML and XML provide ways to reference Unicode characters when the characters themselves either cannot or should...
158 KB (1,929 words) - 12:54, 20 May 2025
Fullwidth Forms is a Unicode block U+FF00–FFEF, provided so that older encodings containing both halfwidth and fullwidth characters can have lossless translation...
11 KB (453 words) - 00:58, 7 April 2025
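One common way to fold fullwidth and halfwidth forms onto their ordinary counterparts is compatibility normalization. The sketch below uses Python's standard `unicodedata` module; the function name `fold_width` is a hypothetical helper, and NFKC is used here as an assumption about what a caller would want, since it also rewrites other compatibility characters.

```python
import unicodedata


def fold_width(text: str) -> str:
    """Map fullwidth/halfwidth forms to their ordinary counterparts via NFKC.

    Note: NFKC also changes other compatibility characters (ligatures,
    superscripts, and so on), so this is not a width-only conversion.
    """
    return unicodedata.normalize("NFKC", text)


fullwidth = "\uff21\uff22\uff23\uff11\uff12\uff13"  # fullwidth "ABC123"
halfwidth_ka = "\uff76"                             # halfwidth katakana KA

print(fold_width(fullwidth))
print(fold_width(halfwidth_ka) == "\u30ab")  # ordinary katakana KA

# The character database tags these mappings as <wide> and <narrow>.
print(unicodedata.decomposition("\uff21"))
print(unicodedata.decomposition("\uff76"))
```

The conversion is one-way: once folded, the text no longer records which characters were originally fullwidth or halfwidth, so a round trip back to the legacy encoding needs the original code points.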
settings) can also affect whitespace. Many of the Unicode space characters were created for compatibility with classic print typography. Even if digital...
26 KB (2,570 words) - 15:34, 18 May 2025
Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended...
107 KB (488 words) - 02:43, 25 May 2025
featured in Unicode. Some characters in the Letterlike Symbols block can be substituted with characters in the ASCII range. Latin script Unicode collation...
13 KB (156 words) - 21:42, 10 June 2025
CJK Compatibility Ideographs is a Unicode block created to contain mostly Han characters that were encoded in multiple locations in other established character...
23 KB (721 words) - 13:39, 23 February 2025
encoded for usage in Afrikaans and for compatibility with ISO/IEC 6937, has been deprecated by Unicode (since Unicode 5.2), and the preferred representation...
322 KB (3,512 words) - 15:34, 15 June 2025
information; they can be converted back. To achieve this, Unicode compatibility characters have been introduced. An application can claim to round-trip...
6 KB (741 words) - 05:11, 14 April 2025
backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase...
7 KB (160 words) - 02:17, 26 July 2024
article contains special characters. Without proper rendering support, you may see question marks, boxes, or other symbols. Unicode has subscripted and superscripted...
41 KB (2,858 words) - 18:29, 10 June 2025
Hangul Compatibility Jamo is a Unicode block containing Hangul characters for compatibility with the South Korean national standard KS X 1001 (formerly...
3 KB (129 words) - 22:09, 4 September 2024
CJK Compatibility Ideographs Supplement is a Unicode block containing Han characters used only for roundtrip compatibility mapping with planes 3, 4, 5...
10 KB (80 words) - 15:54, 27 November 2024
Han unification (redirect from Giga Character Set)
an effort by the authors of Unicode and the Universal Character Set to map multiple character sets of the Han characters of the so-called CJK languages...
60 KB (6,329 words) - 09:52, 18 May 2025