  • In Unicode and the UCS, a compatibility character is a character that is encoded solely to maintain round-trip convertibility with other, often older...
    24 KB (3,290 words) - 19:13, 28 July 2025
  • often included similar or identical characters. Unicode provides two such notions, canonical equivalence and compatibility. Code point sequences that are defined...
    16 KB (1,913 words) - 08:57, 16 April 2025
  • CJK Compatibility is a Unicode block containing square symbols (both CJK and Latin alphanumeric) encoded for compatibility with East Asian character sets...
    8 KB (278 words) - 21:26, 3 March 2025
  • twelve character code points in total. UCS includes thousands of characters that Unicode designates as compatibility characters. These are characters that...
    52 KB (6,421 words) - 20:12, 25 July 2025
  • block containing Unicode compatibility characters that have specific meaning as numbers, but are constructed from other characters. They consist primarily...
    18 KB (132 words) - 03:06, 18 July 2025
  • Latin characters in Unicode Dead key Compose key Combining character Unicode equivalence Complex text layout Unicode compatibility characters Alphabetic...
    6 KB (669 words) - 01:56, 27 March 2025
  • characters described below, but only Arabic numerals are available.) Unicode also includes a handful of vulgar fractions as compatibility characters,...
    14 KB (1,618 words) - 18:49, 21 July 2025
  • (lowercase ij; Dutch pronunciation: [ɛi]; also encountered as Unicode compatibility characters Ĳ and ĳ) is a digraph of the letters i and j. Occurring in...
    33 KB (3,372 words) - 15:13, 19 June 2025
  • contains most commonly used characters. The higher planes 1 through 16 are called "supplementary planes". The last code point in Unicode is the last code point...
    30 KB (2,383 words) - 17:14, 18 July 2025
  • Unicode has a certain amount of duplication of characters. These are pairs of single Unicode code points that are canonically equivalent. The reason for...
    13 KB (2,215 words) - 23:18, 28 December 2024
  • (U+4DC0–U+4DFF) Special characters Unicode block Universal Character Set characters "Section 22: Symbols". The Unicode Standard. The Unicode Consortium. September...
    8 KB (840 words) - 05:41, 25 July 2025
  • and some additional related characters. HTML and XML provide ways to reference Unicode characters when the characters themselves either cannot or should...
    158 KB (1,931 words) - 21:34, 27 July 2025
  • Fullwidth Forms is a Unicode block U+FF00–FFEF, provided so that older encodings containing both halfwidth and fullwidth characters can have lossless translation...
    11 KB (453 words) - 00:58, 7 April 2025
  • The Unicode Standard assigns various properties to each Unicode character and code point. The properties can be used to handle characters (code points)...
    26 KB (3,834 words) - 22:28, 11 June 2025
  • encoded for usage in Afrikaans and for compatibility with ISO/IEC 6937, has been deprecated by Unicode (since Unicode 5.2), and the preferred representation...
    322 KB (3,512 words) - 19:13, 2 August 2025
  • This is a list of precomposed Latin characters in Unicode. Unicode typefaces may be needed for these to display correctly. Ǳ, ǲ, ǳ Ǆ, ǅ, ǆ ﬀ ﬃ ﬄ ﬁ ﬂ Ĳ...
    13 KB (156 words) - 22:21, 30 June 2025
  • CJK Compatibility Ideographs is a Unicode block created to contain mostly Han characters that were encoded in multiple locations in other established character...
    23 KB (721 words) - 13:39, 23 February 2025
  • uncommon Unicode characters. Without proper rendering support, you may see question marks, boxes, or other symbols. Unicode (also known as The Unicode Standard...
    112 KB (11,593 words) - 22:02, 29 July 2025
  • almost all standard characters used in mathematics. Unicode Technical Report #25 provides comprehensive information about the character repertoire, their...
    15 KB (889 words) - 06:49, 10 June 2025
  • settings) can also affect whitespace. Many of the Unicode space characters were created for compatibility with classic print typography. Even if digital...
    27 KB (2,579 words) - 10:03, 15 July 2025
  • representing more characters were created, such as ASCII, ISO/IEC 8859, and Unicode encodings such as UTF-8 and UTF-16. The most popular character encoding on...
    31 KB (3,793 words) - 16:38, 7 July 2025
  • characters. During the process called Han unification, the common (shared) characters were identified and named CJK Unified Ideographs. As of Unicode...
    55 KB (3,028 words) - 10:03, 31 July 2025
  • block) Kana Supplement (Unicode block) Small Kana Extension (Unicode block) Hiragana (Unicode block) CJK Compatibility (Unicode block) Enclosed CJK Letters...
    4 KB (99 words) - 19:13, 9 October 2024
  • symbols are supported by the Unicode character encoding standard. As of version 16.0 of the Unicode Standard, 518 characters in the following blocks are...
    27 KB (296 words) - 06:41, 9 June 2025
  • article contains special characters. Without proper rendering support, you may see question marks, boxes, or other symbols. Unicode has subscripted and superscripted...
    42 KB (2,895 words) - 00:18, 30 July 2025
  • Hangul Compatibility Jamo is a Unicode block containing Hangul characters for compatibility with the South Korean national standard KS X 1001 (formerly...
    3 KB (129 words) - 23:32, 28 June 2025
  • contains special characters. Without proper rendering support, you may see question marks, boxes, or other symbols. Ghost characters (Japanese: 幽霊文字,...
    29 KB (2,734 words) - 22:38, 18 July 2025
  • that it has been agreed that no further Arabic compatibility characters will be encoded. Each Unicode point also has a property called "General Category"...
    8 KB (826 words) - 08:04, 6 June 2025
  • backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase...
    7 KB (160 words) - 02:17, 26 July 2024
  • Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended...
    107 KB (488 words) - 02:43, 25 May 2025