these characters correctly in both Arial Unicode MS and in other (correctly designed) Unicode fonts. This bug affects the rendering of text written in...
12 KB (1,322 words) - 13:55, 4 July 2025
Byte order mark (redirect from Unicode Byte-Order Mark)
The byte-order mark (BOM) is a particular usage of the special Unicode character code, U+FEFF ZERO WIDTH NO-BREAK SPACE, whose appearance as a magic number...
15 KB (1,910 words) - 21:50, 27 June 2025
uncommon Unicode characters. Without proper rendering support, you may see question marks, boxes, or other symbols. Unicode (also known as The Unicode Standard...
112 KB (11,593 words) - 22:02, 29 July 2025
Private Use Areas (redirect from Unicode Private Use Area)
In Unicode, a Private Use Area (PUA) is a range of code points that, by definition, will not be assigned characters by the standard. Three Private Use...
29 KB (3,120 words) - 13:30, 19 July 2025
CJK Unified Ideographs is a Unicode block containing the most common CJK ideographs used in modern Chinese, Japanese, Korean and Vietnamese characters...
58 KB (160 words) - 07:15, 21 December 2024
Unicode equivalence is the specification by the Unicode character encoding standard that some sequences of code points represent essentially the same...
16 KB (1,913 words) - 08:57, 16 April 2025
Bush hid the facts (redirect from George bush notepad bug)
the bug.[citation needed] UTF-8 without the byte order mark would still trigger the bug, as it is identical to the "ANSI" file. Saving as "Unicode", which...
6 KB (642 words) - 20:42, 26 June 2025
UTF-8 (redirect from Unicode (UTF-8))
used for electronic communication. Defined by the Unicode Standard, the name is derived from Unicode Transformation Format – 8-bit. As of July 2025, almost...
50 KB (5,055 words) - 00:36, 6 August 2025
multilingual text represented with the Unicode universal character set. Key to the relationship between Unicode and HTML is the relationship between the...
22 KB (2,590 words) - 21:13, 10 October 2024
29 November 2014. Mozilla.org: Bug 343129 – Big5-HKSCS 2004 <==> Unicode Table Update Bug 162431 – add non-BMP Unicode (plane 1 and above. surrogate)...
23 KB (2,512 words) - 15:27, 18 May 2025
Zalgo text (category Software bugs)
digital text that has been modified with numerous combining characters, Unicode symbols used to add diacritics above or below letters, to appear frightening...
11 KB (963 words) - 02:30, 14 July 2025
The Japanese calendar era bug is a possible computer bug related to the change of the Japanese era name. The Japanese calendar has era names that change...
5 KB (493 words) - 23:41, 23 July 2024
International Components for Unicode (ICU) is an open-source project of mature C/C++ and Java libraries for Unicode support, software internationalization...
18 KB (1,363 words) - 14:44, 21 April 2024
This article compares Unicode encodings in two types of environments: 8-bit clean environments, and environments that forbid the use of byte values with...
18 KB (2,272 words) - 19:49, 6 April 2025
own. In May 2015, iPhone users discovered a bug where sending a certain sequence of characters and Unicode symbols as a text to another iPhone user would...
41 KB (4,448 words) - 12:14, 31 March 2025
Telugu script (section iOS character crash bug)
etc. Telugu script was added to the Unicode Standard in October, 1991 with the release of version 1.0. The Unicode block for Telugu is U+0C00–U+0C7F: In...
49 KB (1,488 words) - 18:19, 24 July 2025
platforms, produced for JavaSoft by Symantec Internationalization and Unicode support originating from Taligent The release on December 8, 1998 and subsequent...
205 KB (11,285 words) - 20:43, 21 July 2025
Bengali Unicode block contains characters for the Bengali, Assamese, Bishnupriya Manipuri, Daphla, Garo, Hallam, Khasi, Mizo, Munda, Naga, Riang, and...
17 KB (119 words) - 02:05, 26 July 2024
ASCII art (redirect from Unicode art)
if a significant subset of Unicode is desired. (Modern UNIX-style operating systems do provide complete fixed-width Unicode fonts, e.g. for xterm. Windows...
52 KB (5,394 words) - 14:20, 31 July 2025
software Unicode character map program, being one of the GNOME Core Applications. This program allows characters to be displayed by Unicode block or script...
3 KB (317 words) - 04:05, 31 July 2025
UTF-16 (category Unicode Transformation Formats)
UTF-16 (16-bit Unicode Transformation Format) is a character encoding that supports all 1,112,064 valid code points of Unicode. The encoding is variable-length...
36 KB (4,121 words) - 22:15, 25 June 2025
is a Unicode block containing characters for writing the Khmer (Cambodian) language. For details of the characters, see Khmer alphabet – Unicode. The...
14 KB (63 words) - 23:41, 28 June 2025
GB 18030 (category Unicode Transformation Formats)
GB 18030-2022" (PDF). www.unicode.org. Retrieved 2024-02-12. "[JDK-8301119] Support for GB18030-2022 - Java Bug System". bugs.openjdk.org. Retrieved 2023-08-14...
45 KB (3,241 words) - 05:24, 1 August 2025
Issues relating to iOS (section "30% battery bug")
crash bugs encountered when sending photos or certain Unicode characters via text messages sent through the Messages application, and general bugs and security...
74 KB (7,299 words) - 18:07, 28 July 2025
Liberation fonts (category Free software Unicode typefaces)
GNU FreeFont, derived from Nimbus, but with a better Unicode support. Other Open-source Unicode typefaces Liberation Mono is styled closer to Liberation...
13 KB (1,181 words) - 07:03, 17 April 2025
C0 and C1 control codes (section Unicode)
cp037_IBMUSCanada to Unicode table. Microsoft/Unicode Consortium. "23.1: Control Codes" (PDF). The Unicode Standard (15.0.0 ed.). Unicode Consortium. 2022...
41 KB (3,046 words) - 03:11, 18 July 2025
Myanmar is a Unicode block containing characters for the Burmese, Mon, Shan, Palaung, and the Karen languages of Myanmar, as well as the Aiton and Phake...
30 KB (348 words) - 23:47, 28 June 2025
SpringBoard (redirect from Effective power bug)
with tweaks like FrontPage. A bug was discovered in May 2015 where users pasted a certain set of characters and Unicode in a set order, causing the SpringBoard...
23 KB (3,095 words) - 05:58, 6 August 2025
unpredictable. Unicode property support may be incomplete (products are continuously updated!). All will be incomplete when a new Unicode revision is released...
34 KB (641 words) - 07:52, 29 April 2025
quantifiers are ungreedy (lazy) by default, while ? makes them greedy. Unicode defines several properties for each character. Patterns in PCRE2 can match...
26 KB (2,516 words) - 14:15, 6 July 2025