Showing changes from revision #3 to #4:
Added | Removed | Changed
If a font encoding is UTF-8, then rather than taking straight 8 and 16 bit strings, strings are decoded according to UTF-8 or UTF-16.
For example, the byte sequence C2 A3 in UTF-8 represents character &A3. The halfword sequence D800 DC00 in UTF-16 represents character &10000.
32-bit strings are treated normally (hence the strings are treated as UCS-4).