X-Nico

2 unusual facts about character encoding


Character encoding

Early binary repertoires include Bacon's cipher, Braille, International maritime signal flags, and the 4-digit encoding of Chinese characters for a Chinese telegraph code (Hans Schjellerup, 1869).

Combinatorics on words

It is possible to encode a word, since a word is constructed by symbols, and encode the data by using a tree.


Code page

IBM introduced the concept of systematically assigning a small, but globally unique, 16 bit number to each character encoding that a computer system or collection of computer systems might encounter.

HP roman8

The HP roman 8 charset is a single byte character encoding that is mainly used on HP-UX and some Hewlett-Packard printers.


see also

Adobe InDesign

InDesign supports Unicode character encoding and there is a special Middle East version supporting complex text layout for Arabic and Hebrew types of complex script.

Code page

When, early in the history of personal computers, users didn't find their character encoding requirements met, private or local code pages were created using Terminate and Stay Resident utilities or by re-programming BIOS EPROMs.

Mensural notation

For quoting mensural notation symbols in inline text, a number of characters have been included in the character encoding standard Unicode, in the "musical symbols" block.

UTF-16

IBM iSeries systems designate code page CCSID 13488 for UCS-2 character encoding, CCSID 1200 for UTF-16 encoding, and CCSID 1208 for UTF-8 encoding.