UniFM/res/enc
2018-11-14 19:08:56 +02:00
cp437.ini Initial 2018-11-14 19:08:56 +02:00
cp866.ini Initial 2018-11-14 19:08:56 +02:00
cp1251.ini Initial 2018-11-14 19:08:56 +02:00
cp1252.ini Initial 2018-11-14 19:08:56 +02:00
enc.ini Initial 2018-11-14 19:08:56 +02:00
koi8r.ini Initial 2018-11-14 19:08:56 +02:00
ReadMe.txt Initial 2018-11-14 19:08:56 +02:00

Encodings, or code pages (CP), are now supported as text files in addition to binary files. Only the text format is described here; the binary format is considered obsolete.

A typical code page consists of 128 lines, one for each byte from 0x80 to 0xFF. Only these bytes vary in meaning from CP to CP; the meaning of the lower half of the code page (bytes 0x00 to 0x7F) is constant.
Each line maps a byte value to a Unicode character value, like this:

80	=	20AC	;	€

Here 80 is the byte value and 20AC is the Unicode character value. The rest (the semicolon and the actual symbol) is an optional comment, included only to make the code page easier for a human to read. All numbers are hexadecimal.
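
The line format described above could be parsed roughly as follows. This is a minimal Python sketch, not UniFM's own code: the names parse_cp_line and build_table are hypothetical, and two behaviours are assumptions because the description does not cover them: blank or comment-only lines are skipped, and bytes in 0x80..0xFF that have no line are left unmapped (None).

```python
def parse_cp_line(line):
    """Parse one mapping line such as '80 = 20AC ; comment'.

    Returns (byte_value, code_point), or None for a blank or
    comment-only line (skipping those is an assumption).
    """
    # Everything after the first semicolon is an optional comment.
    body = line.split(";", 1)[0].strip()
    if not body:
        return None
    left, sep, right = body.partition("=")
    if not sep:
        raise ValueError("missing '=' in line: %r" % line)
    # All numbers in a code page file are hexadecimal.
    byte_value = int(left.strip(), 16)
    code_point = int(right.strip(), 16)
    if not 0x80 <= byte_value <= 0xFF:
        raise ValueError("byte value outside 80..FF: %r" % line)
    return byte_value, code_point


def build_table(lines):
    """Build a 256-entry list mapping byte value -> code point."""
    # Bytes 0x00..0x7F mean the same in every code page, so they map
    # to themselves; the upper half starts unmapped (assumption).
    table = list(range(0x80)) + [None] * 0x80
    for line in lines:
        parsed = parse_cp_line(line)
        if parsed is not None:
            byte_value, code_point = parsed
            table[byte_value] = code_point
    return table
```

With this sketch, build_table(["80 = 20AC"])[0x80] yields 0x20AC, while entries below 0x80 stay equal to their own index.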

Every code page file must be saved either in Unicode with a BOM, or in the one-byte encoding marked as DEFAULT in enc.ini (see that file).
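
The encoding rule above might be applied when reading a code page file along these lines. Again a Python sketch under stated assumptions: the name decode_cp_file and its default_encoding parameter are hypothetical, the set of BOMs that UniFM actually recognises is not stated here (UTF-8, UTF-16 and UTF-32 are checked as a guess), and the caller is expected to pass the encoding that enc.ini marks as DEFAULT.

```python
import codecs

# BOMs to look for (an assumed set). UTF-32 is tested before UTF-16
# because the UTF-32 LE BOM begins with the UTF-16 LE BOM bytes.
_BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32-le"),
    (codecs.BOM_UTF32_BE, "utf-32-be"),
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
]


def decode_cp_file(data, default_encoding):
    """Decode the raw bytes of a code page file to text.

    data: the file content as bytes.
    default_encoding: Python codec name of the one-byte encoding
    marked DEFAULT in enc.ini (hypothetical parameter).
    """
    for bom, encoding in _BOMS:
        if data.startswith(bom):
            # Strip the BOM and decode the rest as Unicode.
            return data[len(bom):].decode(encoding)
    # No BOM found: fall back to the default one-byte encoding.
    return data.decode(default_encoding)
```

The decoded text can then be split into lines and handled one mapping line at a time.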