Internal characteristics, this character arrangement order is called "coding".
Font coding is the basis of font organization and word processing. Different countries and regions have different series.
Code standards, commonly used codes related to Chinese fonts are: single-byte code, GB23 12-80, GB 12345-90, GBK and U.
Nicode coding, ISO 10646/Unicode character set, GB 18030-2000, BIG5 coding, here is a brief introduction.
Next:
Single byte coding
Microsoft Windows System: Windows Latin 1(ANSI)
MS-DOS:MS-DOS Latin
Macintosh: Macintosh Rome
GB23 12-80
The full name of GB23 12-80, Basic Character Set of Chinese Characters for Information Interchange, with publication number 1980, is a Chinese letter.
In Chinese mainland and overseas areas where simplified Chinese is used (such as Singapore), national standards for information processing are mandatory.
A China code. P-Windows3.2 and Apple OS are based on GB23 12, while Windows 95/98 is.
GBK is the basic Chinese character coding, but it is compatible with GB23 12.
Double byte coding
Scope: FEFE. FeFe
A 1-A9: symbol area, containing 682 symbols.
B0-F7: Chinese character area, containing 6763 Chinese characters.
GB code * * * contains 6763 simplified characters and 682 symbols, of which 3755 are Chinese characters, arranged according to pinyin.
Order, secondary word 3008, sorted by radical. The formulation and application of this standard has played an important role in standardizing and promoting the informatization process in China.
The effect is great.
GB 12345-90
1990, coding standard of traditional Chinese characters GB 12345-90 "First Auxiliary Set of Chinese Characters for Information Interchange"
Set ",the purpose is to standardize the use of traditional Chinese characters in various occasions, as well as the collation of ancient books. The standard * * * includes 6866.
A Chinese character (more than GB23 12 103 characters, and most fonts from other manufacturers do not contain these characters) probably has pure traditional Chinese characters.
More than 2200.
Double byte coding
Scope: FEFE. FeFe
A 1-A9: symbol area, adding vertical symbols.
B0-F9: Chinese character area, containing 6866 Chinese characters.
Unicode encoding (universal multi-coded character set)
ISO/IEC JTC 1/SC2/WG2 working group was established in April 1984, aiming at national characters and symbols,
Unified coding. 199 1 year American multinational companies set up Unicode Consortium, 199 1 year cooperated with WG2.
To reach an agreement, use the same set of coded words. At present, Unicode adopts 16 bit coding system, and the content of character set is the same as ISO 1.
The same is true for BMP (Basic Multilingual Plane) of 0646. Unicode passed DIS(Draf) 1992 in June.
International standard), the current version V2.0 is published in 1996, including 68 1 1 symbols and 209 Chinese characters.
02, Korean Pinyin 1 1 172, word-making areas 6400, 20249 reserved, * * * 6534.
ISO 10646/Unicode character set
Coded character set that can be enjoyed all over the world.
UCS-4: octet of octet in octet plane and octet in octet row
The 00 plane in UCS-2: 00 group is the Basic Multilingual Plane (BMP), 4E00 ~ 9FFF Chinese, Japanese and Korean characters.
Ext a (cjk): 3400 ~ 4db7, ***6584 words.
Extension B (CJK): 42,807 Chinese characters, second plane 0 100~A836.
GBK (Chinese internal code specification)
GBK coding is a new extended national standard for Chinese coding formulated by Chinese mainland, which is equivalent to UCS. GBK working group
The GBK specification was completed in 1995 10, and in the same year 12. The coding standard is compatible with GB23 12, and * * contains 2 1003 Chinese characters.
There are 883 symbols, providing 1894 word-making code points. Simplified and traditional Chinese characters are integrated into a library.
GBK is used for surface coding of fonts in simplified Chinese version of Windows95/98, which corresponds to UCS one by one.
The code table contacts the bottom font.
English name: Chinese internal code specification
Chinese name: 1.0 version of Chinese character internal code extension specification
Double-byte coding is an extension of GB23 12-80, which is compatible with GB23 12-80 in code position.
Range: 8 140~FEFE (excluding X27F) * * * 23940 code.
It contains 2 1003 Chinese characters, including all Chinese characters of ISO/IEC 10646- 1.
GB 18030-2000
English name: Chinese internal code specification
Chinese name: Chinese coded character set for information exchange in information technology
Extension of basic set (released and implemented in March 2000-17)
Single-byte, double-byte and four-byte coding
Backwards compatibility national standard GB 23 12 "Information Processing Interchange Code" corresponds to the de facto internal code standard.
Lexically, all Chinese, Japanese and Korean (CJK) unified Chinese characters and all CJK unified Chinese character extensions of GB 13000. 1 are supported.
Characters filled with-.
BIG5 coding
It is a coding standard of traditional Chinese characters widely used in Taiwan Province Province and Hongkong, including 440 symbols, level one.
There are 540 1 Chinese characters, 7652 secondary Chinese characters and * * * 13060 Chinese characters.