1. National standard code: the "Chinese Coded Character Set for Information Interchange, Basic Set" published by the People's Republic of China in 1981, with the code name "GB2312-80". Each character is encoded as two consecutive bytes.
2. Internal code (built-in code): the code that represents a Chinese character inside the computer. It evolved from the national standard code: set the highest bit of each of the two bytes of the national standard code to 1, and the result is the internal code of the Chinese character.
3. Input code: the code typed directly at the keyboard under a particular Chinese character input method. It is also called the external code.
4. Font dot-matrix code: Chinese glyphs are constructed as dot matrices and stored in the computer to form a Chinese font library, which is used to display and print Chinese characters.
Examples of transcoding
Chinese character   Area-position code   Hexadecimal area-position code   National standard code   Internal code
文 (Wen)            4636                 2E24H                            4E44H                    CEC4H
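The area-position code of 文 (Wen) is 4636. Writing the area code (46) and the bit code (36) each in hexadecimal gives "2E24H", which is 0010 1110 0010 0100 in binary.
Adding 20H to each byte converts it to the national standard code "4E44H", which is 0100 1110 0100 0100 in binary.
Setting the highest bit of each byte to 1 gives the internal code "CEC4H", which is 1100 1110 1100 0100 in binary.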
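The three steps above can be sketched in a few lines of Python. This is only an illustration: the function name `transcode` and the dictionary keys are made up for this example and do not come from any standard.

```python
def transcode(area_position: str) -> dict:
    """Derive the other codes from a 4-digit decimal area-position code."""
    area, position = int(area_position[:2]), int(area_position[2:])
    # Step 1: write each decimal half as one hexadecimal byte.
    hex_form = (area << 8) | position
    # Step 2: add 20H to each byte to get the national standard code.
    national = hex_form + 0x2020
    # Step 3: add 80H to each byte (set its highest bit) for the internal code.
    internal = national + 0x8080
    return {
        "hex area-position": f"{hex_form:04X}H",
        "national standard": f"{national:04X}H",
        "internal": f"{internal:04X}H",
        "internal (binary)": f"{internal >> 8:08b} {internal & 0xFF:08b}",
    }


row = transcode("4636")
print(row["hex area-position"], row["national standard"], row["internal"])
# prints: 2E24H 4E44H CEC4H
```

Plain addition is enough here because the area code and bit code never exceed 94 (5EH), so adding 20H and then 80H to a byte cannot carry into the next byte.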
Answer time: 2006-9-12 11:50:48
The computer recognizes only codes made of 0s and 1s. ASCII is the standard code for processing English information, and Chinese character processing likewise needs a unified standard code. In May 1981 the National Bureau of Standards of China issued the "Chinese Coded Character Set for Information Interchange, Basic Set", with the code name GB2312-80. It encodes a total of 6763 Chinese characters and 682 graphic characters. The encoding principle is that each Chinese character is represented by two bytes, and each byte uses a seven-bit code (the high bit is 0). The standard arranges the Chinese characters and graphic symbols in a two-dimensional code table with 94 rows and 94 columns. Each of the two bytes is written as a two-digit decimal number: the first is called the area code, the second is called the bit code, and together they form the area-position code. For example, the character "Bao" (保) is at position 3 of area 17 in the code table, so its area-position code is "1703".
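As a rough sketch of the table layout just described, the snippet below splits an area-position code into its two halves and checks that both fall inside the 94 x 94 grid. The helper name `split_area_position` is hypothetical and is used only for this illustration.

```python
ROWS = COLS = 94  # the code table has 94 areas of 94 positions each


def split_area_position(code: str) -> tuple:
    """Split a 4-digit area-position code into (area, position)."""
    if len(code) != 4 or not code.isdecimal():
        raise ValueError("expected four decimal digits, e.g. '1703'")
    area, position = int(code[:2]), int(code[2:])
    if not (1 <= area <= ROWS and 1 <= position <= COLS):
        raise ValueError("area and position must each be in 1..94")
    return area, position


capacity = ROWS * COLS   # 8836 cells in the table
encoded = 6763 + 682     # 7445 characters and symbols, as stated above

print(split_area_position("1703"))  # prints: (17, 3)
print(encoded <= capacity)          # prints: True
```

The 7445 encoded characters and symbols fit in the 8836 cells of the table with room to spare.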
The national standard code is not equal to the area-position code; it is obtained from it by a simple conversion. First, convert the decimal area code and the decimal bit code to hexadecimal separately. The result differs from the national standard code by a fixed offset: adding 20H to both the first byte and the second byte gives the national standard code. For example, the national standard code of the character "Bao" (保) is 3123H, obtained as follows: 1703D -> 1103H -> add 20H to each byte -> 3123H.
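Here is a minimal sketch of this conversion, together with the reverse direction (subtracting 20H from each byte), which is handy for checking a result. Both function names are invented for this example.

```python
def to_national(area_position: str) -> int:
    """Area-position code (4 decimal digits) -> national standard code."""
    area, position = int(area_position[:2]), int(area_position[2:])
    # Each half becomes one byte, then 20H is added to each byte.
    return ((area + 0x20) << 8) | (position + 0x20)


def to_area_position(national: int) -> str:
    """National standard code -> area-position code (4 decimal digits)."""
    high, low = national >> 8, national & 0xFF
    # Undo the offset: subtract 20H from each byte, then print in decimal.
    return f"{high - 0x20:02d}{low - 0x20:02d}"


print(f"{to_national('1703'):04X}H")  # prints: 3123H
print(to_area_position(0x3123))       # prints: 1703
```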
The national standard code is the standard code for exchanging Chinese character information, but it conflicts with ASCII because the highest bit of both its bytes is 0. For example, the two bytes of the national standard code of "Bao" (保) are 31H and 23H, and the ASCII codes of the Western characters "1" and "#" are also 31H and 23H. If memory holds the two bytes 31H and 23H, is that the Chinese character "Bao" or the two Western characters "1" and "#"? The bytes are ambiguous, so the national standard code cannot be used directly inside the computer. The internal code of Chinese characters is therefore a modified national standard code: add 128 (80H) to each byte of the national standard code, which changes the highest bit of both bytes from 0 to 1 and leaves the other seven bits unchanged. For example, the national standard code of "Bao" is 3123H: the first byte is 00110001B and the second byte is 00100011B. Setting the high bits gives 10110001B and 10100011B, that is, B1H and A3H, so the internal code of the character is B1A3H.
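The ambiguity and its fix can be tried out in Python. This sketch assumes that Python's built-in "gb2312" codec reads the two-byte internal code described here, which holds for the character used in this example; the function name `to_internal` is made up for the illustration.

```python
def to_internal(national: int) -> int:
    """Set the highest bit of both bytes of a national standard code."""
    return national | 0x8080


# The raw national standard code bytes are indistinguishable from ASCII text.
ambiguous = bytes([0x31, 0x23])
print(ambiguous.decode("ascii"))  # prints: 1#

# With both high bits set, the bytes can no longer be mistaken for ASCII.
internal = to_internal(0x3123)
print(f"{internal:04X}H")         # prints: B1A3H

# Assumption: Python's "gb2312" codec decodes this internal code.
print(internal.to_bytes(2, "big").decode("gb2312"))  # prints: 保
```

Because every ASCII byte has its highest bit equal to 0, a program can tell the two kinds of bytes apart simply by testing that bit.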