Internal Code
   HOME

TheInfoList



OR:

In
China China, officially the People's Republic of China (PRC), is a country in East Asia. It is the world's most populous country, with a population exceeding 1.4 billion, slightly ahead of India. China spans the equivalent of five time zones and ...
, neima (內碼, 内码;
pinyin Hanyu Pinyin (), often shortened to just pinyin, is the official romanization system for Standard Mandarin Chinese in China, and to some extent, in Singapore and Malaysia. It is often used to teach Mandarin, normally written in Chinese for ...
: nèimă;
jyutping Jyutping is a romanisation system for Cantonese developed by the Linguistic Society of Hong Kong (LSHK), an academic group, in 1993. Its formal name is the Linguistic Society of Hong Kong Cantonese Romanization Scheme. The LSHK advocates for ...
: noi6 maa5, literally internal code) is the encoding of a
character Character or Characters may refer to: Arts, entertainment, and media Literature * ''Character'' (novel), a 1936 Dutch novel by Ferdinand Bordewijk * ''Characters'' (Theophrastus), a classical Greek set of character sketches attributed to The ...
in some
character set Character encoding is the process of assigning numbers to graphical characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using digital computers. The numerical values that ...
, or to the
character encoding Character encoding is the process of assigning numbers to Graphics, graphical character (computing), characters, especially the written characters of Language, human language, allowing them to be Data storage, stored, Data communication, transmi ...
being used. It is not an encoding in itself, and the actual encoding being referred to has to be determined by context. On any computer system, the internal code is the native encoding being used. For example, in a Big5-based system (e.g.,
Microsoft Windows Windows is a group of several proprietary graphical operating system families developed and marketed by Microsoft. Each family caters to a certain sector of the computing industry. For example, Windows NT for consumers, Windows Server for serv ...
3.1 localized for traditional Chinese), the internal code is
Big5 Big-5 or Big5 is a Chinese character encoding method used in Taiwan, Hong Kong, and Macau for traditional Chinese characters. The People's Republic of China (PRC), which uses simplified Chinese characters, uses the GB 18030 character set inst ...
; similarly, in a GB-based system (e.g.,
DOS DOS is shorthand for the MS-DOS and IBM PC DOS family of operating systems. DOS may also refer to: Computing * Data over signalling (DoS), multiplexing data onto a signalling channel * Denial-of-service attack (DoS), an attack on a communicat ...
running CCDOS), the internal code would be . On early computers, the
Chinese language card A Chinese language card or Chinese character card is a computer expansion card that improves the ability of computers to process Chinese text. Early computers were limited in processing speed and storage capacity. If a software like CC-DOS or :zh ...
takes the internal codes and renders the corresponding Chinese characters on the screen. On many modern operating systems (such as all modern Microsoft Windows systems), the internal code is a form of
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology Technical standard, standard for the consistent character encoding, encoding, representation, and handling of Character (computing), text expre ...
. Within a particular encoding, the internal code of a certain character simply means the value of the
code point In character encoding terminology, a code point, codepoint or code position is a numerical value that maps to a specific character. Code points usually represent a single grapheme—usually a letter, digit, punctuation mark, or whitespace—but ...
used to represent that character. For example, in the Big5 encoding, the character "一" (Chinese, lit. ''one'') has the internal code of A440 (
hexadecimal In mathematics and computing, the hexadecimal (also base-16 or simply hex) numeral system is a positional numeral system that represents numbers using a radix (base) of 16. Unlike the decimal system representing numbers using 10 symbols, hexa ...
); in the GB encoding, the same character has the internal code of D2BB.


The "neima method"

{{For, more examples when the internal code is Unicode, Unicode#Input methods The internal code can be used as an
input method An input method (or input method editor, commonly abbreviated IME) is an operating system component or program that enables users to generate characters not natively available on their input devices by using sequences of characters (or mouse o ...
for inputting
Han characters Chinese characters () are logograms developed for the writing of Chinese. In addition, they have been adapted to write other East Asian languages, and remain a key component of the Japanese writing system where they are known as ''kanji' ...
. This input method is usually called 內碼 in Chinese and is usually provided for Big5 and GB internal codes; in English, it may be variously called "neima", "internal code", "raw code", or other similar names. For example, in a Big5-based system, one can input the character "一" by typing "A440" using the "internal code" input method. On the other hand, in a GB-based system, to input the character one would itype "D2BB". NeiMa expects the user to input the desired character by providing its value within the user-chosen
character set Character encoding is the process of assigning numbers to graphical characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using digital computers. The numerical values that ...
. For example, to input the Chinese character "不" (
Pinyin Hanyu Pinyin (), often shortened to just pinyin, is the official romanization system for Standard Mandarin Chinese in China, and to some extent, in Singapore and Malaysia. It is often used to teach Mandarin, normally written in Chinese for ...
"bù" – English "not") one can start the Neima editor, switch to
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology Technical standard, standard for the consistent character encoding, encoding, representation, and handling of Character (computing), text expre ...
character encoding mode and then type "bù"'s
hexadecimal In mathematics and computing, the hexadecimal (also base-16 or simply hex) numeral system is a positional numeral system that represents numbers using a radix (base) of 16. Unlike the decimal system representing numbers using 10 symbols, hexa ...
value within Unicode's table, which is 4E0D. NeiMa is a very awkward way of typing in characters, as a user would need to know the code points of all needed characters. More generally speaking, NeiMa in Unicode mode accepts any code point within the Unicode table, so users are not limited to inputting Chinese characters, but also any other character that can be found within the Unicode table. For example, Latin Capital Letter A, ''A'', may be inputted with NeiMa using A's Unicode code point, which is 0041. On a Unicode-based system, one might be able to input a character by typing its Unicode number in hexadecimal. Such an input method might also be called "neima", or it might be called "Unicode". On a JIS-based system (Japanese), there might be a ''kuten'' input method that allows characters to be input using a form of the internal code called the "kuten form"; this kind of input method is called ''quwei'' on GB-based systems (Chinese). Although the kuten (quwei) form is related to the internal code, such input methods are not usually referred as "internal code" input methods. The use of an "internal code" input method to input characters would not be normally very practical. However, it is useful for inputting special symbols that may otherwise be impossible to input using other input methods. However, the user would need to have a table of characters with their internal codes.


See also

*
Alt code On personal computers with numeric keypads that use Microsoft operating systems, such as Windows, many characters that do not have a dedicated key combination on the keyboard may nevertheless be entered using the Alt code (the Alt numpad input me ...


References

Character encoding Han character input