VNI Character Set

VNI Software Company is a developer of various education, entertainment, office, and utility software packages. It is known for developing an encoding (VNI encoding) and a popular input method (VNI Input) for Vietnamese on computers. VNI is often available on computer systems for typing Vietnamese, alongside the TELEX input method. Most commonly, VNI is used on computers with physical keyboards, whilst TELEX is more common on phones and touchscreens.


History

VNI is a family-owned company based in Westminster, California. It was founded in 1987 by Hồ Thành Việt to develop software that eases Vietnamese language use on computers. Among its products were the VNI Encoding and the VNI Input Method. The VNI Input Method has since become one of the two most popular input methods for Vietnamese, alongside TELEX. TELEX is better suited to phones and touchscreens, whilst VNI has found more use on computers with physical keyboards.


VNI vs. Microsoft

In the 1990s, Microsoft recognized the potential of VNI's products and incorporated the VNI Input Method into Windows 95 Vietnamese Edition and MSDN, in use worldwide. Following Microsoft's unauthorized use of these technologies, VNI took Microsoft to court over the matter. Microsoft settled the case out of court, withdrew the input method from its entire product line, and developed its own input method. Although virtually unknown, Microsoft's own input method has appeared in every Windows release since Windows 98. Starting with Windows 10 version 1903, the VNI Input Method (as "Vietnamese Number Key-based") and the Telex input method are natively supported.


Unicode

Despite the growing popularity of Unicode in computing, the VNI Encoding (see below) is still in wide use by Vietnamese speakers both in Vietnam and abroad. All professional printing facilities in the Little Saigon neighborhood of Orange County, California continue to use the VNI Encoding when processing Vietnamese text. For this reason, print jobs submitted using the VNI Character Set are compatible with local printers.


Input methods

VNI invented, popularized, and commercialized an input method and an encoding, the VNI Character Set, to assist computer users entering Vietnamese on their computers. The user can type using only ASCII characters found on standard computer keyboard layouts. Because the Vietnamese alphabet uses a complex system of diacritics for tones and for additional letters, a keyboard with a dedicated key for every character would need 133 alphanumeric keys and a Shift key to cover all possible characters.


VNI Input Method

Originally, VNI's input method utilized function keys (F1, F2, ...) to enter the tone marks, which later turned out to be problematic, as the operating system used those keys for other purposes. VNI then turned to the numerical keys along the top of the keyboard (as opposed to the numpad) for entering tone marks. This arrangement survives today, but users also have the option of customizing the keys used for tone marks. With VNI Tan Ky mode on, the user can type in diacritical marks anywhere within a word, and the marks will appear at their proper locations. For example, the word ''trường'', which means 'school', can be typed in the following ways:
* truong-7-2 (most conventional way)
* 72truong
* t72ruong
* tr72uong
* tru7o72ng
* truo72ng
* truo7ng2
The first way is the conventional method, following handwriting and spelling convention, where the base is written first (''truong'') and then the tonal marks are added one by one.
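The digit-key principle can be sketched in a few lines of Python. This is only a minimal illustration, not VNI's actual implementation. It assumes the commonly documented VNI key assignments (1 to 5 for the five tone marks, 6 for the circumflex, 7 for the horn, 8 for the breve, 9 for đ), which this article does not itself list, and it handles only the simplest case, where each digit is typed directly after the letter it modifies, as in tru7o72ng. The function name `vni_adjacent` is hypothetical.

```python
import unicodedata

# Assumed digit keys of the usual VNI layout, mapped to Unicode combining marks.
VNI_MARKS = {
    "1": "\u0301",  # sắc (acute)
    "2": "\u0300",  # huyền (grave)
    "3": "\u0309",  # hỏi (hook above)
    "4": "\u0303",  # ngã (tilde)
    "5": "\u0323",  # nặng (dot below)
    "6": "\u0302",  # circumflex, as in â ê ô
    "7": "\u031b",  # horn, as in ơ ư
    "8": "\u0306",  # breve, as in ă
}


def vni_adjacent(keys: str) -> str:
    """Convert keystrokes in which every digit directly follows its letter."""
    out = []
    for ch in keys:
        prev = out[-1] if out else ""
        follows_letter = bool(prev) and (
            prev.isalpha() or unicodedata.combining(prev) != 0
        )
        if ch in VNI_MARKS and follows_letter:
            out.append(VNI_MARKS[ch])  # stack a mark on the preceding letter
        elif ch == "9" and prev in ("d", "D"):
            out[-1] = "đ" if prev == "d" else "Đ"
        else:
            out.append(ch)  # ordinary character, kept as typed
    # NFC merges each letter and its marks into one precomposed character.
    return unicodedata.normalize("NFC", "".join(out))
```

Under these assumptions, `vni_adjacent("tru7o72ng")` yields ''trường''. A digit that does not follow a letter is left unchanged, so a number such as 1987 typed on its own passes through as typed.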


VNI Tan Ky

With the release of VNI Tan Ky 4 in the 1990s, VNI freed users from having to remember where to correctly insert tone marks within a word, because, as long as the user enters all the required characters and tone marks, the software will group them correctly. This feature is especially useful for newcomers to the language.
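The regrouping idea can be approximated with a short Python sketch. It is not VNI Tan Ky's algorithm, which this article does not describe. The sketch assumes the commonly documented VNI digit assignments (1 to 5 for tones, 6 circumflex, 7 horn, 8 breve, 9 for đ), works on a single lowercase syllable, and uses a deliberately simplified placement rule: the tone goes on a vowel that already carries a circumflex, horn, or breve if there is one, otherwise on the last vowel when a consonant follows it, otherwise on the second-to-last vowel. Special cases such as ''qu'', ''gi'', and ''uơ'' are ignored, and the function name `vni_regroup` is hypothetical.

```python
import unicodedata

# Assumed VNI digit keys: tone marks, and shape marks with the letters they fit.
TONES = {"1": "\u0301", "2": "\u0300", "3": "\u0309", "4": "\u0303", "5": "\u0323"}
SHAPES = {"6": ("\u0302", "aeo"), "7": ("\u031b", "uo"), "8": ("\u0306", "a")}
VOWELS = "aeiouy"


def vni_regroup(keys: str) -> str:
    """Place VNI digit keys typed anywhere inside one lowercase syllable."""
    letters = [c for c in keys if not c.isdigit()]
    digits = [c for c in keys if c.isdigit()]
    marks = [""] * len(letters)  # combining marks collected for each letter
    word = "".join(letters)

    for d in digits:
        if d == "9" and "d" in letters:
            letters[letters.index("d")] = "đ"
        elif d in SHAPES:
            mark, targets = SHAPES[d]
            pair = word.find("uo") if d == "7" else -1
            if pair >= 0:
                hits = [pair, pair + 1]  # "uo" takes the horn on both letters
            else:
                hits = [i for i, c in enumerate(letters) if c in targets][:1]
            for i in hits:
                if mark not in marks[i]:  # ignore a repeated shape key
                    marks[i] += mark

    # Only the last tone key typed is used.
    tone = next((TONES[d] for d in reversed(digits) if d in TONES), "")
    vowels = [i for i, c in enumerate(letters) if c in VOWELS]
    if tone and vowels:
        shaped = [i for i in vowels if marks[i]]
        if shaped:
            target = shaped[-1]  # a vowel that already has a shape mark
        elif len(vowels) == 1 or vowels[-1] < len(letters) - 1:
            target = vowels[-1]  # single vowel, or cluster closed by a consonant
        else:
            target = vowels[-2]  # open syllable with several vowels
        marks[target] += tone

    combined = "".join(l + m for l, m in zip(letters, marks))
    return unicodedata.normalize("NFC", combined)
```

With this rule, the key sequences 72truong, t72ruong, tr72uong, tru7o72ng, truo72ng, and truo7ng2 all produce ''trường'', because only the set of letters and digits matters, not where the digits were typed.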


VNI Auto Accent

VNI Auto Accent is the company's most recent software release (2006), with the purpose of alleviating repetitive strain injury (RSI) caused by prolonged use of computer keyboards. Auto Accent helps reduce the number of keystrokes needed to type each word by automatically adding diacritical marks for the user. The user must still enter every base letter in the word.


Character encodings


VNI Encoding (Windows/Unix)

The VNI Encoding uses up to two bytes to represent one Vietnamese vowel character, with the second byte supplying additional diacritical marks. This removes the need to replace control characters with Vietnamese characters, a problematic system found in TCVN1 (VSCII-1) and in VISCII, or to use two different fonts, one containing lowercase characters and the other uppercase characters, as is sometimes done for TCVN3 (VSCII-3). A similar approach is taken by Windows-1258 and VSCII-2. This solution is more portable between different versions of Windows and between different platforms. However, using multiple characters in a file to represent one written character increases the file size. The increased file size can usually be offset by compressing the data into a file format such as ZIP. The VNI encoding was used extensively in the south of Vietnam, and sometimes used overseas, while TCVN 5712 was dominant in the north.

Code points 0x00 through 0x7F follow ASCII.


VNI Encoding for Macintosh

A version intended for use on Macintosh systems uses a different arrangement (corresponding to the differences in arrangement between Windows-1252 and Mac OS Roman).


VNI Encoding for DOS

The VNI encoding for use on DOS does not use separate characters for diacritics, instead replacing certain ASCII punctuation characters with tone-marked uppercase letters (compare ISO 646).


VIQR and VNI-Internet Mail

The use of Vietnamese Quoted-Readable (VIQR), a convention for writing in Vietnamese using ASCII characters, began during the Vietnam War, when typewriters were the main tool for word processing. Because the U.S. military required a way to represent Vietnamese script accurately on official documents, VIQR was invented for the military. Due to its longstanding use, VIQR was a natural choice for computer word processing prior to the appearance of VNI, VPSKeys, VSCII, VISCII, and Unicode. It is still widely used for information exchange on computers, but is not desirable for design and layout, due to its cryptic appearance.

VIQR's main issue was the difficulty of reading VIQR text, especially for inexperienced computer users. VNI created and released a free font called ''VNI-Internet Mail'', which utilized a variant of the VIQR notation and VNI's combining character technique to give VIQR text a more natural appearance by replacing certain ASCII punctuation with combining characters. The following table compares VNI-Internet Mail to other codified VIQR or VIQR-like conventions.


See also

* Telex (input method)
* Vietnamese Quoted-Readable (VIQR)
* VISCII
* VPSKeys
* VNLabs
* Guide to inputting Vietnamese text at the Vietnamese Wikipedia
* Vietnamese language and computers




External links


* VNI Software Co.
* VietUni Converter

