Unihan
   HOME
*



picture info

Unihan
Han unification is an effort by the authors of Unicode and the Universal Character Set to map multiple character sets of the Han characters of the so-called CJK languages into a single set of unified characters. Han characters are a feature shared in common by written Chinese (hanzi), Japanese (kanji), Korean (hanja) and Vietnamese (chữ Hán). Modern Chinese, Japanese and Korean typefaces typically use regional or historical variants of a given Han character. In the formulation of Unicode, an attempt was made to unify these variants by considering them different glyphs representing the same "grapheme", or orthographic unit, hence, "Han unification", with the resulting character repertoire sometimes contracted to Unihan. Nevertheless, many characters have regional variants assigned to different code points, such as Traditional (U+500B) versus Simplified (U+4E2A). Unihan can also refer to the Unihan Database maintained by the Unicode Consortium, which provides informatio ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

CJK Unified Ideographs
The Chinese, Japanese and Korean (CJK) scripts share a common background, collectively known as CJK characters. In the process called Han unification, the common (shared) characters were identified and named CJK Unified Ideographs. As of Unicode 15.0, Unicode defines a total of 97,058 CJK Unified Ideographs. The term ''ideographs'' is a misnomer, as the Chinese script is not ideographic but rather logographic. Historically, Vietnam used Chinese characters too, so sometimes the abbreviation CJKV is used. Vietnamese use was replaced by the Latin-based Vietnamese alphabet in the 1920s. Sources The Ideographic Research Group (IRG) is responsible for developing extensions to the encoded repertoires of CJK unified ideographs. IRG processes proposals for new CJK unified ideographs submitted by its member bodies, and after undergoing several rounds of expert review, IRG submits a consolidated set of characters to ISO/IEC JTC 1/SC 2 Working Group 2 (WG2) and the Unicode Technical Commit ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




CEDICT
The CEDICT project was started by Paul Denisowski in 1997 and is maintained by a team on mdbg.net under the name CC-CEDICT, with the aim to provide a complete Chinese to English dictionary with pronunciation in pinyin for the Chinese characters. Content CEDICT is a text file; other programs (or simply Notepad or egrep or equivalent) are needed to search and display it. This project is considered a standard Chinese-English reference on the Internet and is used by several other Chinese-English projects. The Unihan Database uses CEDICT data for most of its information about character compounds, but this is auxiliary and is explicitly not a part of the main Unicode database. Features: * Traditional Chinese and Simplified Chinese * Pinyin (several pronunciations) * American English (several) * , it had 119,494 entries in UTF-8. The basic format of a CEDICT entry is: Traditional Simplified in1 yin1/American English equivalent 1/equivalent 2/ 漢字 汉字 an4 zi4/Chinese charact ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Han Characters
Chinese characters () are logograms developed for the writing of Chinese. In addition, they have been adapted to write other East Asian languages, and remain a key component of the Japanese writing system where they are known as ''kanji''. Chinese characters in South Korea, which are known as ''hanja'', retain significant use in Korean academia to study its documents, history, literature and records. Vietnam once used the ''chữ Hán'' and developed chữ Nôm to write Vietnamese before turning to a romanized alphabet. Chinese characters are the oldest continuously used system of writing in the world. By virtue of their widespread current use throughout East Asia and Southeast Asia, as well as their profound historic use throughout the Sinosphere, Chinese characters are among the most widely adopted writing systems in the world by number of users. The total number of Chinese characters ever to appear in a dictionary is in the tens of thousands, though most are graphic v ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Hanzi
Chinese characters () are logograms developed for the writing of Chinese. In addition, they have been adapted to write other East Asian languages, and remain a key component of the Japanese writing system where they are known as ''kanji''. Chinese characters in South Korea, which are known as ''hanja'', retain significant use in Korean academia to study its documents, history, literature and records. Vietnam once used the ''chữ Hán'' and developed chữ Nôm to write Vietnamese before turning to a romanized alphabet. Chinese characters are the oldest continuously used system of writing in the world. By virtue of their widespread current use throughout East Asia and Southeast Asia, as well as their profound historic use throughout the Sinosphere, Chinese characters are among the most widely adopted writing systems in the world by number of users. The total number of Chinese characters ever to appear in a dictionary is in the tens of thousands, though most are graphic v ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Source Han Sans Version Difference
Source may refer to: Research * Historical document * Historical source * Source (intelligence) or sub source, typically a confidential provider of non open-source intelligence * Source (journalism), a person, publication, publishing institute or other record or document that gives information * Source document, a document in which data collected for a clinical trial is first recorded * Source text, in research (especially in the humanities), a source of information referred to by citation ** Primary source, a first-hand written evidence of history made at the time of the event by someone who was present ** Secondary source, a written account of history based upon the evidence from primary sources ** Tertiary source, a compilation based upon primary and secondary sources * Sources (website), a directory of expert contacts and media spokespersons * Open source, a philosophy of dissemination of intellectual products Law * Sources of international law, the materials and processes ou ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Orthography
An orthography is a set of conventions for writing a language, including norms of spelling, hyphenation, capitalization, word breaks, emphasis, and punctuation. Most transnational languages in the modern period have a writing system, and most of these systems have undergone substantial standardization, thus exhibiting less dialect variation than the spoken language. These processes can fossilize pronunciation patterns that are no longer routinely observed in speech (e.g., "would" and "should"); they can also reflect deliberate efforts to introduce variability for the sake of national identity, as seen in Noah Webster's efforts to introduce easily noticeable differences between American and British spelling (e.g., "honor" and "honour"). Some nations (e.g. France and Spain) have established language academies in an attempt to regulate orthography officially. For most languages (including English) however, there are no such authorities and a sense of 'correct' orthography evol ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Sememe
__NOTOC__ A sememe () is a semantic language unit of meaning, analogous to a morpheme. The concept is relevant in structural semiotics. A seme is a proposed unit of transmitted or intended meaning; it is atomic or indivisible. A sememe can be the meaning expressed by a morpheme, such as the English pluralizing morpheme ''-s'', which carries the sememic feature plural Alternatively, a single sememe (for example oor ove can be conceived as the abstract representation of such verbs as ''skate, roll, jump, slide, turn'', or ''boogie''. It can be thought of as the semantic counterpart to any of the following: a meme in a culture, a gene in a genome, or an atom (or, more generally, an elementary particle) in a substance. A seme is the name for the smallest unit of meaning recognized in semantics, referring to a single characteristic of a sememe. There are five types of sememes: two denotational and three connotational, the latter occurring only in phrase units (they do not reflect t ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Japan Electronic Industries Development Association
The (Formerly ) was an industry research, development, and standards body for electronics in Japan. It was merged with EIAJ to form JEITA on November 1, 2000. JEIDA was similar to SEMATECH of the US, ECMA of Europe. JEIDA developed a number of standards, including the JEIDA memory card, and the Exif graphical file format. History in 1967, Ryoko Communications Association Co., Ltd. has first appeared in Shibuya, Tokyo, Japan. in 1989, Ryoko Communications Association Co., Ltd. was Re-branded into Japan Electronic Industries Development Association. in 2000, JEIDA became a Pending merger with EIAJ and was Reorganized into JEITA The is a Japanese trade organization for the electronics and IT industries. It was formed in 2000 from two earlier organizations, the Electronic Industries Association of Japan and the Japan Electronic Industries Development Association. Histor .... External links JEITA Press Releases: JEITA inaugurated today on November 1, 2000 Electronics indus ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Ideograms
An ideogram or ideograph (from Greek "idea" and "to write") is a graphic symbol that represents an idea or concept, independent of any particular language, and specific words or phrases. Some ideograms are comprehensible only by familiarity with prior convention; others convey their meaning through pictorial resemblance to a physical object, and thus may also be referred to as ''pictograms''. The numerals and mathematical symbols are ideograms – 1 'one', 2 'two', + 'plus', = 'equals', and so on (compare the section "Mathematics" below). In English, the ampersand & is used for 'and' and (as in many languages) for Latin ' (as in &c for '), % for ' percent' ('per cent'), # for 'number' (or 'pound', among other meanings), § for 'section', $ for 'dollar', € for 'euro', £ for 'pound', ° for 'degree', @ for 'at', and so on. The reason they are ideograms rather than logograms is that they do not denote fixed morphemes: they can be read in many different languages, not just ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Ideographic Research Group
The Ideographic Research Group (IRG), formerly called the Ideographic Rapporteur Group, is a subgroup of Working Group 2 (WG2) of ISO/IEC JTC 1/SC 2 (SC 2), the subcommittee of the Joint Technical Committee of ISO and IEC which is responsible for developing standards within the field of coded character sets. IRG is composed of experts from China, Japan, South Korea, Vietnam and other countries and regions that use Han characters, as well as experts representing the Unicode Consortium. The group is responsible for coordinating the addition of new CJK unified ideographs to the Universal Multiple-Octet Coded Character Set (ISO/IEC 10646) and the Unicode Standard. The group meets twice a year for 4-5 days each time, and reports its activity to the subsequent meeting of WG2. History The precursor to the Ideographic Rapporteur Group was the CJK Joint Research Group (CJK-JRG), which was established in 1990. In October 1993 this group was established as a subgroup of WG2 under SC2 with t ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


EDICT
An edict is a decree or announcement of a law, often associated with monarchism, but it can be under any official authority. Synonyms include "dictum" and "pronouncement". ''Edict'' derives from the Latin edictum. Notable edicts * Telepinu Proclamation, by Telipinu, king of the Hittites. Written c. 1550 BC, it helped archeologists to construct a succession of Hittite Kings. It also recounts Mursili I's conquest of Babylon. * Edicts of Ashoka, by the Mauryan emperor, Ashoka, during his reign from 272 BC to 231 BC. * Reform of Roman Calendar, Julian Calendar, took effect on 1 January AUC 709 (45 BC). * Edictum perpetuum (129), an Imperial revision of the long-standing Praetor's Edict, a periodic document which first began under the late Roman Republic (c.509–44 BC). * Edict on Maximum Prices (301), by Roman Emperor Diocletian. It attempted to reform the Roman system of taxation and to stabilize the coinage. * Edict of Toleration (311), by Galerius before his death. This proclam ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]