The CEDICT project was started by Paul Denisowski in 1997 and is maintained by a team on mdbg.net under the name CC-CEDICT, with the aim to provide a complete Chinese to English dictionary with pronunciation in pinyin for the Chinese characters.
Content
CEDICT is a text file; other programs (or simply Notepad or egrep or equivalent) are needed to search and display it. This project is considered a standard Chinese-English reference on the Internet and is used by several other Chinese-English projects. The Unihan Database uses CEDICT data for most of its information about character compounds, but this is auxiliary and is explicitly not a part of the main Unicode database.
Features:
* Traditional Chinese and Simplified Chinese
* Pinyin (several pronunciations)
* American English (several)
* , it had 119,494 entries in UTF-8.
The basic format of a CEDICT entry is:
Traditional Simplified [pin1 yin1] /American English equivalent 1/equivalent 2/
漢字 汉字 [han4 zi4] /Chinese character/CL:個|个/
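As a rough illustration of how a program might read this format, the following Python sketch splits one entry into its fields. It assumes each entry occupies a single line of the form `Traditional Simplified [pin1 yin1] /gloss 1/gloss 2/`, and that lines beginning with `#` are comments. The names `Entry`, `ENTRY_RE` and `parse_entry` are hypothetical and are not part of the CEDICT project.

```python
import re
from typing import List, NamedTuple, Optional


class Entry(NamedTuple):
    traditional: str
    simplified: str
    pinyin: str
    glosses: List[str]


# Assumed layout: Traditional Simplified [pin1 yin1] /gloss 1/gloss 2/
ENTRY_RE = re.compile(r"^(\S+)\s+(\S+)\s+\[([^\]]*)\]\s+/(.*)/\s*$")


def parse_entry(line: str) -> Optional[Entry]:
    """Parse one dictionary line; return None for comments or malformed lines."""
    if line.startswith("#"):  # assumption: '#' marks a comment line
        return None
    match = ENTRY_RE.match(line)
    if match is None:
        return None
    traditional, simplified, pinyin, body = match.groups()
    # Glosses are separated by '/' inside the outer pair of slashes.
    return Entry(traditional, simplified, pinyin, body.split("/"))


entry = parse_entry("漢字 汉字 [han4 zi4] /Chinese character/CL:個|个/")
```

Here `entry.glosses` holds the two slash-separated definitions, and a line that does not fit the assumed layout yields `None` instead of raising an error.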
Example of a simple egrep search:
$ egrep -i 有勇無謀 cedict.txt
有勇無謀 有勇无谋 [you3 yong3 wu2 mou2] /bold but not very astute/
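The same kind of lookup can be sketched in Python for cases where egrep is not available. This is only an approximation: Python's `re` syntax differs in some details from egrep's extended regular expressions. The function name `search_lines` is hypothetical, and the sample lines assume the bracketed entry layout `Traditional Simplified [pin1 yin1] /gloss/`.

```python
import re


def search_lines(lines, pattern):
    """Return the lines that match `pattern`, ignoring case (like `egrep -i`)."""
    regex = re.compile(pattern, re.IGNORECASE)
    return [line for line in lines if regex.search(line)]


# Two sample entries standing in for the contents of the dictionary file.
sample = [
    "漢字 汉字 [han4 zi4] /Chinese character/CL:個|个/",
    "有勇無謀 有勇无谋 [you3 yong3 wu2 mou2] /bold but not very astute/",
]
matches = search_lines(sample, "有勇無謀")
```

To search the actual file, an open file object can be passed instead of the list, for example `open("cedict.txt", encoding="utf-8")`, since the function only needs an iterable of lines.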
History
Related projects
CEDICT has shown the way to some other projects:
* HanDeDict (~156,000 Chinese entries)
* CFDICT (~44,000 entries) for French
* Some older CEDICT data is also found in the Adsotrans dictionary.
* February 2012: ChE-DICC, the Spanish-Chinese free dictionary, starts (currently beta)
* May 2017: CHDICT (11,000 entries) for Hungarian
* CC-Canto is Pleco Software's addition of Cantonese language readings in Jyutping transcription to CC-CEDICT
* Cantonese CEDICT features Cantonese language readings in Yale transcription and has Cantonese-specific words, many of which were taken from "A Dictionary of Cantonese Slang" in possible copyright infringement.
External links
* CC-CEDICT Editor: project home page
* More information on the formatting of CC-CEDICT
* MDBG free online Chinese–English dictionary: uses CC-CEDICT, supports adding / editing entries and offers recent CC-CEDICT downloads.
* Flashonary: a Chinese-English dictionary with integrated flashcards that uses CC-CEDICT.
* Example of CEDICT data for the Han character "中", used by Unihan (section "Chinese Compounds")
* Chinese Dictionaries: discussion group about Chinese->"foreign language" dictionaries
* The homepage of Paul Denisowski, the founder of CEDICT
* uses CEDICT
* Mandarin Text Project: uses CEDICT
* HanDeDict @ Zydeo: open-source Chinese-German dictionary
* CHDICT kínai-magyar szótár: open-source Chinese-Hungarian dictionary
* Zhonga: Chinese-English dictionary with handwriting recognition and pronunciation, uses CEDICT.
* HSK.HELP: uses CEDICT