In a written language, a logogram (from Ancient Greek ''logos'' 'word', and ''gramma'' 'that which is drawn or written'), also logograph or lexigraph, is a written character that represents a semantic component of a language, such as a word or morpheme. Chinese characters as used in Chinese as well as other languages are logograms, as are Egyptian hieroglyphs and characters in cuneiform script. A writing system that primarily uses logograms is called a ''logography''. Non-logographic writing systems, such as alphabets and syllabaries, are ''phonemic'': their individual symbols represent sounds directly and lack any inherent meaning. However, all known logographies have some phonetic component, generally based on the rebus principle, and the addition of a phonetic component to pure ideographs is considered to be a key innovation in enabling the writing system to adequately encode human language.

Types of logographic systems

Some of the earliest recorded writing systems are logographic; the first historical civilizations of Mesopotamia, Egypt, China and Mesoamerica all used some form of logographic writing. All logographic scripts ever used for natural languages rely on the rebus principle to extend a relatively limited set of logograms: a subset of characters is used for their phonetic values, either consonantal or syllabic. The term logosyllabary is used to emphasize the partially phonetic nature of these scripts when the phonetic domain is the syllable. In Ancient Egyptian hieroglyphs, Ch'olti', and Chinese, there has been the additional development of determinatives, which are combined with logograms to narrow down their possible meaning. In Chinese, they are fused with logographic elements used phonetically; such "radical and phonetic" characters make up the bulk of the script. Ancient Egyptian and Chinese relegated the active use of rebus to the spelling of foreign and dialectal words.

Logoconsonantal

Logoconsonantal scripts have graphemes that may be extended phonetically according to the consonants of the words they represent, ignoring the vowels. For example, the Egyptian hieroglyph with the sign-list code G38 was used to write both ''sȝ'' 'duck' and ''sȝ'' 'son', though it is likely that these words were not pronounced the same except for their consonants. The primary examples of logoconsonantal scripts are Egyptian hieroglyphs, hieratic, and demotic, all of which were used to write Ancient Egyptian.

Logosyllabic

Logosyllabic scripts have graphemes which represent morphemes, often polysyllabic morphemes, but when extended phonetically represent single syllables. They include cuneiform, Anatolian hieroglyphs, Cretan hieroglyphs, Linear A and Linear B, Maya script, Aztec script, Mixtec script, and the first five phases of the Bamum script.

Others

A peculiar system of logograms developed within the Pahlavi scripts (developed from the Aramaic abjad) used to write Middle Persian during much of the Sassanid period; the logograms were composed of letters that spelled out the word in Aramaic but were pronounced as in Persian (for instance, the combination ''m-l-k'' would be pronounced "shah"). These logograms, called ''hozwārishn'' (a form of heterograms), were dispensed with altogether after the Arab conquest of Persia and the adoption of a variant of the Arabic alphabet.

Semantic and phonetic dimensions

All historical logographic systems include a phonetic dimension, as it is impractical to have a separate basic character for every word or morpheme in a language. In some cases, such as cuneiform as it was used for Akkadian, the vast majority of glyphs are used for their sound values rather than logographically. Many logographic systems also have a semantic/ideographic component (see ideogram), called "determinatives" in the case of Egyptian and "radicals" in the case of Chinese. Typical Egyptian usage was to augment a logogram, which may represent several words with different pronunciations, with a determinative to narrow down the meaning, and a phonetic component to specify the pronunciation. In the case of Chinese, the vast majority of characters are a fixed combination of a radical that indicates the character's nominal category, plus a phonetic to give an idea of the pronunciation. The Mayan system used logograms with phonetic complements like the Egyptian, while lacking ideographic components.

Universal logograms

Not all logograms are associated with one specific language, and some are not associated with any language at all. The ampersand is a logogram in the Latin script, a combination of the letters "e" and "t". In Latin, "et" translates to "and", and the ampersand is still used to represent this word today. It does so in a variety of languages, however, representing the morpheme "and", "y", or "en" for a speaker of English, Spanish, or Dutch, respectively. Outside of any script is Unicode, a compilation of characters of various meanings. The Unicode Consortium, which maintains the standard, states its intention to include every character from every language. Unicode is the generally accepted standard for computer character encoding, but others, like ASCII and Baudot, exist and serve various purposes in digital communication. Many logograms in these databases are ubiquitous, and are used on the Internet by users worldwide.
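As a rough illustration of how such symbols are catalogued independently of any one language, the following Python sketch looks up a few characters in the Unicode character database through the standard `unicodedata` module. The helper name `describe` and the choice of sample characters are assumptions made for this example only.

```python
import unicodedata


def describe(char: str) -> str:
    """Return the code point and official Unicode name of one character."""
    return f"U+{ord(char):04X} {unicodedata.name(char)}"


# The ampersand and the digit one are read aloud differently in different
# languages, yet each has a single code point and a language-neutral name.
for symbol in ["&", "1", "山"]:
    print(symbol, describe(symbol))

# The ampersand also belongs to the much smaller ASCII repertoire,
# whereas the Chinese character does not.
print("&".isascii(), "山".isascii())
```

The names returned are catalogue labels in the standard, not pronunciations, which is consistent with the point that a reader supplies the spoken word from their own language.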

Chinese characters

Chinese scholars have traditionally classified the Chinese characters (''hànzì'') into six types by etymology. The first two types are "single-body", meaning that the character was created independently of other characters. "Single-body" pictograms and ideograms make up only a small proportion of Chinese logograms. More productive for the Chinese script were the two "compound" methods, in which a character was created by assembling different characters. Despite being called "compounds", these logograms are still single characters, and are written to take up the same amount of space as any other logogram. The final two types are methods in the usage of characters rather than the formation of characters themselves.

# The first type, and the type most often associated with Chinese writing, are pictograms, which are pictorial representations of the morpheme represented, e.g. 山 for 'mountain'.
# The second type are the ideograms that attempt to visualize abstract concepts, such as 上 'up' and 下 'down'. Also considered ideograms are pictograms with an ideographic indicator; for instance, 刀 is a pictogram meaning 'knife', while 刃 is an ideogram meaning 'blade'.
# Radical–radical compounds, in which each element of the character (called a radical) hints at the meaning. For example, 休 'rest' is composed of the characters for 'person' (人) and 'tree' (木), with the intended idea of someone leaning against a tree, i.e. resting.
# Radical–phonetic compounds, in which one component (the radical) indicates the general meaning of the character, and the other (the phonetic) hints at the pronunciation. An example is 樑 (''liáng''), where the phonetic 梁 ''liáng'' indicates the pronunciation of the character and the radical 木 ('wood') indicates its meaning of 'supporting beam'. Characters of this type constitute around 90% of Chinese logograms.
# Changed-annotation characters are characters which were originally the same character but have bifurcated through orthographic and often semantic drift. For instance, 樂 can mean both 'music' (''yuè'') and 'pleasure' (''lè'').
# Improvisational characters (lit. 'improvised-borrowed-words') come into use when a native spoken word has no corresponding character, and hence another character with the same or a similar sound (and often a close meaning) is "borrowed"; occasionally, the new meaning can supplant the old meaning. For example, 自 used to be a pictographic word meaning 'nose', but was borrowed to mean 'self', and is now used almost exclusively to mean the latter; the original meaning survives only in stock phrases and more archaic compounds. Because of their derivational process, the entire set of Japanese kana can be considered to be of this type of character, hence the name ''kana'' (lit. 'borrowed names'). Example: Japanese 仮名; 仮 is a simplified form of Chinese 假 used in Korea and Japan, and 假借 is the Chinese name for this type of characters.

The most productive method of Chinese writing, the radical-phonetic, was made possible by ignoring certain distinctions in the phonetic system of syllables. In Old Chinese, certain post-final ending consonants were typically ignored; these developed into tones in Middle Chinese, which were likewise ignored when new characters were created. Also ignored were differences in aspiration (between aspirated vs. unaspirated obstruents, and voiced vs. unvoiced sonorants); the Old Chinese difference between type-A and type-B syllables (often described as presence vs. absence of palatalization or pharyngealization); and sometimes, voicing of initial obstruents and/or the presence of a medial after the initial consonant. In earlier times, greater phonetic freedom was generally allowed. During Middle Chinese times, newly created characters tended to match pronunciation exactly, other than the tone, often by using as the phonetic component a character that is itself a radical-phonetic compound.

Due to the long period of language evolution, such component "hints" within characters as provided by the radical-phonetic compounds are sometimes useless and may be misleading in modern usage. As an example, based on 每 'each', pronounced ''měi'' in Standard Mandarin, are the characters 侮 'to humiliate', 悔 'to regret', and 海 'sea', pronounced respectively ''wǔ'', ''huǐ'', and ''hǎi'' in Mandarin. Three of these characters (每, 悔, and 海) were pronounced very similarly in Old Chinese according to a recent reconstruction by William H. Baxter and Laurent Sagart, but sound changes in the intervening 3,000 years or so (including two different dialectal developments, in the case of the last two characters) have resulted in radically different pronunciations.

Chinese characters used in Japanese and Korean

Within the context of the Chinese language, Chinese characters (known as hanzi) by and large represent words and morphemes rather than pure ideas; however, the adoption of Chinese characters by the Japanese and Korean languages (where they are known as kanji and hanja, respectively) has resulted in some complications to this picture. Many Chinese words, composed of Chinese morphemes, were borrowed into Japanese and Korean together with their character representations; in this case, the morphemes and characters were borrowed together. In other cases, however, characters were borrowed to represent native Japanese and Korean morphemes, on the basis of meaning alone. As a result, a single character can end up representing multiple morphemes of similar meaning but with different origins across several languages. Because of this, kanji and hanja are sometimes described as morphographic writing systems.

Differences in processing of logographic and phonologic writing systems

Because much research on language processing has centered on English and other alphabetically written languages, many theories of language processing have stressed the role of phonology in producing speech. Contrasting logographically coded languages, where a single character is represented phonetically and ideographically, with phonetically/phonemically spelled languages has yielded insights into how different languages rely on different processing mechanisms. Studies on the processing of logographically coded languages have, among other things, looked at neurobiological differences in processing, with one area of particular interest being hemispheric lateralization. Since logographically coded languages are more closely associated with images than alphabetically coded languages, several researchers have hypothesized that right-side activation should be more prominent in logographically coded languages. Although some studies have yielded results consistent with this hypothesis, there are too many contrasting results to draw any final conclusions about the role of hemispheric lateralization in orthographically versus phonetically coded languages.

Another topic that has been given some attention is differences in processing of homophones. Verdonschot et al. examined differences in the time it took to read a homophone out loud when a picture that was either related or unrelated to a homophonic character was presented before the character. Both Japanese and Chinese homophones were examined. Whereas word production in alphabetically coded languages (such as English) has shown a relatively robust immunity to the effect of context stimuli, Verdonschot et al. found that Japanese homophones seem particularly sensitive to these types of effects. Specifically, reaction times were shorter when participants were presented with a phonologically related picture before being asked to read a target character out loud. An example of a phonologically related stimulus from the study is a picture of an elephant, which is pronounced ''zou'' in Japanese, presented before a Chinese character that is also read ''zou''. No effect of phonologically related context pictures was found on the reaction times for reading Chinese words.

A comparison of the (partially) logographically coded languages Japanese and Chinese is interesting because, whereas Japanese contains more than 60% homographic heterophones (characters that can be read in two or more different ways), most Chinese characters have only one reading. Because both languages are logographically coded, the difference in latency in reading aloud Japanese and Chinese due to context effects cannot be ascribed to the logographic nature of the writing systems. Instead, the authors hypothesize that the difference in latency times is due to additional processing costs in Japanese, where the reader cannot rely solely on a direct orthography-to-phonology route, but must also access information on a lexical-syntactic level in order to choose the correct pronunciation. This hypothesis is confirmed by studies finding that Japanese Alzheimer's disease patients whose comprehension of characters had deteriorated could still read the words out loud with no particular difficulty.

Studies contrasting the processing of English and Chinese homophones in lexical decision tasks have found an advantage for homophone processing in Chinese, and a disadvantage for processing homophones in English. The processing disadvantage in English is usually described in terms of the relative lack of homophones in the English language. When a homophonic word is encountered, the phonological representation of that word is first activated. However, since this is an ambiguous stimulus, a matching at the orthographic/lexical ("mental dictionary") level is necessary before the stimulus can be disambiguated and the correct pronunciation can be chosen. In contrast, in a language (such as Chinese) where many characters with the same reading exist, it is hypothesized that the person reading the character will be more familiar with homophones, and that this familiarity will aid the processing of the character and the subsequent selection of the correct pronunciation, leading to shorter reaction times when attending to the stimulus.

In an attempt to better understand homophony effects on processing, Hino et al. conducted a series of experiments using Japanese as their target language. While controlling for familiarity, they found a processing advantage for homophones over non-homophones in Japanese, similar to what has previously been found in Chinese. The researchers also tested whether orthographically similar homophones would yield a disadvantage in processing, as has been the case with English homophones, but found no evidence for this. It is evident that there is a difference in how homophones are processed in logographically coded and alphabetically coded languages, but whether the advantage for processing of homophones in the logographically coded languages Japanese and Chinese (i.e. their writing systems) is due to the logographic nature of the scripts, or merely reflects an advantage for languages with more homophones regardless of script nature, remains to be seen.

Advantages and disadvantages

Separating writing and pronunciation

The main difference between logograms and other writing systems is that the graphemes are not linked directly to their pronunciation. An advantage of this separation is that understanding of the pronunciation or language of the writer is unnecessary, e.g. the numeral 1 is understood regardless of whether its reader calls it ''one'', ''ichi'' or ''wāḥid''. Likewise, people speaking different varieties of Chinese may not understand each other in speaking, but may do so to a significant extent in writing even if they do not write in Standard Chinese. Therefore, in China, Vietnam, Korea, and Japan before modern times, communication by writing was the norm of East Asian international trade and diplomacy using Classical Chinese.

This separation, however, also has the great disadvantage of requiring the memorization of the logograms when learning to read and write, separately from the pronunciation. Owing not to an inherent feature of logograms but to its unique history of development, Japanese has the added complication that almost every logogram has more than one pronunciation. Conversely, a phonetic character set is written precisely as it is spoken, but with the disadvantage that slight pronunciation differences introduce ambiguities. Many alphabetic systems such as those of Greek, Latin, Italian, Spanish, and Finnish make the practical compromise of standardizing how words are written while maintaining a nearly one-to-one relation between characters and sounds. Orthographies in some other languages, such as English, French, Thai and Tibetan, are all more complicated than that; character combinations are often pronounced in multiple ways, usually depending on their history.

Hangul, the Korean language's writing system, is an example of an alphabetic script that was designed to replace the logogrammatic hanja in order to increase literacy. The latter is now rarely used, but retains some currency in South Korea, sometimes in combination with hangul.

According to government-commissioned research, the most commonly used 3,500 characters listed in the People's Republic of China's "Chart of Common Characters of Modern Chinese" (''Xiàndài Hànyǔ Chángyòngzì Biǎo'') cover 99.48% of a two-million-word sample. As for traditional Chinese characters, 4,808 characters are listed in the "Chart of Standard Forms of Common National Characters" by the Ministry of Education of the Republic of China, while 4,759 are listed in the "List of Graphemes of Commonly-Used Chinese Characters" by the Education and Manpower Bureau of Hong Kong; both lists are intended to be taught during elementary and junior secondary education. Education after elementary school introduces fewer new characters than new words, which are mostly combinations of two or more already learned characters.
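A coverage figure of this kind can be understood as the share of character tokens in a text sample that belong to a given list of common characters. The following Python sketch shows one way such a ratio might be computed; the function name `coverage`, the toy sample, and the three-character list are all hypothetical, and the method used in the research cited above is not described in this article.

```python
def coverage(sample: str, common_chars: set[str]) -> float:
    """Fraction of non-whitespace characters in `sample` found in `common_chars`."""
    tokens = [ch for ch in sample if not ch.isspace()]
    if not tokens:
        return 0.0
    covered = sum(1 for ch in tokens if ch in common_chars)
    return covered / len(tokens)


# Toy data for illustration only, not real corpus material.
sample_text = "山上有人 人在木下休"
common_list = {"山", "人", "木"}

print(f"{coverage(sample_text, common_list):.2%}")
```

With a real frequency list and a large corpus in place of the toy data, the same ratio would show how quickly coverage approaches 100% as the list grows.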

Characters in information technology

Entering complex characters can be cumbersome on electronic devices due to a practical limitation in the number of input keys. There exist various input methods for entering logograms, either by breaking them up into their constituent parts, as with the Cangjie and Wubi methods of typing Chinese, or by using phonetic systems such as Bopomofo or Pinyin, where the word is entered as pronounced and then selected from a list of logograms matching it. While the former method is (linearly) faster, it is more difficult to learn. With the Chinese alphabet system, however, the strokes forming the logogram are typed as they are normally written, and the corresponding logogram is then entered.

Also because of the number of glyphs, more memory is needed in programming and computing in general to store each grapheme, as the character set is larger. As a comparison, ISO 8859 requires only one byte for each grapheme, while the Basic Multilingual Plane encoded in UTF-8 requires up to three bytes. On the other hand, English words, for example, average five characters and a space per word and thus need six bytes for every word. Since many logograms contain more than one grapheme, it is not clear which is more memory-efficient. Variable-width encodings allow a unified character encoding standard such as Unicode to use only the bytes necessary to represent a character, reducing the overhead that results from merging large character sets with smaller ones.
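The byte counts discussed above can be checked with a short Python sketch using only the built-in `str.encode` method. The helper name `utf8_len` and the sample strings are assumptions chosen for illustration; Latin-1 is used here as one part of the ISO 8859 series.

```python
def utf8_len(text: str) -> int:
    """Number of bytes `text` occupies when encoded as UTF-8."""
    return len(text.encode("utf-8"))


# ISO 8859-1 (Latin-1) stores each of its characters in a single byte.
print(len("é".encode("latin-1")))

# UTF-8 is variable-width: ASCII letters take one byte, while other
# characters in the Basic Multilingual Plane take two or three.
for ch in ["a", "é", "山"]:
    print(ch, utf8_len(ch))

# An English word with its trailing space compared with a single logogram.
print(utf8_len("mountain "), utf8_len("山"))
```

A single comparison like this does not settle which system is more memory-efficient overall, since many words in logographic scripts are written with more than one character.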


External links

古代文字資料館　Ancient Writing Library