HOME



picture info

Chōonpu
The , also known as , , , or Katakana-Hiragana Prolonged Sound Mark by the Unicode Consortium, is a Japanese symbol that indicates a , or a long vowel of two morae in length. Its form is a horizontal or vertical line in the center of the text with the width of one kanji or kana character. It is written horizontally in horizontal text and vertically in vertical text (). The is usually used to indicate a long vowel sound in katakana writing, rarely in hiragana writing, and never in romanized Japanese. The is a distinct mark from the dash, and in most Japanese typefaces it can easily be distinguished. In horizontal writing it is similar in appearance to, but should not be confused with, the kanji character ("one"). The symbol is sometimes used with hiragana, for example in the signs of ramen restaurants, which are often written in hiragana, while the most standard orthography would be in katakana: . Canonically, however, hiragana never uses the ; instead, another vowel k ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Hiragana
is a Japanese language, Japanese syllabary, part of the Japanese writing system, along with ''katakana'' as well as ''kanji''. It is a phonetic lettering system. The word ''hiragana'' means "common" or "plain" kana (originally also "easy", as contrasted with kanji). Hiragana and katakana are both kana systems. With few exceptions, each mora (linguistics), mora in the Japanese language is represented by one character (or one digraph) in each system. This may be a vowel such as /a/ (hiragana wikt:あ, あ); a consonant followed by a vowel such as /ka/ (wikt:か, か); or /N/ (wikt:ん, ん), a nasal stop, nasal sonorant which, depending on the context and dialect, sounds either like English ''m'', ''n'' or ''ng'' () when syllable-final or like the nasal vowels of French language, French, Portuguese language, Portuguese or Polish language, Polish. Because the characters of the kana do not represent single consonants (except in the case of the aforementioned ん), the kana are r ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




Japanese Typographic Symbols
This article lists Japanese typographic symbols that are not included in kana or kanji groupings. Repetition marks Brackets and quotation marks Phonetic marks Punctuation marks Other special marks Organization-specific symbols See also * Japanese map symbols * Japanese punctuation * Emoji An emoji ( ; plural emoji or emojis; , ) is a pictogram, logogram, ideogram, or smiley embedded in text and used in electronic messages and web pages. The primary function of modern emoji is to fill in emotional cues otherwise missing from type ..., which originated in Japanese mobile phone culture ReferencesJapanese Symbols
Retrieved 18 December 2022. {{reflist Typographic symbols
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

KPS 9566
KPS 9566 ("''DPRK Standard Korean Graphic Character Set for Information Interchange''") is a North Korean standard specifying a character encoding for the Chosŏn'gŭl (Hangul) writing system used for the Korean language. The edition of 1997 specified an ISO 2022-compliant 94×94 two-byte coded character set. Subsequent editions have added additional encoded characters outside of the 94×94 plane, in a manner comparable to UHC or GBK. KPS 9566 differs in approach from KS X 1001, its South Korean counterpart, in using a different ordering of Chosŏn'gŭl, in encoding explicit vertical presentation forms of punctuation, in not encoding duplicate Hanja for multiple readings, and in including several characters specific to the North Korean political system, including special encodings for the names of the country's past and present leaders (Kim Il Sung, Kim Jong Il and Kim Jong Un). Although KPS 9566 was the original source of several characters added to Unicode, not al ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Katakana
is a Japanese syllabary, one component of the Japanese writing system along with hiragana, kanji and in some cases the Latin script (known as rōmaji). The word ''katakana'' means "fragmentary kana", as the katakana characters are derived from components or fragments of more complex kanji. Katakana and hiragana are both kana systems. With one or two minor exceptions, each syllable (strictly mora (linguistics), mora) in the Japanese language is represented by one character or ''kana'' in each system. Each kana represents either a vowel such as "''a''" (katakana wikt:ア, ア); a consonant followed by a vowel such as "''ka''" (katakana wikt:カ, カ); or "''n''" (katakana wikt:ン, ン), a nasal stop, nasal sonorant which, depending on the context, sounds like English ''m'', ''n'' or ''ng'' () or like the nasal vowels of Portuguese language, Portuguese or Galician language, Galician. In contrast to the hiragana syllabary, which is used for Japanese words not covered by kanji an ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Sokuon
The is a Japanese typographic symbols, Japanese symbol in the form of a small hiragana or katakana , as well as the various consonants represented by it. In less formal language, it is called or , meaning "small ". It serves multiple purposes in Japanese writing. Appearance In both hiragana and katakana, the appears as a reduced in size: Use in Japanese The main use of the is to mark a geminate consonant, which is represented in most romanization of Japanese, romanization systems by the doubling of the consonant, except that Hepburn romanization writes a geminate ''ch'' as ''tch''. It denotes the gemination of the initial consonant of the symbol that follows it. Examples: The sokuon never appears at the beginning of a word or before a vowel (''a'', ''i'', ''u'', ''e'', or ''o''), and rarely appears before a syllable that begins with the consonants ''n'', ''m'', ''r'', ''w'', or ''y''. (In words and loanwords that require geminating these consonants, , , , , and are ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


WHATWG
The Web Hypertext Application Technology Working Group (WHATWG) is a community of people interested in evolving HTML and related technologies. The WHATWG was founded by individuals from Apple Inc., the Mozilla Foundation and Opera Software, leading web browser vendors in 2004. WHATWG is responsible for maintaining multiple web-related technical standards, including the specifications for the HyperText Markup Language (HTML) and the Document Object Model (DOM). The central organizational membership and control of WHATWG – its "Steering Group" – consists of Apple, Mozilla, Google, and Microsoft. WHATWG community members work with the editor of the specifications to ensure correct implementation. History The WHATWG was formed in response to the slow development of World Wide Web Consortium (W3C) Web standards and W3C's decision to abandon HTML in favor of XML-based technologies. The WHATWG mailing list was announced on 4 June 2004, two days after the initiatives of a j ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Hong Kong Supplementary Character Set
The Hong Kong Supplementary Character Set (; commonly abbreviated to HKSCS) is a set of Chinese characters – 4,702 in total in the initial release—used in Standard Cantonese, Cantonese, as well as when writing the List of places in Hong Kong, names of some places in Hong Kong (whether in written Cantonese or Vernacular Chinese, standard written Chinese sentences). It evolved from the preceding Government Chinese Character Set () or GCCS. GCCS is a set of supplementary Chinese character Chinese characters are logographs used to write the Chinese languages and others from regions historically influenced by Chinese culture. Of the four independently invented writing systems accepted by scholars, they represent the only on ...s coded in the user-defined areas of the Big5 character set. It was originally used within the Government of Hong Kong, Hong Kong Government and later used by the public. It later evolved into Hong Kong Supplementary Character Set when the charact ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




Big5
Big-5 or Big5 ( zh, t=大五碼) is a Chinese character encoding method used in Taiwan, Hong Kong, and Macau for traditional Chinese characters. The People's Republic of China (PRC), which uses simplified Chinese characters, uses the GB 18030 character set instead (though it can also substitute Big-5 or UTF-8). Big5 gets its name from the consortium of five companies in Taiwan that developed it. Encoding The original Big5 character set is sorted first by usage frequency, second by stroke count, lastly by Kangxi radical. The original Big5 character set lacked many commonly used characters. To solve this problem, each vendor developed its own extension. The ETen extension became part of the current Big5 standard through popularity. The structure of Big5 does not conform to the ISO 2022 standard, but rather bears a certain similarity to the encoding. It is a double-byte character set (DBCS) with the following structure: (the prefix 0x signifying hexadecimal numbers). Sta ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Japanese Braille
Japanese Braille is the braille script of the Japanese language. It is based on the original braille script, though the connection is tenuous. In Japanese it is known as , literally "dot characters". It transcribes Japanese more or less as it would be written in the ''hiragana'' or ''katakana'' syllabaries, without any provision for writing ''kanji''. Japanese Braille is a vowel-based abugida. That is, the glyphs are syllabic, but unlike kana they contain separate symbols for consonant and vowel, and the vowel takes primacy. The vowels are written in the upper left corner (dots 1, 2, 4) and may be used alone. The consonants are written in the lower right corner (dots 3, 5, 6) and cannot occur alone. However, the semivowel ''y'' is indicated by dot 4, one of the vowel dots, and the vowel combination is dropped to the bottom of the cell. When this dot is written in isolation, it indicates that the following syllable has a medial ''y'', as in ''mya''. Syllables beginning with ''w' ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Unicode
Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Character (computing), characters and 168 script (Unicode), scripts used in various ordinary, literary, academic, and technical contexts. Unicode has largely supplanted the previous environment of a myriad of incompatible character sets used within different locales and on different computer architectures. The entire repertoire of these sets, plus many additional characters, were merged into the single Unicode set. Unicode is used to encode the vast majority of text on the Internet, including most web pages, and relevant Unicode support has become a common consideration in contemporary software development. Unicode is ultimately capable of encoding more than 1.1 million characters. The Unicode character repertoire is synchronized with Univers ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


GB 18030
GB 18030 is a Chinese government standard, described as ''Information Technology — Chinese coded character set'' and defines the required language and character support necessary for software in China. GB18030 is the registered Internet name for the official character set of the People's Republic of China (PRC) superseding GB2312. As a Unicode Transformation Format (i.e. an encoding of all Unicode code points), GB18030 supports both simplified and traditional Chinese characters. It is also compatible with legacy encodings including GB/T 2312, CP936, and GBK 1.0. The Unicode Consortium has warned implementers that the latest version of this Chinese standard, GB 18030-2022, introduces what they describe as "disruptive changes" from the previous version GB 18030-2005 "involving 33 different characters and 55 code positions". GB 18030-2022 was enforced from 1 August 2023. It has been implemented in ICU 73.2; and in Java 21, and backported to older ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


International Components For Unicode
International Components for Unicode (ICU) is an open-source project of mature C/ C++ and Java libraries for Unicode support, software internationalization, and software globalization. ICU is widely portable to many operating systems and environments. It gives applications the same results on all platforms and between C, C++, and Java software. The ICU project is a technical committee of the Unicode Consortium and sponsored, supported, and used by IBM and many other companies. ICU has been included as a standard component with Microsoft Windows since Windows 10 version 1703. ICU provides the following services: Unicode text handling, full character properties, and character set conversions; Unicode regular expressions; full Unicode sets; character, word, and line boundaries; language-sensitive collation and searching; normalization, upper and lowercase conversion, and script transliterations; comprehensive locale data and resource bundle architecture via the Common Loca ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]