য়
   HOME

TheInfoList



OR:

The Bengali script or Bangla alphabet (,
romanized In linguistics, romanization is the conversion of text from a different writing system to the Roman (Latin) script, or a system for doing so. Methods of romanization include transliteration, for representing written text, and transcription, ...
:
''Bāṅlā bôrṇômālā'') is the standard
writing system A writing system comprises a set of symbols, called a ''script'', as well as the rules by which the script represents a particular language. The earliest writing appeared during the late 4th millennium BC. Throughout history, each independen ...
used to write the
Bengali language Bengali, also known by its endonym and exonym, endonym Bangla (, , ), is an Indo-Aryan languages, Indo-Aryan language belonging to the Indo-Iranian languages, Indo-Iranian branch of the Indo-European languages, Indo-European language family. ...
, and has historically been used to write
Sanskrit Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...
within
Bengal Bengal ( ) is a Historical geography, historical geographical, ethnolinguistic and cultural term referring to a region in the Eastern South Asia, eastern part of the Indian subcontinent at the apex of the Bay of Bengal. The region of Benga ...
. An estimated 300 million people use this syllabic alphabet, which makes it 5th most commonly used writting system in the world. It is the sole national script of
Bangladesh Bangladesh, officially the People's Republic of Bangladesh, is a country in South Asia. It is the List of countries and dependencies by population, eighth-most populous country in the world and among the List of countries and dependencies by ...
and one of the official scripts of India, especifically used in the Indian states of
West Bengal West Bengal (; Bengali language, Bengali: , , abbr. WB) is a States and union territories of India, state in the East India, eastern portion of India. It is situated along the Bay of Bengal, along with a population of over 91 million inhabi ...
,
Tripura Tripura () is a States and union territories of India, state in northeastern India. The List of states and union territories of India by area, third-smallest state in the country, it covers ; and the seventh-least populous state with a populat ...
and the Barak Valley of
Assam Assam (, , ) is a state in Northeast India, northeastern India, south of the eastern Himalayas along the Brahmaputra Valley, Brahmaputra and Barak River valleys. Assam covers an area of . It is the second largest state in Northeast India, nor ...
. The script is also used for the
Meitei language Meitei (; ) also known as Manipuri ), is a Tibeto-Burman language of northeast India. It is the official language and the lingua franca of Manipur and an additional official language in four districts of Assam. It is one of the scheduled ...
in
Manipur Manipur () is a state in northeastern India with Imphal as its capital. It borders the Indian states of Assam to the west, Mizoram to the south, and Nagaland to the north and shares the international border with Myanmar, specifically t ...
, defined by the '' Manipur Official Language (Amendment) Act, 2021''. From a classificatory point of view, the Bengali writing system is derived from the
Brahmi script Brahmi ( ; ; ISO 15919, ISO: ''Brāhmī'') is a writing system from ancient India. "Until the late nineteenth century, the script of the Aśokan (non-Kharosthi) inscriptions and its immediate derivatives was referred to by various names such as ...
. It is written from left to right. It is an
abugida An abugida (; from Geʽez: , )sometimes also called alphasyllabary, neosyllabary, or pseudo-alphabetis a segmental Writing systems#Segmental writing system, writing system in which consonant–vowel sequences are written as units; each unit ...
, i.e. its vowel
grapheme In linguistics, a grapheme is the smallest functional unit of a writing system. The word ''grapheme'' is derived from Ancient Greek ('write'), and the suffix ''-eme'' by analogy with ''phoneme'' and other emic units. The study of graphemes ...
s are mainly realised not as independent letters, but as
diacritic A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
s modifying the vowel inherent in the base letter they are added to. There are no distinct upper and lower case letter forms, which makes it a unicameral script. The script is characterized by many conjuncts, upstrokes, downstrokes, and other features that hang from a horizontal line running along the tops of the graphemes that links them together called ''matra(মাত্রা)''. The punctuation is all borrowed from 19th-century English, with the exception of one.


Characters

The Bengali script can be divided into
vowel A vowel is a speech sound pronounced without any stricture in the vocal tract, forming the nucleus of a syllable. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness a ...
s and vowel diacritics,
consonant In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract, except for the h sound, which is pronounced without any stricture in the vocal tract. Examples are and pronou ...
s and conjunct consonants, diacritical and other symbols, digits, and
punctuation marks Punctuation marks are marks indicating how a piece of written text should be read (silently or aloud) and, consequently, understood. The oldest known examples of punctuation marks were found in the Mesha Stele from the 9th century BC, consisti ...
. Vowels and consonants are used as letters and also as diacritical marks.


Vowels

The Bengali script has a total of 11 vowel graphemes, each of which is called a 'vowel letter'. The 's represent six of the seven main vowel sounds of Bengali, along with two vowel
diphthongs A diphthong ( ), also known as a gliding vowel or a vowel glide, is a combination of two adjacent vowel sounds within the same syllable. Technically, a diphthong is a vowel with two different targets: that is, the tongue (and/or other parts of ...
. All of them are used in both Bengali and Assamese languages. * ( , 'vocalic ô') sounds as the default
inherent vowel An inherent vowel is part of an abugida (or alphasyllabary) script. It is a vowel sound which is used with each unmarked or basic consonant symbol. There are many known abugida scripts, including most of the Brahmic scripts and Kharosthi, the c ...
for the entire Bengali script. Bengali, Assamese and Odia which are Eastern languages have this value for the inherent vowel, while other languages using
Brahmic scripts The Brahmic scripts, also known as Indic scripts, are a family of abugida writing systems. They are used throughout South Asia, Southeast Asia and parts of East Asia. They are descended from the Brahmi script of ancient India and are used b ...
have ''a'' for their inherent vowel. * Even though the
near-open front unrounded vowel The near-open front unrounded vowel, or near-low front unrounded vowel, is a type of vowel sound. The symbol in the International Phonetic Alphabet that represents this sound is , a lowercase of the ligature. Both the symbol and the sound ar ...
is one of the seven main vowel sounds in the standard Bengali language, no distinct vowel symbol has been allotted for it in the script since there is no sound in Sanskrit, the primary written language when the script was conceived. The use of is very popular and is found to be used in many contexts of æ, such as , meaning acid. The sound is also orthographically realised by multiple means in modern Bengali orthography, usually using some combination of ( , 'vocalic e') , , ( ) and the (diacritic form of the consonant grapheme ). Thus /k/ with the vowel / æ/ will be written as . * There are two graphemes for the vowel sound and two graphemes for the vowel sound . The redundancy stems from the time when this script was used to write
Sanskrit Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...
, a language that had short and long vowels: ( , 'short i') and ( , 'long ī') , and ( ) and ( ) . The letters are preserved in the Bengali script with their traditional names despite the fact that they are no longer pronounced differently in ordinary speech. These graphemes serve an etymological function, however, in preserving the original Sanskrit spelling in Bengali words (words borrowed from Sanskrit). * The grapheme called (or , 'short ri', as it used to be) does not really represent a vowel phoneme in Bengali but the consonant-vowel combination . Nevertheless, it is included in the vowel section of the inventory of the Bengali script. This inconsistency is also a remnant from
Sanskrit Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...
, where the grapheme represents the vocalic equivalent of a retroflex approximant (possibly an
r-colored vowel An r-colored or rhotic vowel (also called a retroflex vowel, vocalic r, or a rhotacized vowel) is a vowel that is modified in a way that results in a lowering in frequency of the third formant. R-colored vowels can be articulated in various w ...
). Another grapheme called (or as it used to be) representing the vocalic equivalent of a
dental approximant The voiced dental fricative is a consonant sound used in some spoken languages. It is familiar to English-speakers as the ''th'' sound in ''father''. Its symbol in the International Phonetic Alphabet is eth, or and was taken from the Old Engl ...
in Sanskrit but actually representing the consonant-vowel combination in Bengali instead of a vowel phoneme, was also included in the vowel section but unlike , it was recently discarded from the inventory since its usage was extremely limited even in Sanskrit. * When a vowel sound occurs syllable-initially or when it follows another vowel, it is written using a distinct letter. When a vowel sound follows a consonant (or a consonant cluster), it is written with a diacritic which, depending on the vowel, can appear above, below, before or after the consonant. These vowel marks cannot appear without a consonant and are called . * An exception to the above system is the vowel , which has no vowel mark but is considered inherent in every consonant letter. To denote the absence of the inherent vowel following a consonant, a diacritic called the (্) may be written underneath the consonant. * Although there are only two diphthongs in the inventory of the script: ( , 'vocalic oi') and ( ) , the Bengali phonetic system has, in fact, many diphthongs.Different Bengali linguists give different numbers of Bengali diphthongs in their works depending on methodology, e.g. 25 (Chatterji 1939: 40), 31 (Hai 1964), 45 (Ashraf and Ashraf 1966: 49), 28 (Kostic and Das 1972:6–7) and 17 (Sarkar 1987). Most diphthongs are represented by juxtaposing the graphemes of their constituent vowels, as in . * There also used to be two long vowels: ( , 'long rri') and ( ), which were removed from the inventory during the Vidyasagarian reform of the script due to peculiarity to Sanskrit. The table below shows the vowels present in the modern (since the late 19th century) inventory of the Bengali alphabet:


Notes


Consonants

Consonant letters are called 'consonant letter' in Bengali. The names of the letters are typically just the consonant sound plus the inherent vowel . Since the inherent vowel is assumed and not written, most letters' names look identical to the letter itself (the name of the letter is itself , not ). * Some letters that have lost their distinctive pronunciation in modern Bengali are called by more elaborate names. For example, since the consonant phoneme is written as both and , the letters are not called simply ; instead, they are called (' dental ') and ('
retroflex A retroflex () or cacuminal () consonant is a coronal consonant where the tongue has a flat, concave, or even curled shape, and is articulated between the alveolar ridge and the hard palate. They are sometimes referred to as cerebral consona ...
'). What was once pronounced and written as a retroflex nasal is now pronounced as an alveolar (unless conjoined with another
retroflex consonant A retroflex () or cacuminal () consonant is a coronal consonant where the tongue has a flat, concave, or even curled shape, and is articulated between the alveolar ridge and the hard palate. They are sometimes referred to as cerebral consona ...
such as , , and ) although the spelling does not reflect the change. * Although still named when they are being taught, retroflex consonants do not exist in Bengali and are instead fronted to their postalveolar and alveolar equivalents. * The voiceless palato-alveolar sibilant phoneme can be written as , ( , ' palatal '), ( , 'retroflex '), or ( , 'dental ' voiceless alveolar fricative), depending on the word. * The voiced palato-alveolar affricate phoneme can be written in two ways, as ( ) or ( ). In many varieties of Bengali, are not distinct from this phoneme, but speakers who distinguish them may use the letters and with contrast. * Post-
reform Reform refers to the improvement or amendment of what is wrong, corrupt, unsatisfactory, etc. The modern usage of the word emerged in the late 18th century and is believed to have originated from Christopher Wyvill's Association movement, which ...
, the letter was introduced to distinguish it from note">Bengali_alphabet#cite_note-21.html" ;"title="nowiki/>Bengali alphabet#cite note-21">note ** The semivowel cannot occur at the beginning of a word. The name of is ('semi-vowel y') [the y is silent in the pronunciation of its name]. The pronunciation of varies between ⟨w⟩ and ⟨j⟩ ['w' and 'y']. ** The name of is ('semi-vowel j'). It is found almost entirely at the beginning of words. ** When present in the middle of words, in conjuncts, is represented as a distinct letter: ( ) which is mostly silent or semi-silent (see below). may alter the pronunciation of the surrounding vowel or double the preceding consonant or be completely silent. * Since the nasalization, nasals and cannot occur at the beginning of a word in Bengali, their names are not and respectively but (pronounced by some as or ) and (pronounced by some as or ) respectively. * There is a difference in the pronunciation of ( , ' (as) with a zero' — the figure is used analogous to the ring below diacritic as the Bengali equivalent of the
Devanagari Devanagari ( ; in script: , , ) is an Indic script used in the Indian subcontinent. It is a left-to-right abugida (a type of segmental Writing systems#Segmental systems: alphabets, writing system), based on the ancient ''Brāhmī script, Brā ...
, which is again analogous to the underdot) and ( ) with that of (sometimes called for distinguishing purpose), similar to other Indic languages. This is especially true in the parlance of western and southern part of Bengal but lesser on the dialects of the eastern side of the
Padma River The Padma () is a major river in Bangladesh. It is the eastern and main distributary of the Ganges, flowing generally southeast for to its confluence with the Meghna River, near the Bay of Bengal. The city of Rajshahi is situated on the banks ...
. and were introduced to the inventory during the Vidyasagarian reform to indicate the
retroflex flap The voiced retroflex flap is a type of consonantal sound, used in some Speech communication, spoken languages. The symbol in the International Phonetic Alphabet that represents this sound is , a letter ''r'' with tail, and the equivalent X-SAMPA ...
in the pronunciation of and in the middle or end of a word. It is an
allophonic In phonology, an allophone (; from the Greek , , 'other' and , , 'voice, sound') is one of multiple possible spoken soundsor '' phones''used to pronounce a single phoneme in a particular language. For example, in English, the voiceless plosi ...
development in some Indic languages not present in Sanskrit. Yet in ordinary speech these letters are pronounced the same as in modern Bengali.


Notes


Consonant conjuncts

Clusters of up to four consonants can be orthographically represented as a
typographic ligature In writing and typography, a ligature occurs where two or more graphemes or letters are joined to form a single glyph. Examples are the characters and used in English and French, in which the letters and are joined for the first ligature ...
called a consonant conjunct ( ''juktakkhôr''/''juktôbôrnô'' or more specifically ). Typically, the first consonant in the conjunct is shown above and/or to the left of the following consonants. Many consonants appear in an abbreviated or compressed form when serving as part of a conjunct. Others simply take exceptional forms in conjuncts, bearing little or no resemblance to the base character. Often, consonant conjuncts are not actually pronounced as would be implied by the pronunciation of the individual components. For example, adding ''lô'' underneath ''shô'' in Bengali creates the conjunct , which is not pronounced ''shlô'' but ''slô'' in Bengali. Many conjuncts represent Sanskrit sounds that were lost centuries before modern Bengali was ever spoken as in . It is a combination of ǰô and ñô but it is not pronounced "ǰñô" or "jnô". Instead, it is pronounced ''ggô'' in modern Bengali. Thus, as conjuncts often represent (combinations of) sounds that cannot be easily understood from the components, the following descriptions are concerned only with the construction of the conjunct, and not the resulting pronunciation. (Some graphemes may appear in a form other than the mentioned form due to the font used)


Fused forms

Some consonants fuse in such a way that one stroke of the first consonant also serves as a stroke of the next. * The consonants can be placed on top of one another, sharing their vertical line: kkô gnô glô nnô pnô ppô llô etc. * As the last member of a conjunct, ব bô can hang on the vertical line under the preceding consonants, taking the shape of ব bô (includes বফলা ''bôphôla''): gbô "ṇbô" "dbô" lbô "shbô". * The consonants can also be placed side-by-side, sharing their vertical line: ddô ndô bdô bǰô pṭô sṭô shchô shchhô, etc.


Approximated forms

Some consonants are written closer to one another simply to indicate that they are in a conjunct together. * The consonants can be placed side-by-side, appearing unaltered: dgô dghô ḍḍô. * As the last member of a conjunct, bô can appear immediately to the right of the preceding consonant, taking the shape of bô (includes বফলা ''bôphôla''): "dhbô" bbô "hbô".


Compressed forms

Some consonants are compressed (and often simplified) when appearing as the first member of a conjunct. * As the first member of a conjunct, the consonants ngô chô ḍô and bô are often compressed and placed at the top-left of the following consonant, with little or no change to the basic shape: "ngkṣô" ngkhô ngghô ngmô chchô chchhô "chnô" ḍḍhô bbô. * As the first member of a conjunct, tô is compressed and placed above the following consonant, with little or no change to the basic shape: tnô "tmô" "tbô". * As the first member of a conjunct, mô is compressed and simplified to a curved shape. It is placed above or to the top-left of the following consonant: mnô mpô mfô mbô mbhô mmô mlô. * As the first member of a conjunct, ṣô is compressed and simplified to an oval shape with a diagonal stroke through it. It is placed to the top-left of the following consonants: ṣkô ṣṭô ṣṭhô ṣpô ṣfô ṣmô. * As the first member of a conjunct, sô is compressed and simplified to a ribbon shape. It is placed above or to the top-left of the following consonant: skô skhô stô sthô snô spô sfô "sbô" "smô" slô.


Abbreviated forms

Some consonants are abbreviated when appearing in conjuncts and lose part of their basic shape. * As the first member of a conjunct, ǰô can lose its final down-stroke: ǰǰô "ǰñô" "jbô". * As the first member of a conjunct, ñô can lose its bottom half: ñchô ñchhô ñǰô ñǰhô. * As the last member of a conjunct, ñô can lose its left half (the part): "ǰñô". * As the first member of a conjunct, ṇô and pô can lose their down-stroke: ṇṭhô ṇḍô ptô psô. * As the first member of a conjunct, tô and bhô can lose their final upward tail: ttô tthô trô bhrô. * As the last member of a conjunct, thô can lose its final upstroke, taking the form of hô instead: nthô sthô mthô * As the last member of a conjunct, mô can lose its initial down-stroke: "kmô" "gmô" ngmô "ṭmô" "ṇmô" "tmô" "dmô" nmô mmô "shmô" ṣmô "smô". * As the last member of a conjunct, sô can lose its top half: ksô. * As the last member of a conjunct ṭô, ḍô and ḍhô can lose their matra: pṭô ṇḍô ṇṭô ṇḍhô. * As the last member of a conjunct ḍô can change its shape: ṇḍô


Variant forms

Some consonants have forms that are used regularly but only within conjuncts. * As the first member of a conjunct, ঙ ngô can appear as a loop and curl: ঙ্ক ngkô ঙ্গ nggô. * As the last member of a conjunct, the curled top of ধ dhô is replaced by a straight downstroke to the right, taking the form of ঝ ǰhô instead: গ্ধ gdhô দ্ধ ddhô ন্ধ ndhô ব্ধ bdhô. * As the first member of a conjunct, র rô appears as a diagonal stroke (called রেফ ''ref'') above the following member: র্ক rkô র্খ rkhô র্গ rgô র্ঘ rghô, etc. * As the last member of a conjunct, র rô appears as a wavy horizontal line (called রফলা ''rôphôla'') under the previous member: খ্র khrô গ্র grô ঘ্র ghrô ব্র brô, etc. ** In some fonts, certain conjuncts with রফলা ''rôphôla'' appear using the compressed (and often simplified) form of the previous consonant: জ্র ǰrô ট্র ṭrô ঠ্র ṭhrô ড্র ḍrô ম্র mrô স্র srô. ** In some fonts, certain conjuncts with রফলা ''rôphôla'' appear using the abbreviated form of the previous consonant: ক্র krô ত্র trô ভ্র bhrô. * As the last member of a conjunct, য jô appears as a wavy vertical line (called যফলা ''jôphôla'') to the right of the previous member: ক্য "kyô" খ্য "khyô" গ্য "gyô" ঘ্য "ghyô" etc. ** In some fonts, certain conjuncts with যফলা ''jôphôla'' appear using special fused forms: দ্য "dyô" ন্য "nyô" শ্য "shyô" ষ্য "ṣyô" স্য "syô" হ্য "hyô".


Exceptions

* When followed by rô or tô, kô takes on the same form as tô would with the addition of a curl to the right: krô, ktô. * When preceded by the abbreviated form of ñô, chô takes the shape of bô: ñchô * When preceded by another ṭô, is reduced to a leftward curl: ṭṭô. * When preceded by ṣô, ṇô appears as two loops to the right: ṣṇô. * As the first member of a conjunct, or when at the end of a word and followed by no vowel, "tô" can appear as : "tsô", "tpô", "tkô" etc. * When preceded by hô, nô appears as a curl to the right: "hnô". * Certain combinations must be memorised: (+) "kṣô", (+) "hmô".


Certain compounds

When serving as a vowel mark, উ u, ঊ u, and ঋ ri take on many exceptional forms. * উ u ** When following গ gô or শ shô, it takes on a variant form resembling the final tail of ও o: গু gu শু shu. ** When following a ত tô that is already part of a conjunct with প pô, ন nô or স sô, it is fused with the ত to resemble ও o: ন্তু ntu স্তু stu প্তু ptu. ** When following র rô, and in many fonts also following the variant রফলা ''rôphôla'', it appears as an upward curl to the right of the preceding consonant as opposed to a downward loop below: রু ru গ্রু gru ত্রু tru থ্রু thru দ্রু dru ধ্রু dhru ব্রু bru ভ্রু bhru শ্রু shru. ** When following হ hô, it appears as an extra curl: হু hu. * ঊ u ** When following র rô, and in many fonts also following the variant রফলা ''rôphôla'', it appears as a downstroke to the right of the preceding consonant as opposed to a downward hook below: রূ rū গ্রূ grū থ্রূ thrū দ্রূ drū ধ্রূ dhrū ভ্রূ bhrū শ্রূ shrū. * ঋ ri ** When following হ hô, it takes the variant shape of ঊ u: হৃ hri. * Conjuncts of three consonants also exist, and follow the same rules as above: স sô + ত tô +র rô = স্ত্র strô, ম mô + প pô + র rô = ম্প্র mprô, জ ǰô + জ ǰô + ব bô = জ্জ্ব "ǰǰbô", ক্ষ "kṣô" + ম mô = ক্ষ্ম "kṣmô". * Theoretically, four-consonant conjuncts can also be created, as in র rô + স sô + ট ṭô + র rô = র্স্ট্র rsṭrô, but they are not found in native words. * Also theoretically, 5-letter conjuncts can be created, as র rô + স sô + ট tô + র rô + ঁ = র্স্ট্রঁ (pronounced rsṭrô but nasalised: rsṭrôñ). Here ঁ is a
diacritic A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
which nasalises the previous vowel. A theoretical 6-letter conjunct would be র্স্ট্রাঁ (rsṭrañ/rsṭra), with the addition of a (আ) to র্স্ট্রঁ, and a theoretical 7-letter conjunct would be like র্স্ট্র‍্যাঁ (rsṭrya/rsṭryañ) with the addition of য to র্স্ট্রাঁ.


Diacritics and other symbols

These are mainly the Brahmi-Sanskrit diacritics, phones and punctuation marks present in languages with Sanskrit influence or Brahmi-derived scripts.


Notes


Digits and numerals

The Bengali script has ten
numerical digit A numerical digit (often shortened to just digit) or numeral is a single symbol used alone (such as "1"), or in combinations (such as "15"), to represent numbers in positional notation, such as the common base 10. The name "digit" origin ...
s (graphemes or symbols indicating the numbers from 0 to 9). Bengali numerals have no horizontal headstroke or মাত্রা "matra". Numbers larger than 9 are written in Bengali using a positional base 10 numeral system (the decimal system). A period or dot is used to denote the
decimal separator FIle:Decimal separators.svg, alt=Four types of separating decimals: a) 1,234.56. b) 1.234,56. c) 1'234,56. d) ١٬٢٣٤٫٥٦., Both a comma and a full stop (or period) are generally accepted decimal separators for international use. The apost ...
, which separates the integral and the fractional parts of a decimal number. When writing large numbers with many digits, commas are used as delimiters to group digits, indicating the thousand (হাজার ''hazar''), the hundred thousand or
lakh A lakh (; abbreviated L; sometimes written lac) is a unit in the Indian numbering system equal to one hundred thousand (100,000; scientific notation: 105). In the Indian 2, 2, 3 convention of digit grouping, it is written as 1,00,000. F ...
(লাখ ''lakh'' or লক্ষ ''lôkkhô''), and the ten million or hundred lakh or
crore Crore (; abbreviated cr) denotes the quantity ten million (107) and is equal to 100 lakh in the Indian numbering system. In many international contexts, the decimal quantity is formatted as 10,000,000, but when used in the context of the India ...
(কোটি ''koti'') units. In other words, leftwards from the decimal separator, the first grouping consists of three digits, and the subsequent groupings always consist of two digits. For example, the English number 17,557,345 will be written in traditional Bengali as ১,৭৫,৫৭,৩৪৫.


Punctuation marks

Bengali punctuation marks, apart from the downstroke দাড়ি ''dari'' (।), the Bengali equivalent of a full stop, have been adopted from western scripts and their usage is similar: Commas, semicolons, colons, quotation marks, etc. are the same as in English. Capital letters are absent in the Bengali script so proper names are unmarked. An apostrophe, known in Bengali as ঊর্ধ্বকমা ''urdhbôkôma'' "upper comma", is sometimes used to distinguish between
homograph A homograph (from the , and , ) is a word that shares the same written form as another word but has a different meaning. However, some dictionaries insist that the words must also be pronounced differently, while the Oxford English Dictionar ...
s, as in পাটা ''pata'' "plank" and পাʼটা ''pa'ta'' "the leg". Sometimes, a hyphen is used for the same purpose (as in পা-টা, an alternative of পাʼটা).


Characteristics of the Bengali text

Bengali text is written and read horizontally, from left to right. The consonant graphemes and the full form of vowel graphemes fit into an imaginary rectangle of uniform size (uniform width and height). The size of a consonant conjunct, regardless of its complexity, is deliberately maintained the same as that of a single consonant grapheme, so that diacritic vowel forms can be attached to it without any distortion. In a typical Bengali text, orthographic words, words as they are written, can be seen as being separated from each other by an even spacing. Graphemes within a word are also evenly spaced, but that spacing is much narrower than the spacing between words. Unlike in western scripts (
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
,
Cyrillic The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Ea ...
, etc.) for which the letter-forms stand on an invisible baseline, the Bengali letter-forms instead hang from a visible horizontal left-to-right headstroke called মাত্রা ''matra''. The presence and absence of this matra can be important. For example, the letter ত ''tô'' and the numeral ৩ "3" are distinguishable only by the presence or absence of the ''matra'', as is the case between the consonant cluster ত্র ''trô'' and the independent vowel এ ''e''. The letter-forms also employ the concepts of letter-width and letter-height (the vertical space between the visible matra and an invisible baseline). According to Bengali linguist Munier Chowdhury, there are about nine graphemes that are the most frequent in Bengali texts, shown with its percentage of appearance in the adjacent table.See Chowdhury 1963


Comparison of Bengali script with ancestral and related scripts


Vowels


Consonants


Vowel diacritics


Standardization

In the script, clusters of consonants are represented by different and sometimes quite irregular forms; thus, learning to read is complicated by the sheer size of the full set of letters and letter combinations, numbering about 350.
Ishwar Chandra Vidyasagar Ishwar Chandra Bandyopadhyay (26 September 1820 – 29 July 1891), popularly known as Ishwar Chandra Vidyasagar (), was an Indian educator and social reformer of the nineteenth century. His efforts to simplify and modernise Bengali language, Ben ...
introduced punctuation marks in
Bengali language Bengali, also known by its endonym and exonym, endonym Bangla (, , ), is an Indo-Aryan languages, Indo-Aryan language belonging to the Indo-Iranian languages, Indo-Iranian branch of the Indo-European languages, Indo-European language family. ...
and wrote a book named Barnaparichay to standardize Bengali alphabets. While efforts at standardising the alphabet for the Bengali language continue in such notable centres as the Bangla Academy at
Dhaka Dhaka ( or ; , ), List of renamed places in Bangladesh, formerly known as Dacca, is the capital city, capital and list of cities and towns in Bangladesh, largest city of Bangladesh. It is one of the list of largest cities, largest and list o ...
(
Bangladesh Bangladesh, officially the People's Republic of Bangladesh, is a country in South Asia. It is the List of countries and dependencies by population, eighth-most populous country in the world and among the List of countries and dependencies by ...
) and the Pôshchimbônggô Bangla Akademi at
Kolkata Kolkata, also known as Calcutta ( its official name until 2001), is the capital and largest city of the Indian state of West Bengal. It lies on the eastern bank of the Hooghly River, west of the border with Bangladesh. It is the primary ...
(
West Bengal West Bengal (; Bengali language, Bengali: , , abbr. WB) is a States and union territories of India, state in the East India, eastern portion of India. It is situated along the Bay of Bengal, along with a population of over 91 million inhabi ...
, India), it is still not quite uniform yet, as many people continue to use various archaic forms of letters, resulting in concurrent forms for the same sounds.


Romanization

Romanization of Bengali is the representation of the
Bengali language Bengali, also known by its endonym and exonym, endonym Bangla (, , ), is an Indo-Aryan languages, Indo-Aryan language belonging to the Indo-Iranian languages, Indo-Iranian branch of the Indo-European languages, Indo-European language family. ...
in the
Latin script The Latin script, also known as the Roman script, is a writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae in Magna Graecia. The Gree ...
. There are various ways of Romanization systems of Bengali, created in recent years but failed to represent the true Bengali phonetic sound. While different standards for romanisation have been proposed for Bengali, they have not been adopted with the degree of uniformity seen in languages such as Japanese or Sanskrit.In Japanese, there is some debate as to whether to accent certain distinctions, such as Tōhoku vs Tohoku. Sanskrit is well-standardized because the speaking community is relatively small, and
sound change In historical linguistics, a sound change is a change in the pronunciation of a language. A sound change can involve the replacement of one speech sound (or, more generally, one phonetic feature value) by a different one (called phonetic chan ...
is not a large concern.
The Bengali alphabet has often been included with the group of Brahmic scripts for romanisation in which the true phonetic value of Bengali is never represented. Some of them are the International Alphabet of Sanskrit Transliteration or "IAST system", "Indian languages Transliteration" or ITRANS (uses upper case alphabets suited for
ASCII ASCII ( ), an acronym for American Standard Code for Information Interchange, is a character encoding standard for representing a particular set of 95 (English language focused) printable character, printable and 33 control character, control c ...
keyboards), and the extension of IAST intended for non-Sanskrit languages of the Indian region called the National Library at Kolkata romanisation.


Sample texts

Article 1 of the Universal Declaration of Human Rights The first line is the Bengali alphabet; the second a phonetic Romanization, the third IPA.


Unicode

Bengali script was added to the
Unicode Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
Standard in October 1991 with the release of version 1.0. The Unicode block for Bengali is U+0980–U+09FF:


See also

* Bengali Braille * Robert B. Wray movable type for Bengali (1778) * Bengali phonology *
Bengali language Bengali, also known by its endonym and exonym, endonym Bangla (, , ), is an Indo-Aryan languages, Indo-Aryan language belonging to the Indo-Iranian languages, Indo-Iranian branch of the Indo-European languages, Indo-European language family. ...
* Barnaparichay, book by
Ishwar Chandra Vidyasagar Ishwar Chandra Bandyopadhyay (26 September 1820 – 29 July 1891), popularly known as Ishwar Chandra Vidyasagar (), was an Indian educator and social reformer of the nineteenth century. His efforts to simplify and modernise Bengali language, Ben ...
* Bangla Academy * Paschimbanga Bangla Academi * Eastern Nagari script * Bengali Unicode * Mayek (disambiguation)


Notes


References


Bibliography

* * * * * * * * {{list of writing systems Bengali language
Alphabet An alphabet is a standard set of letter (alphabet), letters written to represent particular sounds in a spoken language. Specifically, letters largely correspond to phonemes as the smallest sound segments that can distinguish one word from a ...
Languages of Bangladesh Languages of India