Speech Recognition Manager

Speech is human vocal communication using language. Each language uses phonetic combinations of vowel and consonant sounds that form the sound of its words (that is, all English words sound different from all French words, even if they are the same word, e.g., "role" or "hotel"), and uses those words in their semantic character as words in the lexicon of a language according to the syntactic constraints that govern lexical words' function in a sentence. In speaking, speakers perform many different intentional speech acts, e.g., informing, declaring, asking, persuading, directing, and can use enunciation, intonation, degrees of loudness, tempo, and other non-representational or paralinguistic aspects of vocalization to convey meaning. In their speech, speakers also unintentionally communicate many aspects of their social position such as sex, age, place of origin (through accent), physical states (alertness and sleepiness, vigor or weakness, health or illness), psychological states (emotions or moods), physico-psychological states (sobriety or drunkenness, normal consciousness and trance states), education or experience, and the like.

Although people ordinarily use speech in dealing with other persons (or animals), when people swear they do not always mean to communicate anything to anyone, and sometimes in expressing urgent emotions or desires they use speech as a quasi-magical cause, as when they encourage a player in a game to do or warn them not to do something. There are also many situations in which people engage in solitary speech. People talk to themselves sometimes in acts that are a development of what some psychologists (e.g., Lev Vygotsky) have maintained is the use of silent speech in an interior monologue to vivify and organize cognition, sometimes in the momentary adoption of a dual persona as self addressing self as though addressing another person. Solo speech can be used to memorize or to test one's memorization of things, and in prayer or in meditation (e.g., the use of a mantra).

Researchers study many different aspects of speech: speech production and speech perception of the sounds used in a language, speech repetition, speech errors, the ability to map heard spoken words onto the vocalizations needed to recreate them, which plays a key role in children's enlargement of their vocabulary, and what different areas of the human brain, such as Broca's area and Wernicke's area, underlie speech. Speech is the subject of study for linguistics, cognitive science, communication studies, psychology, computer science, speech pathology, otolaryngology, and acoustics. Speech compares with written language, which may differ in its vocabulary, syntax, and phonetics from the spoken language, a situation called diglossia.

The evolutionary origins of speech are unknown and subject to much debate and speculation. While animals also communicate using vocalizations, and trained apes such as Washoe and Kanzi can use simple sign language, no animals' vocalizations are articulated phonemically and syntactically, so they do not constitute speech.


Evolution

Although related to the more general problem of the origin of language, the evolution of distinctively human speech capacities has become a distinct and in many ways separate area of scientific research. The topic is a separate one because language is not necessarily spoken: it can equally be written or signed. Speech is in this sense optional, although it is the default modality for language.

Monkeys, non-human apes and humans, like many other animals, have evolved specialised mechanisms for producing ''sound'' for purposes of social communication. On the other hand, no monkey or ape uses its ''tongue'' for such purposes. The human species' unprecedented use of the tongue, lips and other moveable parts seems to place speech in a quite separate category, making its evolutionary emergence an intriguing theoretical challenge in the eyes of many scholars. Determining the timeline of human speech evolution is made additionally challenging by the lack of data in the fossil record. The human vocal tract does not fossilize, and indirect evidence of vocal tract changes in hominid fossils has proven inconclusive.


Production

Speech production is an unconscious multi-step process by which thoughts are turned into spoken utterances. Production involves the unconscious mind selecting appropriate words and the appropriate form of those words from the lexicon and morphology, and the organization of those words through the syntax. Then, the phonetic properties of the words are retrieved and the sentence is articulated through the articulations associated with those phonetic properties.

In linguistics, articulatory phonetics is the study of how the tongue, lips, jaw, vocal cords, and other speech organs are used to make sounds. Speech sounds are categorized by manner of articulation and place of articulation. Place of articulation refers to where in the neck or mouth the airstream is constricted. Manner of articulation refers to the manner in which the speech organs interact, such as how closely the air is restricted, what form of airstream is used (e.g. pulmonic, implosives, ejectives, and clicks), whether or not the vocal cords are vibrating, and whether the nasal cavity is opened to the airstream. The concept is primarily used for the production of consonants, but can be used for vowels in qualities such as voicing and nasalization. For any place of articulation, there may be several manners of articulation, and therefore several homorganic consonants.

Normal human speech is pulmonic, produced with pressure from the lungs, which creates phonation in the glottis in the larynx, which is then modified by the vocal tract and mouth into different vowels and consonants. However, humans can pronounce words without the use of the lungs and glottis in alaryngeal speech, of which there are three types: esophageal speech, pharyngeal speech and buccal speech (better known as Donald Duck talk).


Errors

Speech production is a complex activity, and as a consequence errors are common, especially in children. Speech errors come in many forms and are used to provide evidence to support hypotheses about the nature of speech. As a result, speech errors are often used in the construction of models for language production and child language acquisition. For example, the fact that children often make the error of over-regularizing the -ed past tense suffix in English (e.g. saying 'singed' instead of 'sang') shows that the regular forms are acquired earlier. Speech errors associated with certain kinds of aphasia have been used to map certain components of speech onto the brain and to see the relation between different aspects of production. For example, the difficulty that patients with expressive aphasia have in producing regular past-tense verbs, but not irregulars like 'sing-sang', has been used to demonstrate that regular inflected forms of a word are not individually stored in the lexicon, but are produced by affixation to the base form.


Perception

Speech perception refers to the processes by which humans can interpret and understand the sounds used in language. The study of speech perception is closely linked to the fields of phonetics and phonology in linguistics and cognitive psychology and perception in psychology. Research in speech perception seeks to understand how listeners recognize speech sounds and use this information to understand spoken language. Research into speech perception also has applications in building computer systems that can recognize speech, as well as improving speech recognition for hearing- and language-impaired listeners.

Speech perception is categorical, in that people put the sounds they hear into categories rather than perceiving them as a spectrum. People are more likely to be able to hear differences in sounds across categorical boundaries than within them. A good example of this is voice onset time (VOT), one aspect of the phonetic production of consonant sounds. For example, Hebrew speakers, who distinguish voiced /b/ from voiceless /p/, will more easily detect a change in VOT from -10 (perceived as /b/) to 0 (perceived as /p/) than a change in VOT from +10 to +20, or -10 to -20, despite this being an equally large change on the VOT spectrum.
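The VOT example can be illustrated with a minimal sketch. It assumes a single hard category boundary placed at a hypothetical value of -5 (somewhere between the -10 and 0 values mentioned above) and treats VOT values as milliseconds; the names `BOUNDARY_MS`, `perceive` and `crosses_boundary` are invented for illustration, and real listeners show a graded rather than an all-or-nothing boundary.

```python
# Toy model of categorical perception of voice onset time (VOT).
# BOUNDARY_MS is a hypothetical value chosen only so that -10 falls in
# the /b/ category and 0 falls in the /p/ category, as in the text.
BOUNDARY_MS = -5.0


def perceive(vot_ms: float) -> str:
    """Map a VOT value to the category a listener would report."""
    return "/b/" if vot_ms < BOUNDARY_MS else "/p/"


def crosses_boundary(vot_a: float, vot_b: float) -> bool:
    """True if the two VOT values fall into different categories."""
    return perceive(vot_a) != perceive(vot_b)


if __name__ == "__main__":
    # Three changes of equal size (10 units) on the VOT continuum.
    for a, b in [(-10, 0), (10, 20), (-20, -10)]:
        label = "across categories" if crosses_boundary(a, b) else "within a category"
        print(f"{a:+d} -> {b:+d}: {perceive(a)} to {perceive(b)} ({label})")
```

In this sketch only the change from -10 to 0 crosses the boundary, which corresponds to the change listeners are described as detecting most easily; the other two changes are equally large but stay within one category.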


Development

Most human children develop proto-speech babbling behaviors when they are four to six months old. Most will begin saying their first words at some point during the first year of life. Typical children progress from two- or three-word phrases before the age of three to short sentences by four years of age.


Repetition

In speech repetition, speech being heard is quickly turned from sensory input into motor instructions needed for its immediate or delayed vocal imitation (in phonological memory). This type of mapping plays a key role in enabling children to expand their spoken vocabulary. Masur (1995) found that how often children repeat novel words versus those they already have in their lexicon is related to the size of their lexicon later on, with young children who repeat more novel words having a larger lexicon later in development. Speech repetition could help facilitate the acquisition of this larger lexicon.


Problems

There are several organic and psychological factors that can affect speech. Among these are:
# Diseases and disorders of the lungs or the vocal cords, including paralysis, respiratory infections (bronchitis), vocal fold nodules and cancers of the lungs and throat.
# Diseases and disorders of the brain, including alogia, aphasias, dysarthria, dystonia and speech processing disorders, where impaired motor planning, nerve transmission, phonological processing or perception of the message (as opposed to the actual sound) leads to poor speech production.
# Hearing problems, such as otitis media with effusion, and listening problems, such as auditory processing disorders, can lead to phonological problems. In addition to dysphasia, anomia and auditory processing disorder impede the quality of auditory perception, and therefore, expression. Those who are deaf or hard of hearing may be considered to fall into this category.
# Articulatory problems, such as slurred speech, stuttering, lisping, cleft palate, ataxia, or nerve damage leading to problems in articulation. Tourette syndrome and tics can also affect speech. Various congenital and acquired tongue diseases can affect speech, as can motor neuron disease.
# Psychiatric disorders have been shown to change the acoustic features of speech; for instance, the fundamental frequency of the voice (perceived as pitch) tends to be significantly lower in major depressive disorder than in healthy controls. Therefore, speech is being investigated as a potential biomarker for mental health disorders.

Speech and language disorders can also result from stroke, brain injury, hearing loss, developmental delay, a cleft palate, cerebral palsy, or emotional issues.


Treatment

Speech-related diseases, disorders, and conditions can be treated by a speech-language pathologist (SLP) or speech therapist. SLPs assess levels of speech needs, make diagnoses based on the assessments, and then treat the diagnoses or address the needs.


Brain physiology


Classical model

The classical or Wernicke-Geschwind model of the language system in the brain focuses on Broca's area in the inferior prefrontal cortex, and Wernicke's area in the posterior superior temporal gyrus in the dominant hemisphere of the brain (typically the left hemisphere for language). In this model, a linguistic auditory signal is first sent from the auditory cortex to Wernicke's area. The lexicon is accessed in Wernicke's area, and these words are sent via the arcuate fasciculus to Broca's area, where morphology, syntax, and instructions for articulation are generated. This is then sent from Broca's area to the motor cortex for articulation.

Paul Broca identified an approximate region of the brain in 1861 which, when damaged in two of his patients, caused severe deficits in speech production, where his patients were unable to speak beyond a few monosyllabic words. This deficit, known as Broca's or expressive aphasia, is characterized by difficulty in speech production where speech is slow and labored, function words are absent, and syntax is severely impaired, as in telegraphic speech. In expressive aphasia, speech comprehension is generally less affected except in the comprehension of grammatically complex sentences (Hillis, A.E., & Caramazza, A. (2005). "Aphasia". In L. Nadel, ''Encyclopedia of cognitive science''. Hoboken, NJ: Wiley).

Wernicke's area is named after Carl Wernicke, who in 1874 proposed a connection between damage to the posterior area of the left superior temporal gyrus and aphasia, as he noted that not all aphasic patients had damage to the prefrontal cortex. Damage to Wernicke's area produces Wernicke's or receptive aphasia, which is characterized by relatively normal syntax and prosody but severe impairment in lexical access, resulting in poor comprehension and nonsensical or jargon speech.


Modern research

Modern models of the neurological systems behind linguistic comprehension and production recognize the importance of Broca's and Wernicke's areas, but are not limited to them nor solely to the left hemisphere. Instead, multiple streams are involved in speech production and comprehension. Damage to the left lateral sulcus has been connected with difficulty in processing and producing morphology and syntax, while lexical access and comprehension of irregular forms (e.g. eat-ate) remain unaffected. Moreover, the circuits involved in human speech comprehension dynamically adapt with learning, for example, by becoming more efficient in terms of processing time when listening to familiar messages such as learned verses.


Animal communication

Some non-human animals can produce sounds or gestures resembling those of a human language. Several species or groups of animals have developed forms of communication which superficially resemble verbal language; however, these are usually not considered a language because they lack one or more of the defining characteristics, e.g. grammar, syntax, recursion, and displacement. Researchers have been successful in teaching some animals to make gestures similar to sign language, although whether this should be considered a language has been disputed.


See also

* FOXP2
* Freedom of speech
* Imagined speech
* Index of linguistics articles
* List of language disorders
* Spatial hearing loss
* Speechwriter
* Talking birds
* Vocology
* Public speaking


Further reading

* Fitzpatrick, Élizabeth M. ''Apprendre à écouter et à parler''. University of Ottawa Press, 2013. Available at Project MUSE.


External links


* Speaking captured by real-time MRI, YouTube