Speech is a human vocal
communication
Communication (from la, communicare, meaning "to share" or "to be in relation with") is usually defined as the transmission of information. The term may also refer to the message communicated through such transmissions or the field of inqu ...
using
language
Language is a structured system of communication. The structure of a language is its grammar and the free components are its vocabulary. Languages are the primary means by which humans communicate, and may be conveyed through a variety of ...
. Each language uses
phonetic
Phonetics is a branch of linguistics that studies how humans produce and perceive sounds, or in the case of sign languages, the equivalent aspects of sign. Linguists who specialize in studying the physical properties of speech are phoneticians. ...
combinations of
vowel
A vowel is a syllabic speech sound pronounced without any stricture in the vocal tract. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness and also in quantity (l ...
and
consonant
In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract. Examples are and pronounced with the lips; and pronounced with the front of the tongue; and pronounced ...
sounds that form the sound of its words (that is, all English words sound different from all French words, even if they are the same word, e.g., "role" or "hotel"), and using those words in their semantic character as words in the
lexicon of a language according to the
syntactic
In linguistics, syntax () is the study of how words and morphemes combine to form larger units such as phrases and sentences. Central concerns of syntax include word order, grammatical relations, hierarchical sentence structure ( constituenc ...
constraints that
govern lexical words' function in a sentence. In speaking, speakers perform many different intentional
speech act
In the philosophy of language and linguistics, speech act is something expressed by an individual that not only presents information but performs an action as well. For example, the phrase "I would like the kimchi; could you please pass it to me? ...
s, e.g., informing, declaring, asking, persuading, directing, and can use
enunciation,
intonation, degrees of
loudness,
tempo
In musical terminology, tempo ( Italian, 'time'; plural ''tempos'', or ''tempi'' from the Italian plural) is the speed or pace of a given piece. In classical music, tempo is typically indicated with an instruction at the start of a piece (ofte ...
, and other non-representational or
paralinguistic aspects of vocalization to convey meaning. In their speech, speakers also unintentionally communicate many aspects of their social position such as sex, age, place of origin (through
accent Accent may refer to:
Speech and language
* Accent (sociolinguistics), way of pronunciation particular to a speaker or group of speakers
* Accent (phonetics), prominence given to a particular syllable in a word, or a word in a phrase
** Pitch ac ...
), physical states (alertness and sleepiness, vigor or weakness, health or illness), psychological states (emotions or moods), physico-psychological states (sobriety or
drunkenness, normal consciousness and
trance
Trance is a state of semi-consciousness in which a person is not self-aware and is either altogether unresponsive to external stimuli (but nevertheless capable of pursuing and realizing an aim) or is selectively responsive in following the dir ...
states), education or experience, and the like.
Although people ordinarily use speech in dealing with other persons (or animals), when people
swear they do not always mean to communicate anything to anyone, and sometimes in expressing urgent emotions or desires they use speech as a quasi-magical cause, as when they encourage a player in a game to do or warn them not to do something. There are also many situations in which people engage in solitary speech. People
talk to themselves sometimes in acts that are a development of what some
psychologist
A psychologist is a professional who practices psychology and studies mental states, perceptual, cognitive, emotional, and social processes and behavior. Their work often involves the experimentation, observation, and interpretation of how ...
s (e.g.,
Lev Vygotsky
Lev Semyonovich Vygotsky (russian: Лев Семёнович Выго́тский, p=vɨˈɡotskʲɪj; be, Леў Сямёнавіч Выго́цкі, p=vɨˈɡotskʲɪj; – June 11, 1934) was a Soviet psychologist, known for his work on psy ...
) have maintained is the use of silent speech in an
interior monologue to vivify and organize
cognition
Cognition refers to "the mental action or process of acquiring knowledge and understanding through thought, experience, and the senses". It encompasses all aspects of intellectual functions and processes such as: perception, attention, thoug ...
, sometimes in the momentary adoption of a dual persona as self addressing self as though addressing another person. Solo speech can be used
to memorize or to test one's memorization of things, and in
prayer
Prayer is an invocation or act that seeks to activate a rapport with an object of worship through deliberate communication. In the narrow sense, the term refers to an act of supplication or intercession directed towards a deity or a deifi ...
or in
meditation
Meditation is a practice in which an individual uses a technique – such as mindfulness, or focusing the mind on a particular object, thought, or activity – to train attention and awareness, and achieve a mentally clear and emotionally calm ...
(e.g., the use of a
mantra
A mantra ( Pali: ''manta'') or mantram (मन्त्रम्) is a sacred utterance, a numinous sound, a syllable, word or phonemes, or group of words in Sanskrit, Pali and other languages believed by practitioners to have religious, ...
).
Researchers study many different aspects of speech: speech production and
speech perception
Speech perception is the process by which the sounds of language are heard, interpreted, and understood. The study of speech perception is closely linked to the fields of phonology and phonetics in linguistics and cognitive psychology and perc ...
of the
sound
In physics, sound is a vibration that propagates as an acoustic wave, through a transmission medium such as a gas, liquid or solid.
In human physiology and psychology, sound is the ''reception'' of such waves and their ''perception'' by ...
s used in a language,
speech repetition
250px, Children copy with their own mouths the words spoken by the mouths of those around them. That enables them to learn the pronunciation of words not already in their vocabulary.
Speech repetition occurs when individuals speech, speak the so ...
,
speech errors, the ability to map heard spoken words onto the vocalizations needed to recreate them, which plays a key role in
children's enlargement of their
vocabulary
A vocabulary is a set of familiar words within a person's language. A vocabulary, usually developed with age, serves as a useful and fundamental tool for communication and acquiring knowledge. Acquiring an extensive vocabulary is one of the la ...
, and what different areas of the
human brain
The human brain is the central organ (anatomy), organ of the human nervous system, and with the spinal cord makes up the central nervous system. The brain consists of the cerebrum, the brainstem and the cerebellum. It controls most of the act ...
, such as
Broca's area and
Wernicke's area, underlie speech. Speech is the subject of study for
linguistics
Linguistics is the scientific study of human language. It is called a scientific study because it entails a comprehensive, systematic, objective, and precise analysis of all aspects of language, particularly its nature and structure. Lingu ...
,
cognitive science,
communication studies,
psychology
Psychology is the scientific study of mind and behavior. Psychology includes the study of conscious and unconscious phenomena, including feelings and thoughts. It is an academic discipline of immense scope, crossing the boundaries betwe ...
,
computer science
Computer science is the study of computation, automation, and information. Computer science spans theoretical disciplines (such as algorithms, theory of computation, information theory, and automation) to practical disciplines (includin ...
,
speech pathology,
otolaryngology, and
acoustics.
Speech compares with
written language
A written language is the representation of a spoken or gestural language by means of a writing system. Written language is an invention in that it must be taught to children, who will pick up spoken language or sign language by exposure even ...
, which may differ in its vocabulary, syntax, and phonetics from the spoken language, a situation called
diglossia
In linguistics, diglossia () is a situation in which two dialects or languages are used (in fairly strict compartmentalization) by a single language community. In addition to the community's everyday or vernacular language variety (labeled " ...
.
The evolutionary
origins of speech are unknown and subject to much debate and
speculation. While
animals also communicate using vocalizations, and trained
apes such as
Washoe and
Kanzi can use simple
sign language
Sign languages (also known as signed languages) are languages that use the visual-manual modality to convey meaning, instead of spoken words. Sign languages are expressed through manual articulation in combination with non-manual markers. Sign l ...
, no animals' vocalizations are articulated phonemically and syntactically, and do not constitute speech.
Evolution
Although related to the more general problem of the
origin of language, the
evolution
Evolution is change in the heritable characteristics of biological populations over successive generations. These characteristics are the expressions of genes, which are passed on from parent to offspring during reproduction. Variation ...
of distinctively human speech capacities has become a distinct and in many ways separate area of scientific research.
The topic is a separate one because language is not necessarily spoken: it can equally be
written or
signed. Speech is in this sense optional, although it is the default modality for language.
Monkey
Monkey is a common name that may refer to most mammals of the infraorder Simiiformes, also known as the simians. Traditionally, all animals in the group now known as simians are counted as monkeys except the apes, which constitutes an incom ...
s,
non-human apes and humans, like many other animals, have evolved specialised mechanisms for producing ''sound'' for purposes of social communication. On the other hand, no monkey or ape uses its ''tongue'' for such purposes.
The human species' unprecedented use of the tongue, lips and other moveable parts seems to place speech in a quite separate category, making its evolutionary emergence an intriguing theoretical challenge in the eyes of many scholars.
Determining the timeline of human speech evolution is made additionally challenging by the lack of data in the fossil record. The human
vocal tract does not fossilize, and indirect evidence of vocal tract changes in hominid fossils has proven inconclusive.
Production
Speech production is an unconscious multi-step process by which thoughts are generated into spoken utterances. Production involves the unconscious mind selecting appropriate words and the appropriate form of those words from the lexicon and morphology, and the organization of those words through the syntax. Then, the phonetic properties of the words are retrieved and the sentence is articulated through the articulations associated with those phonetic properties.
In
linguistics
Linguistics is the scientific study of human language. It is called a scientific study because it entails a comprehensive, systematic, objective, and precise analysis of all aspects of language, particularly its nature and structure. Lingu ...
,
articulatory phonetics
The field of articulatory phonetics is a subfield of phonetics that studies articulation and ways that humans produce speech. Articulatory phoneticians explain how humans produce speech sounds via the interaction of different physiological struct ...
is the study of how the tongue, lips, jaw, vocal cords, and other speech organs are used to make sounds. Speech sounds are categorized by
manner of articulation
In articulatory phonetics, the manner of articulation is the configuration and interaction of the articulators ( speech organs such as the tongue, lips, and palate) when making a speech sound. One parameter of manner is ''stricture,'' that is, ...
and
place of articulation
In articulatory phonetics, the place of articulation (also point of articulation) of a consonant is a location along the vocal tract where its production occurs. It is a point where a constriction is made between an active and a passive articul ...
. Place of articulation refers to where in the neck or mouth the airstream is constricted. Manner of articulation refers to the manner in which the speech organs interact, such as how closely the air is restricted, what form of airstream is used (e.g.
pulmonic
In phonetics, the airstream mechanism is the method by which airflow is created in the vocal tract. Along with phonation and articulation, it is one of three main components of speech production. The airstream mechanism is mandatory for sound ...
, implosive, ejectives, and clicks), whether or not the vocal cords are vibrating, and whether the nasal cavity is opened to the airstream. The concept is primarily used for the production of
consonant
In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract. Examples are and pronounced with the lips; and pronounced with the front of the tongue; and pronounced ...
s, but can be used for
vowels
A vowel is a syllabic speech sound pronounced without any stricture in the vocal tract. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness and also in quantity (l ...
in qualities such as
voicing and
nasalization
In phonetics, nasalization (or nasalisation) is the production of a sound while the velum is lowered, so that some air escapes through the nose during the production of the sound by the mouth. An archetypal nasal sound is .
In the Internation ...
. For any place of articulation, there may be several manners of articulation, and therefore several
homorganic consonants.
Normal human speech is pulmonic, produced with pressure from the
lungs, which creates
phonation in the
glottis
The glottis is the opening between the vocal folds (the rima glottidis). The glottis is crucial in producing vowels and voiced consonants.
Etymology
From Ancient Greek ''γλωττίς'' (glōttís), derived from ''γλῶττα'' (glôtta), ...
in the
larynx, which is then modified by the vocal tract and mouth into different vowels and consonants. However humans can pronounce words without the use of the lungs and glottis in
alaryngeal speech, of which there are three types:
esophageal speech, pharyngeal speech and buccal speech (better known as
Donald Duck talk).
Errors
Speech production is a complex activity, and as a consequence errors are common, especially in children. Speech errors come in many forms and are used to provide evidence to support hypotheses about the nature of speech. As a result, speech errors are often used in the construction of models for language production and
child language acquisition. For example, the fact that children often make the error of over-regularizing the -ed past tense suffix in English (e.g. saying 'singed' instead of 'sang') shows that the regular forms are acquired earlier. Speech errors associated with certain kinds of aphasia have been used to map certain components of speech onto the brain and see the relation between different aspects of production; for example, the difficulty of
expressive aphasia
Expressive aphasia, also known as Broca's aphasia, is a type of aphasia characterized by partial loss of the ability to produce language ( spoken, manual, or written), although comprehension generally remains intact. A person with expressive aph ...
patients in producing regular past-tense verbs, but not irregulars like 'sing-sang' has been used to demonstrate that regular inflected forms of a word are not individually stored in the lexicon, but produced from affixation to the base form.
Perception
Speech perception refers to the processes by which humans can interpret and understand the sounds used in language. The study of speech perception is closely linked to the fields of
phonetics
Phonetics is a branch of linguistics that studies how humans produce and perceive sounds, or in the case of sign languages, the equivalent aspects of sign. Linguists who specialize in studying the physical properties of speech are phoneticians. ...
and
phonology
Phonology is the branch of linguistics that studies how languages or dialects systematically organize their sounds or, for sign languages, their constituent parts of signs. The term can also refer specifically to the sound or sign system of a ...
in linguistics and cognitive psychology and perception in psychology. Research in speech perception seeks to understand how listeners recognize speech sounds and use this information to understand
spoken language
A spoken language is a language produced by articulate sounds or (depending on one's definition) manual gestures, as opposed to a written language. An oral language or vocal language is a language produced with the vocal tract in contrast with a si ...
. Research into speech perception also has applications in building
computer systems that can recognize speech, as well as improving speech recognition for hearing- and language-impaired listeners.
Speech perception is
categorical, in that people put the sounds they hear into categories rather than perceiving them as a spectrum. People are more likely to be able to hear differences in sounds across categorical boundaries than within them. A good example of this is
voice onset time
In phonetics, voice onset time (VOT) is a feature of the production of stop consonants. It is defined as the length of time that passes between the release of a stop consonant and the onset of voicing, the vibration of the vocal folds, or, acco ...
(VOT), one aspect of the phonetic production of consonant sounds. For example, Hebrew speakers, who distinguish voiced /b/ from voiceless /p/, will more easily detect a change in VOT from -10 ( perceived as /b/ ) to 0 ( perceived as /p/ ) than a change in VOT from +10 to +20, or -10 to -20, despite this being an equally large change on the VOT spectrum.
Development
Most human children develop proto-speech babbling behaviors when they are four to six months old. Most will begin saying their first words at some point during the first year of life. Typical children progress through two or three word phrases before they are three to short sentences by four years of age.
Repetition
In speech repetition, speech being heard is quickly turned from sensory input into motor instructions needed for its immediate or delayed vocal imitation (in
phonological memory). This type of mapping plays a key role in enabling children to expand their spoken vocabulary. Masur (1995) found that how often children repeat novel words versus those they already have in their lexicon is related to the size of their lexicon later on, with young children who repeat more novel words having a larger lexicon later in development. Speech repetition could help facilitate the acquisition of this larger lexicon.
Problems
There are several organic and psychological factors that can affect speech. Among these are:
# Diseases and disorders of the
lungs or the
vocal cords
In humans, vocal cords, also known as vocal folds or voice reeds, are folds of throat tissues that are key in creating sounds through vocalization. The size of vocal cords affects the pitch of voice. Open when breathing and vibrating for speech ...
, including
paralysis
Paralysis (also known as plegia) is a loss of motor function in one or more muscles. Paralysis can also be accompanied by a loss of feeling (sensory loss) in the affected area if there is sensory damage. In the United States, roughly 1 in 5 ...
, respiratory infections (bronchitis),
vocal fold nodules and
cancer
Cancer is a group of diseases involving abnormal cell growth with the potential to invade or spread to other parts of the body. These contrast with benign tumors, which do not spread. Possible signs and symptoms include a lump, abnormal bl ...
s of the lungs and throat.
# Diseases and disorders of the
brain
The brain is an organ that serves as the center of the nervous system in all vertebrate and most invertebrate animals. It consists of nervous tissue and is typically located in the head ( cephalization), usually near organs for special ...
, including
alogia,
aphasia
Aphasia is an inability to comprehend or formulate language because of damage to specific brain regions. The major causes are stroke and head trauma; prevalence is hard to determine but aphasia due to stroke is estimated to be 0.1–0.4% in t ...
s,
dysarthria,
dystonia and
speech processing disorders, where impaired
motor planning, nerve transmission, phonological processing or perception of the message (as opposed to the actual sound) leads to poor speech production.
# Hearing problems, such as
otitis media with effusion
Otitis media is a group of inflammatory diseases of the middle ear. One of the two main types is acute otitis media (AOM), an infection of rapid onset that usually presents with ear pain. In young children this may result in pulling at the ear, ...
, and listening problems,
auditory processing disorders, can lead to phonological problems. In addition to
dysphasia,
anomia and auditory processing disorder impede the quality of auditory perception, and therefore, expression. Those who are
deaf or hard of hearing may be considered to fall into this category.
# Articulatory problems, such as slurred speech,
stuttering,
lisping
A lisp is a speech impairment in which a person misarticulates sibilants (, , , , , , , ). These misarticulations often result in unclear speech.
Types
* A frontal lisp occurs when the tongue is placed anterior to the target. Interdental lisping ...
,
cleft palate,
ataxia, or
nerve
A nerve is an enclosed, cable-like bundle of nerve fibers (called axons) in the peripheral nervous system.
A nerve transmits electrical impulses. It is the basic unit of the peripheral nervous system. A nerve provides a common pathway for the ...
damage leading to problems in
articulation.
Tourette syndrome and
tics can also affect speech. Various
congenital
A birth defect, also known as a congenital disorder, is an abnormal condition that is present at birth regardless of its cause. Birth defects may result in disabilities that may be physical, intellectual, or developmental. The disabilities c ...
and acquired
tongue diseases can affect speech as can
motor neuron disease.
#
Psychiatric
Psychiatry is the medical specialty devoted to the diagnosis, prevention, and treatment of mental disorders. These include various maladaptations related to mood, behaviour, cognition, and perceptions. See glossary of psychiatry.
Initial psyc ...
disorders have been shown to change speech acoustic features, where for instance,
fundamental frequency of voice (perceived as pitch) tends to be significantly lower in
major depressive disorder
Major depressive disorder (MDD), also known as clinical depression, is a mental disorder characterized by at least two weeks of pervasive low mood, low self-esteem, and loss of interest or pleasure in normally enjoyable activities. Introdu ...
than in healthy controls.
Therefore, speech is being investigated as a potential biomarker for mental health disorders.
Speech and language disorders can also result from stroke, brain injury, hearing loss, developmental delay, a cleft palate, cerebral palsy, or emotional issues.
Treatment
Speech-related diseases, disorders, and conditions can be treated by a speech-language pathologist (SLP) or speech therapist. SLPs assess levels of speech needs, make diagnoses based on the assessments, and then treat the diagnoses or address the needs.
Brain physiology
Classical model
The classical or
Wernicke-Geschwind model of the language system in the brain focuses on
Broca's area in the inferior
prefrontal cortex
In mammalian brain anatomy, the prefrontal cortex (PFC) covers the front part of the frontal lobe of the cerebral cortex. The PFC contains the Brodmann areas BA8, BA9, BA10, BA11, BA12, BA13, BA14, BA24, BA25, BA32, BA44, BA45, BA ...
, and
Wernicke's area in the posterior
superior temporal gyrus on the
dominant hemisphere of the brain (typically the left hemisphere for language). In this model, a linguistic auditory signal is first sent from the
auditory cortex to Wernicke's area. The
lexicon is accessed in Wernicke's area, and these words are sent via the
arcuate fasciculus to Broca's area, where morphology, syntax, and instructions for articulation are generated. This is then sent from Broca's area to the
motor cortex
The motor cortex is the region of the cerebral cortex believed to be involved in the planning, control, and execution of voluntary movements.
The motor cortex is an area of the frontal lobe located in the posterior precentral gyrus immediately ...
for articulation.
Paul Broca identified an approximate region of the brain in 1861 which, when damaged in two of his patients, caused severe deficits in speech production, where his patients were unable to speak beyond a few monosyllabic words. This deficit, known as Broca's or
expressive aphasia
Expressive aphasia, also known as Broca's aphasia, is a type of aphasia characterized by partial loss of the ability to produce language ( spoken, manual, or written), although comprehension generally remains intact. A person with expressive aph ...
, is characterized by difficulty in speech production where speech is slow and labored, function words are absent, and syntax is severely impaired, as in
telegraphic speech. In expressive aphasia, speech comprehension is generally less affected except in the comprehension of grammatically complex sentences.
[Hillis, A.E., & Caramazza, A. (2005). "Aphasia". In L. Nadel, ''Encyclopedia of cognitive science''. Hoboken, NJ: Wiley.] Wernicke's area is named after
Carl Wernicke, who in 1874 proposed a connection between damage to the posterior area of the left superior temporal gyrus and aphasia, as he noted that not all aphasic patients had had damage to the prefrontal cortex. Damage to Wernicke's area produces Wernicke's or
receptive aphasia, which is characterized by relatively normal syntax and prosody but severe impairment in lexical access, resulting in poor comprehension and nonsensical or
jargon speech.
Modern research
Modern models of the neurological systems behind linguistic comprehension and production recognize the importance of Broca's and Wernicke's areas, but are not limited to them nor solely to the left hemisphere.
Instead, multiple streams are involved in speech production and comprehension. Damage to the left
lateral sulcus has been connected with difficulty in processing and producing morphology and syntax, while lexical access and comprehension of irregular forms (e.g. eat-ate) remain unaffected.
Moreover, the circuits involved in human speech comprehension dynamically adapt with learning, for example, by becoming more efficient in terms of processing time when listening to familiar messages such as learned verses.
Animal communication
Some non-human animals can produce sounds or gestures resembling those of a human language. Several species or groups of animals have developed
forms of communication which superficially resemble verbal language, however, these usually are not considered a language because they lack one or more of the
defining characteristics, e.g.
grammar
In linguistics, the grammar of a natural language is its set of structure, structural constraints on speakers' or writers' composition of clause (linguistics), clauses, phrases, and words. The term can also refer to the study of such constraint ...
,
syntax,
recursion
Recursion (adjective: ''recursive'') occurs when a thing is defined in terms of itself or of its type. Recursion is used in a variety of disciplines ranging from linguistics to logic. The most common application of recursion is in mathematic ...
, and
displacement. Researchers have been successful in teaching some animals to make gestures similar to
sign language
Sign languages (also known as signed languages) are languages that use the visual-manual modality to convey meaning, instead of spoken words. Sign languages are expressed through manual articulation in combination with non-manual markers. Sign l ...
, although whether this should be considered a language has been disputed.
See also
*
FOXP2
Forkhead box protein P2 (FOXP2) is a protein that, in humans, is encoded by the ''FOXP2'' gene. FOXP2 is a member of the forkhead box family of transcription factors, proteins that regulate gene expression by binding to DNA. It is expressed ...
*
Freedom of speech
*
Imagined speech
Imagined speech (also called silent speech, covert speech, inner speech, or, in the original Latin terminology used by clinicians, endophasia) is thinking in the form of sound – “hearing” one’s own voice silently to oneself, without the in ...
*
Index of linguistics articles
*
List of language disorders
*
Spatial hearing loss
*
Speechwriter
*
Talking bird
Talking birds are birds that can mimic the speech of humans. There is debate within the scientific community over whether some talking parrots also have some cognitive understanding of the language. Birds have varying degrees of talking ability: ...
s
*
Vocology Vocology is the science and practice of vocal habilitation, or vocal training and therapy.Titze IR. (1996). What is vocology? Logopedics Phoniatrics Vocology, 21:5-6. Its concerns include the nature of speech and language pathology, the defects of ...
*
Public speaking
Public speaking, also called oratory or oration, has traditionally meant the act of speaking face to face to a live audience. Today it includes any form of speaking (formally and informally) to an audience, including pre-recorded speech delive ...
References
Further reading
* Fitzpatrick, Élizabeth M. ''Apprendre à écouter et à parler''.
University of Ottawa Press, 2013
Available atProject MUSE
Project MUSE, a non-profit collaboration between libraries and publishers, is an online database of peer-reviewed academic journals and electronic books. Project MUSE contains digital humanities and social science content from over 250 univers ...
.
External links
Speaking captured by real-time MRI YouTube
YouTube is a global online video sharing and social media platform headquartered in San Bruno, California. It was launched on February 14, 2005, by Steve Chen, Chad Hurley, and Jawed Karim. It is owned by Google, and is the second most ...
{{Nonverbal communication
Language
Articles containing video clips