Spoken English Corpus
   HOME
*





Spoken English Corpus
The Spoken English Corpus (SEC) is a speech corpus collection of recordings of spoken British English compiled during 1984-7. The corpus manual can be found on ICAME. History The Spoken English Corpus (SEC) project was supported jointly in 1984-5 by the Humanities Research Fund at Lancaster University and by IBM (UK) Ltd, and subsequently by IBM UK Ltd. The project was supported by Geoffrey Leech at Lancaster and Geoffrey Kaye at IBM. The project was a collaboration, funded by IBM, between the Unit for Computer Research on the English Language (UCREL) at the University of Lancaster and the IBM Scientific Centre in Winchester. Compilation SEC comprises 53 recorded passages, mainly from the BBC, spoken in the accent usually referred to as Received Pronunciation, or RP. The collection covers categories such as commentary, news broadcast, lecture, dialogue, poetry and propaganda. The corpus contains 52,637 words, totalling 339 minutes. The compilation of the corpus is described by ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Speech Corpus
A speech corpus (or spoken corpus) is a database of speech audio files and text transcriptions. In speech technology, speech corpora are used, among other things, to create acoustic models (which can then be used with a speech recognition or speaker identification engine). In linguistics, spoken corpora are used to do research into phonetic, conversation analysis, dialectology and other fields. A corpus is one such database. Corpora is the plural of corpus (i.e. it is many such databases). There are two types of Speech Corpora: # Read Speech – which includes: #* Book excerpts #* Broadcast news #* Lists of words #* Sequences of numbers # Spontaneous Speech – which includes: #* Dialogs – between two or more people (includes meetings; one such corpus is the KEC); #* Narratives – a person telling a story (one such corpus is the Buckeye Corpus); #* Map-tasks – one person explains a route on a map to another; #* Appointment-tasks – two people try to find a common meeti ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Prosodic
In linguistics, prosody () is concerned with elements of speech that are not individual phonetic segments (vowels and consonants) but are properties of syllables and larger units of speech, including linguistic functions such as intonation, stress, and rhythm. Such elements are known as suprasegmentals. Prosody may reflect features of the speaker or the utterance: their emotional state; the form of utterance (statement, question, or command); the presence of irony or sarcasm; emphasis, contrast, and focus. It may reflect elements of language not encoded by grammar or choice of vocabulary. Attributes of prosody In the study of prosodic aspects of speech, it is usual to distinguish between auditory measures ( subjective impressions produced in the mind of the listener) and objective measures (physical properties of the sound wave and physiological characteristics of articulation that may be measured objectively). Auditory (subjective) and objective ( acoustic and articulatory) ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Geoffrey Sampson
Geoffrey Sampson (born 1944) is Professor of Natural Language Computing in the Department of Informatics, University of Sussex.Geoffrey Sampson
staff bio page.
He produces annotation standards for compiling (databases) of ordinary usage of the English language. His work has been applied in automatic language-understanding software, and in writing-skills training. He has also analysed

picture info

GNU GPL
The GNU General Public License (GNU GPL or simply GPL) is a series of widely used free software licenses that guarantee end users the four freedoms to run, study, share, and modify the software. The license was the first copyleft for general use and was originally written by the founder of the Free Software Foundation (FSF), Richard Stallman, for the GNU Project. The license grants the recipients of a computer program the rights of the Free Software Definition. These GPL series are all copyleft licenses, which means that any derivative work must be distributed under the same or equivalent license terms. It is more restrictive than the Lesser General Public License and even further distinct from the more widely used permissive software licenses BSD, MIT, and Apache. Historically, the GPL license family has been one of the most popular software licenses in the free and open-source software domain. Prominent free software programs licensed under the GPL include the Linux kernel a ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




Turn Construction Unit
A turn construction unit (TCU) is the fundamental segment of speech in a conversation, as analysed in conversation analysis. The idea was introduced in "A Simplest Systematics for the Organization of Turn-Taking for Conversation" by Harvey Sacks, Emanuel Schegloff, and Gail Jefferson in 1974. It describes pieces of conversation which may comprise an entire speaking turn by a speaker. The model is designed to explain that when people talk in conversation, they do not always talk all at the same time, but generally, one person speaks at a time, and then another person can follow. Such a contribution to a conversation by one speaker is then a ''turn''. A turn is created through certain forms or units that listeners can recognize and count on, called ''turn construction units'' (TCUs), and speakers and listeners will know that such forms can be a word or a clause, and use that knowledge to predict when a speaker is finished so that others can speak, to avoid or minimize both overlap ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Rhythm
Rhythm (from Greek , ''rhythmos'', "any regular recurring motion, symmetry") generally means a " movement marked by the regulated succession of strong and weak elements, or of opposite or different conditions". This general meaning of regular recurrence or pattern in time can apply to a wide variety of cyclical natural phenomena having a periodicity or frequency of anything from microseconds to several seconds (as with the riff in a rock music song); to several minutes or hours, or, at the most extreme, even over many years. Rhythm is related to and distinguished from pulse, meter, and beats: In the performance arts, rhythm is the timing of events on a human scale; of musical sounds and silences that occur over time, of the steps of a dance, or the meter of spoken language and poetry. In some performing arts, such as hip hop music, the rhythmic delivery of the lyrics is one of the most important elements of the style. Rhythm may also refer to visual presentation, as "timed mov ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Stress Feet
Stress may refer to: Science and medicine * Stress (biology), an organism's response to a stressor such as an environmental condition * Stress (linguistics), relative emphasis or prominence given to a syllable in a word, or to a word in a phrase or sentence * Stress (mechanics), the internal forces that neighboring particles of a continuous material exert on each other * Occupational stress Occupational stress is psychological stress related to one's job. Occupational stress refers to a chronic condition. Occupational stress can be managed by understanding what the stressful conditions at work are and taking steps to remediate those ..., stress related to one's job * Psychological stress, a feeling of strain and pressure * Surgical stress, systemic response to surgical injury Arts, entertainment, and media Music Groups and musicians * Stress (Brazilian band), a Brazilian heavy metal band * Stress (British band), a British rock band * Stress (pop rock band), an early 1980s melod ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Word
A word is a basic element of language that carries an semantics, objective or pragmatics, practical semantics, meaning, can be used on its own, and is uninterruptible. Despite the fact that language speakers often have an intuitive grasp of what a word is, there is no consensus among linguistics, linguists on its definition and numerous attempts to find specific criteria of the concept remain controversial. Different standards have been proposed, depending on the theoretical background and descriptive context; these do not converge on a single definition. Some specific definitions of the term "word" are employed to convey its different meanings at different levels of description, for example based on phonology, phonological, grammar, grammatical or orthography, orthographic basis. Others suggest that the concept is simply a convention used in everyday situations. The concept of "word" is distinguished from that of a morpheme, which is the smallest unit of language that has a ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Syllable
A syllable is a unit of organization for a sequence of speech sounds typically made up of a syllable nucleus (most often a vowel) with optional initial and final margins (typically, consonants). Syllables are often considered the phonological "building blocks" of words. They can influence the rhythm of a language, its prosody, its poetic metre and its stress patterns. Speech can usually be divided up into a whole number of syllables: for example, the word ''ignite'' is made of two syllables: ''ig'' and ''nite''. Syllabic writing began several hundred years before the first letters. The earliest recorded syllables are on tablets written around 2800 BC in the Sumerian city of Ur. This shift from pictograms to syllables has been called "the most important advance in the history of writing". A word that consists of a single syllable (like English ''dog'') is called a monosyllable (and is said to be ''monosyllabic''). Similar terms include disyllable (and ''disyllabic''; also '' ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Phoneme
In phonology and linguistics, a phoneme () is a unit of sound that can distinguish one word from another in a particular language. For example, in most dialects of English, with the notable exception of the West Midlands and the north-west of England, the sound patterns (''sin'') and (''sing'') are two separate words that are distinguished by the substitution of one phoneme, , for another phoneme, . Two words like this that differ in meaning through the contrast of a single phoneme form a ''minimal pair''. If, in another language, any two sequences differing only by pronunciation of the final sounds or are perceived as being the same in meaning, then these two sounds are interpreted as phonetic variants of a single phoneme in that language. Phonemes that are established by the use of minimal pairs, such as ''tap'' vs ''tab'' or ''pat'' vs ''bat'', are written between slashes: , . To show pronunciation, linguists use square brackets: (indicating an aspirated ''p'' in ''p ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

University Of Aix-en-Provence
Aix-Marseille University (AMU; french: Aix-Marseille Université; formally incorporated as ''Université d'Aix-Marseille'') is a public research university located in the Provence region of southern France. It was founded in 1409 when Louis II of Anjou, Count of Provence, petitioned the Pisan Antipope Alexander V to establish the University of Provence, making it one of the oldest university-level institutions in France. The institution came into its current form following a reunification of the University of Provence, the University of the Mediterranean and Paul Cézanne University. The reunification became effective on 1 January 2012, resulting in the creation of the largest university in the French-speaking world, with about 80,000 students. AMU has the largest budget of any academic institution in the Francophone world, standing at €750 million. It is consistently ranked among the top 200 universities in the world and is ranked within the top 4 universities in France acc ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

CD-ROM
A CD-ROM (, compact disc read-only memory) is a type of read-only memory consisting of a pre-pressed optical compact disc that contains data. Computers can read—but not write or erase—CD-ROMs. Some CDs, called enhanced CDs, hold both computer data and audio with the latter capable of being played on a CD player, while data (such as software or digital video) is only usable on a computer (such as ISO 9660 format PC CD-ROMs). During the 1990s and early 2000s, CD-ROMs were popularly used to distribute software and data for computers and fifth generation video game consoles. DVD started to replace it in these roles starting in the early 2000s. History The earliest theoretical work on optical disc storage was done by independent researchers in the United States including David Paul Gregg (1958) and James Russel (1965–1975). In particular, Gregg's patents were used as the basis of the LaserDisc specification that was co-developed between MCA and Philips after MCA purchased ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]