Closed captioning (CC) and
subtitling
Subtitles and captions are lines of dialogue or other text displayed at the bottom of the screen in films, television programs, video games or other visual media. They can be transcriptions of the screenplay, translations of it, or informati ...
are both processes of displaying text on a television, video screen, or other visual display to provide additional or interpretive information. Both are typically used as a
transcription of the audio portion of a program as it occurs (either
verbatim or in edited form), sometimes including descriptions of non-speech elements. Other uses have included providing a textual alternative language translation of a presentation's primary audio language that is usually burned-in (or "open") to the video and unselectable.
HTML5 defines subtitles as a "transcription or translation of the dialogue when sound is available but not understood" by the viewer (for example, dialogue in a foreign language) and captions as a "transcription or translation of the dialogue, sound effects, relevant musical cues, and other relevant audio information when sound is unavailable or not clearly audible" (for example, when audio is muted or the viewer is deaf or hard of hearing).
Terminology
The term ''closed'' indicates that the captions are not visible until activated by the viewer, usually via the
remote control or menu option. On the other hand, the terms ''open'', ''burned-in'', ''baked on'', ''hard-coded'', or simply ''hard'' indicate that the captions are visible to all viewers as they are embedded in the video.
In the United States and Canada, the terms ''
subtitles
Subtitles and captions are lines of dialogue or other text displayed at the bottom of the screen in films, television programs, video games or other visual media. They can be transcriptions of the screenplay, translations of it, or informati ...
'' and ''captions'' have different meanings. ''Subtitles'' assume the viewer can hear but cannot understand the language or accent, or the speech is not entirely clear, so they transcribe only dialogue and some on-screen text. ''Captions'' aim to describe to the deaf and hard of hearing all significant audio content—spoken dialogue and non-speech information such as the identity of speakers and, occasionally, their manner of speaking—along with any significant
music or
sound effects using words or symbols. Also, the term ''closed caption'' has come to be used to also refer to the North American
EIA-608 encoding that is used with NTSC-compatible video.
The
United Kingdom,
Ireland, and most other countries do not distinguish between subtitles and closed captions and use ''subtitles'' as the general term. The equivalent of ''captioning'' is usually referred to as ''subtitles for the hard of hearing''. Their presence is referenced on screen by notation which says "Subtitles", or previously "Subtitles 888" or just "888" (the latter two are in reference to the conventional
videotext channel for captions), which is why the term ''subtitle'' is also used to refer to the
Ceefax-based videotext encoding that is used with PAL-compatible video. The term ''subtitle'' has been replaced with ''caption'' in a number of markets—such as Australia and New Zealand—that purchase large amounts of imported US material, with much of that video having had the US CC logo already superimposed over the start of it. In New Zealand, broadcasters superimpose an ear logo with a line through it that represents subtitles for the hard of hearing, even though they are currently referred to as captions. In the UK, modern digital television services have subtitles for the majority of programs, so it is no longer necessary to highlight which have subtitling/captioning and which do not.
Remote control handsets for TVs, DVDs, and similar devices in most European markets often use "SUB" or "SUBTITLE" on the button used to control the display of subtitles/captions.
History
Open captioning
Regular open-captioned broadcasts began on
PBS's ''
The French Chef'' in 1972.
WGBH WGBH may refer to:
* WGBH Educational Foundation, based in Boston, Massachusetts, United States
** WGBH (FM), a public radio station at Boston, Massachusetts on 89.7 MHz owned by the WGBH Educational Foundation
** WGBH-TV
WGBH-TV (channel 2), ...
began open captioning of the programs ''
Zoom'', ''
ABC World News Tonight'', and ''
Once Upon a Classic'' shortly thereafter.
Technical development of closed captioning
Closed captioning was first demonstrated in the United States at the First National Conference on Television for the Hearing Impaired in Nashville, Tennessee, in 1971.
A second demonstration of closed captioning was held at Gallaudet College (now
Gallaudet University
Gallaudet University ( ) is a private federally chartered research university in Washington, D.C. for the education of the deaf and hard of hearing. It was founded in 1864 as a grammar school for both deaf and blind children. It was the first sc ...
) on February 15, 1972, where
ABC and the
National Bureau of Standards demonstrated closed captions embedded within a normal broadcast of ''
The Mod Squad''.
At the same time in the UK the BBC was demonstrating its Ceefax text based broadcast service which they were already using as a foundation to the development of a closed caption production system. They were working with Professor
Alan Newell from the University of Southampton who had been developing prototypes in the late 1960s.
The closed captioning system was successfully encoded and broadcast in 1973 with the cooperation of PBS station
WETA.
As a result of these tests, the FCC in 1976 set aside line 21 for the transmission of closed captions. PBS engineers then developed the caption editing consoles that would be used to caption prerecorded programs.
The
BBC in the UK was the first broadcaster to include closed captions (called subtitles in the UK) in 1979 based on the
Teletext framework for pre-recorded programming.
Real-time captioning, a process for captioning live broadcasts, was developed by the
National Captioning Institute in 1982.
In real-time captioning,
stenotype
A steno machine, stenotype machine, shorthand machine, stenograph or steno writer is a specialized chorded keyboard or typewriter used by stenographers for shorthand use. In order to pass the United States Registered Professional Reporter test, a ...
operators who are able to type at speeds of over 225 words per minute provide captions for live television programs, allowing the viewer to see the captions within two to three seconds of the words being spoken.
Major US producers of captions are
WGBH-TV,
VITAC,
CaptionMax and the
National Captioning Institute. In the UK and
Australasia, Ai-Media,
Red Bee Media, itfc, and Independent Media Support are the major vendors.
Improvements in
speech recognition technology means that live captioning may be fully or partially automated.
BBC Sport broadcasts use a "respeaker": a trained human who repeats the running commentary (with careful enunciation and some simplification and
markup
Markup or mark-up can refer to:
* Markup language, a standardized set of notations used to annotate a plain-text document's content to give information regarding the structure of the text or instructions for how it is to be displayed
** Lightweigh ...
) for input to the automated text generation system. This is generally reliable, though errors are not unknown.
Full-scale closed captioning
The National Captioning Institute was created in 1979 in order to get the cooperation of the commercial television networks.
The first use of regularly scheduled closed captioning on American television occurred on March 16, 1980.
Sears had developed and sold the Telecaption adapter, a decoding unit that could be connected to a standard television set. The first programs seen with captioning were a ''
Disney's Wonderful World
The Walt Disney Company has produced an anthology television series since 1954 under several titles and formats. The program's current title, ''The Wonderful World of Disney'', was used from 1969 to 1979 and again from 1991 to the present. The p ...
'' presentation of the film ''
Son of Flubber
''Son of Flubber'' is a 1963 American science fiction comedy film directed by Robert Stevenson and produced by Walt Disney Productions. The sequel to ''The Absent-Minded Professor'' (1961), Fred MacMurray reprises his role from the first film as ...
'' on
NBC, an ''
ABC Sunday Night Movie
''The ABC Sunday Night Movie'' is a television program that aired on Sunday nights, first for a brief time in 1962 under the title ''Hollywood Special'' (although ''Time'' magazine lists this version as ''The Sunday Night Movie'') to supposedly r ...
'' airing of ''
Semi-Tough'', and ''
Masterpiece Theatre'' on
PBS.
Since 2010 BBC provides a 100% broadcast captioning service across all 7 of its main broadcast channels
BBC One,
BBC Two
BBC Two is a British free-to-air public broadcast television network owned and operated by the BBC. It covers a wide range of subject matter, with a remit "to broadcast programmes of depth and substance" in contrast to the more mainstream an ...
,
BBC Three,
BBC Four
BBC Four is a British free-to-air public broadcast television channel owned and operated by the BBC. It was launched on 2 March 2002 ,
CBBC,
Cbeebies and
BBC News (TV channel)
BBC News (also known as the BBC News Channel) is a British free-to-air public broadcast television news channel for BBC News. It was launched as BBC News 24 on 9 November 1997 at 5:30 pm as part of the BBC's foray into digital domestic telev ...
.
BBC iPlayer
BBC iPlayer (stylised as iPLAYER or BBC iPLAYER) is a video on demand service from the BBC. The service is available on a wide range of devices, including mobile phones and tablets, personal computers and smart televisions. iPlayer services del ...
launched in 2008 as the first captioned
Video on demand service from a major broadcaster meeting comparable levels of captioning as those provided on its broadcast channels.
Legislative development in the U.S.
Until the passage of the Television Decoder Circuitry Act of 1990, television captioning was performed by a set-top box manufactured by Sanyo Electric and marketed by the National Captioning Institute (NCI). (At that time a set-top decoder cost about as much as a TV set itself, approximately $200.) Through discussions with the manufacturer it was established that the appropriate circuitry integrated into the television set would be less expensive than the stand-alone box, and Ronald May, then a Sanyo employee, provided the expert witness testimony on behalf of Sanyo and Gallaudet University in support of the passage of the bill. On January 23, 1991, the
Television Decoder Circuitry Act of 1990
Television, sometimes shortened to TV, is a telecommunication medium for transmitting moving images and sound. The term can refer to a television set, or the medium of television transmission. Television is a mass medium for advertising, e ...
was passed by Congress.
This Act gave the
Federal Communications Commission (FCC) power to enact rules on the implementation of closed captioning. This Act required all analog television receivers with screens of at least 13 inches or greater, either sold or manufactured, to have the ability to display closed captioning by July 1, 1993.
Also, in 1990, the
Americans with Disabilities Act
The Americans with Disabilities Act of 1990 or ADA () is a civil rights law that prohibits discrimination based on disability. It affords similar protections against discrimination to Americans with disabilities as the Civil Rights Act of 1964 ...
(ADA) was passed to ensure equal opportunity for persons with disabilities.
The ADA prohibits discrimination against persons with disabilities in public accommodations or commercial facilities. Title III of the ADA requires that public facilities—such as hospitals, bars, shopping centers and museums (but not movie theaters)—provide access to verbal information on televisions, films or slide shows.
The Federal Communications Commission requires all providers of programs to caption material which has audio in English or Spanish, with certain exceptions specified in Section 79.1(d) of the commission's rules. These exceptions apply to new networks; programs in languages other than English or Spanish; networks having to spend over 2% of income on captioning; networks having less than US$3,000,000 in revenue; and certain local programs; among other exceptions. Those who are not covered by the exceptions may apply for a hardship waiver.
The
Telecommunications Act of 1996 expanded on the Decoder Circuitry Act to place the same requirements on
digital television receivers by July 1, 2002. All TV programming distributors in the U.S. are required to provide closed captions for Spanish-language video programming as of January 1, 2010.
A bill, H.R. 3101, the Twenty-First Century Communications and Video Accessibility Act of 2010, was passed by the United States House of Representatives in July 2010. A similar bill, S. 3304, with the same name, was passed by the United States Senate on August 5, 2010, by the House of Representatives on September 28, 2010, and was signed by President
Barack Obama on October 8, 2010. The Act requires, in part, for
ATSC
Advanced Television Systems Committee (ATSC) standards are an American set of standards for digital television transmission over terrestrial, cable and satellite networks. It is largely a replacement for the analog NTSC standard and, like that ...
-decoding set-top box remotes to have a button to turn on or off the closed captioning in the output signal. It also requires broadcasters to provide captioning for television programs redistributed on the Internet.
On February 20, 2014, the FCC unanimously approved the implementation of quality standards for closed captioning, addressing accuracy, timing, completeness, and placement. This is the first time the FCC has addressed quality issues in captions.
In 2015, a law was passed in Hawaii requiring two screenings a week of each movie with captions on the screen. In 2022 a law took effect in New York City requiring movie theaters to offer captions on the screen for up to four showtimes per movie each week, including weekends and Friday nights.
Some state and local governments (including
Boston, Massachusetts;
Portland, Oregon;
Rochester, New York; and
Washington state) require closed captioning to be activated on TVs in public places at all times, even if no one has requested it.
Philippines
As amended by RA 10905, all TV networks in the Philippines are required to give CC. As of 2018, the three major TV networks in the country are currently testing the closed captioning system on their transmissions.
ABS-CBN added CC in their daily
3 O'Clock Habit in the afternoon.
5 started implementing CCs on their live noon and nightly news programs.
GMA was once started broadcasting nightly and late night news programs, but then they stopped adding CCs lately. Only select
Korean drama and local or foreign movies, ''Biyahe ni Drew'' (English: ''Drew's Explorations'') and ''Idol sa Kusina'' (English: ''Kitchen Idol'') are the programs and shows that they air with proper closed captioning.
Closed captioning in some Filipino films either to be "included" if film production companies have a bias on having impact on their viewing experience for those who did not understand the language. Since 2016, all Filipino-Language Films and also on some Streaming Services like iWant had included their English Subtitles in some showing on films. The law regarding that was passed by Gerald Anthony Gullas Jr., a lawmaker from Cebu City, who had implemented the regulations on standardizing both official languages of the Philippines, as the people had not fluently mastered their English vocabulary.
Legislative development in Australia
The government of Australia provided
seed funding in 1981 for the establishment of the Australian Caption Centre (ACC) and the purchase of equipment. Captioning by the ACC commenced in 1982 and a further grant from the Australian government enabled the ACC to achieve and maintain financial self-sufficiency. The ACC, now known as
Media Access Australia, sold its commercial captioning division to
Red Bee Media in December 2005. Red Bee Media continues to provide captioning services in Australia today.
Funding development in New Zealand
In 1981,
TVNZ
, type = Crown entity
, industry = Broadcast television
, num_locations = New Zealand
, location = Auckland, New Zealand
, area_served = Nationally (New Zealand) and some Pacific Island nations such as the Cook Islands, Fiji, and the So ...
held a
telethon to raise funds for Teletext-encoding equipment used for the creation and editing of text-based broadcast services for the deaf. The service came into use in 1984 with caption creation and importing paid for as part of the public broadcasting fee until the creation of the
NZ on Air taxpayer fund, which is used to provide captioning for
NZ On Air content, TVNZ news shows and conversion of
EIA-608 US captions to the preferred
EBU
The European Broadcasting Union (EBU; french: Union européenne de radio-télévision, links=no, UER) is an alliance of public service media organisations whose countries are within the European Broadcasting Area or who are members of the Co ...
STL format for only
TVNZ 1,
TV 2 and
TV 3 with archived captions available to
FOUR and select
Sky programming. During the second half of 2012,
TV3 Channel 3 or TV 3 may refer to:
Television
*Canal 3 (Burkina Faso), a commercial television channel in Burkina Faso
*Canal 3 (Guatemala), a commercial television channel in Guatemala
*Channel 3 (Algeria), a public Algerian TV channel owned by EPTV ...
and
FOUR began providing non-Teletext DVB image-based captions on their HD service and used the same format on the satellite service, which has since caused major timing issues in relation to server load and the loss of captions from most SD DVB-S receivers, such as the ones Sky Television provides their customers. As of April 2, 2013, only the Teletext page 801 caption service will remain in use with the informational Teletext non-caption content being discontinued.
Application
Closed captions were created for
deaf and
hard of hearing individuals to assist in comprehension. They can also be used as a tool by those learning to read, learning to speak a non-native language, or in an environment where the audio is difficult to hear or is intentionally muted. Captions can also be used by viewers who simply wish to read a transcript along with the program audio.
In the United States, the
National Captioning Institute noted that
English as a foreign or second language (ESL) learners were the largest group buying decoders in the late 1980s and early 1990s before built-in decoders became a standard feature of US television sets. This suggested that the largest audience of closed captioning was people whose native language was not English. In the United Kingdom, of 7.5 million people using TV subtitles (closed captioning), 6 million have no hearing impairment.
Closed captions are also used in public environments, such as bars and restaurants, where patrons may not be able to hear over the background noise, or where multiple televisions are displaying different programs. In addition, online videos may be treated through digital processing of their audio content by various robotic algorithms (robots). Multiple chains of errors are the result. When a video is truly and accurately transcribed, then the closed-captioning publication serves a useful purpose, and the content is available for search engines to index and make available to users on the internet.
Some television sets can be set to automatically turn captioning on when the volume is muted.
Television and video
For ''live'' programs, spoken words comprising the television program's
soundtrack are transcribed by a human operator (a
speech-to-text reporter) using
stenotype
A steno machine, stenotype machine, shorthand machine, stenograph or steno writer is a specialized chorded keyboard or typewriter used by stenographers for shorthand use. In order to pass the United States Registered Professional Reporter test, a ...
or
stenomask type of machines, whose phonetic output is instantly translated into text by a computer and displayed on the screen. This technique was developed in the 1970s as an initiative of the
BBC's
Ceefax teletext service. In collaboration with the BBC, a university student took on the research project of writing the first phonetics-to-text conversion program for this purpose. Sometimes, the captions of live broadcasts, like news bulletins, sports events, live entertainment shows, and other live shows, fall behind by a few seconds. This delay is because the machine does not know what the person is going to say next, so after the person on the show says the sentence, the captions appear. Automatic computer speech recognition works well when trained to recognize a single voice, and so since 2003, the BBC does live subtitling by having someone re-speak what is being broadcast. Live captioning is also a form of
real-time text. Meanwhile, sport events on ESPN are using
court reporter
A court reporter, court stenographer, or shorthand reporter is a person whose occupation is to capture the live testimony in proceedings using a stenographic machine, thereby transforming the proceedings into an official certified transcript b ...
s, using a special (steno) keyboard and individually constructed "dictionaries."
In some cases, the transcript is available beforehand, and captions are simply displayed during the program after being edited. For programs that have a mix of pre-prepared and live content, such as
news bulletins, a combination of techniques is used.
For prerecorded programs, commercials, and home videos, audio is transcribed and captions are prepared, positioned, and timed in advance.
For all types of
NTSC programming, captions are "encoded" into
line 21 of the
vertical blanking interval - a part of the TV picture that sits just above the visible portion and is usually unseen. For
ATSC
Advanced Television Systems Committee (ATSC) standards are an American set of standards for digital television transmission over terrestrial, cable and satellite networks. It is largely a replacement for the analog NTSC standard and, like that ...
(
digital television) programming, three streams are encoded in the video: two are backward compatible "line 21" captions, and the third is a set of up to 63 additional caption streams encoded in
EIA-708 format.
[ - ATSC Closed Captioning FAQ]
cached copy
Captioning is modulated and stored differently in
PAL and
SECAM 625 line 25 frame countries, where
teletext is used rather than in
EIA-608, but the methods of preparation and the line 21 field used are similar. For home
Betamax
Betamax (also known as Beta, as in its logo) is a consumer-level analog recording and cassette format of magnetic tape for video, commonly known as a video cassette recorder. It was developed by Sony and was released in Japan on May 10, 1975, ...
and
VHS videotapes, a shift down of this line 21 field must be done due to the greater number of VBI lines used in 625 line PAL countries, though only a small minority of European PAL VHS machines support this (or any) format for closed caption recording. Like all teletext fields, teletext captions can't be stored by a standard 625 line VHS recorder (due to the lack of field shifting support); they are available on all professional
S-VHS recordings due to all fields being recorded. Recorded Teletext caption fields also suffer from a higher number of caption errors due to increased number of bits and a low
SNR, especially on low-bandwidth VHS. This is why Teletext captions used to be stored separately on floppy disk to the analogue master tape. DVDs have their own system for subtitles and captions, which are digitally inserted in the data stream and decoded on playback into video.
For older televisions, a set-top box or other decoder is usually required. In the US, since the passage of the Television Decoder Circuitry Act, manufacturers of most television receivers sold have been required to include closed captioning display capability. High-definition TV sets, receivers, and
tuner cards are also covered, though the technical specifications are different (high-definition display screens, as opposed to high-definition TVs, may lack captioning). Canada has no similar law but receives the same sets as the US in most cases.
During transmission, single byte errors can be replaced by a white space which can appear at the beginning of the program. More byte errors during EIA-608 transmission can affect the screen momentarily, by defaulting to a real-time mode such as the "roll up" style, type random letters on screen, and then revert to normal. Uncorrectable byte errors within the teletext page header will cause whole captions to be dropped. EIA-608, due to using only two characters per video frame, sends these captions ahead of time storing them in a second buffer awaiting a command to display them; Teletext sends these in real-time.
The use of capitalization varies among caption providers. Most caption providers capitalize all words while others such as WGBH and non-US providers prefer to use mixed-case letters.
There are two main styles of line 21 closed captioning:
* Roll-up or scroll-up or paint-on or scrolling: Real-time words sent in paint-on or scrolling mode appear from left to right, up to one line at a time; when a line is filled in roll-up mode, the whole line scrolls up to make way for a new line, and the line on top is erased. The lines usually appear at the bottom of the screen, but can actually be placed on any of the 14 screen rows to avoid covering graphics or action. This method is used when captioning video in real-time such as for live events, where a sequential word-by-word captioning process is needed or a pre-made intermediary file isn't available. This method is signaled on
EIA-608 by a two-byte caption command or in Teletext by replacing rows for a roll-up effect and duplicating rows for a paint-on effect. This allows for real-time caption line editing.
* Pop-on or pop-up or block: A caption appears on any of the 14 screen rows as a complete sentence, which can be followed by additional captions. This method is used when captions come from an intermediary file (such as the Scenarist or EBU STL file formats) for pre-taped television and film programming, commonly produced at captioning facilities. This method of captioning can be aided by digital scripts or voice recognition software, and if used for live events, would require a video delay to avoid a large delay in the captions' appearance on-screen, which occurs with Teletext-encoded live subtitles.
Caption formatting
TVNZ
, type = Crown entity
, industry = Broadcast television
, num_locations = New Zealand
, location = Auckland, New Zealand
, area_served = Nationally (New Zealand) and some Pacific Island nations such as the Cook Islands, Fiji, and the So ...
Access Services and Red Bee Media for BBC and Australia example:
I got the machine ready.
ENGINE STARTING
(speeding away)
UK IMS for ITV and Sky example:
(man) I got the machine ready. (engine starting)
US WGBH Access Services example:
MAN: I got the machine ready. (engine starting)
US
National Captioning Institute example:
- I GOT THE MACHINE READY.
US other provider example:
I GOT THE MACHINE READY.
ngine starting
US in-house real-time roll-up example:
>> Man: I GOT THE MACHINE READY.
ngine starting
Non-US in-house real-time roll-up example:
MAN: I got the machine ready.
(ENGINE STARTING)
US
CaptionMax example:
ANI got the machine ready.
ngine starting
Syntax
For real-time captioning done outside of captioning facilities, the following syntax is used:
* '>>' (two prefixed
greater-than signs) indicates a change in single speaker.
** Sometimes appended with the speaker's name in alternate case, followed by a
colon.
* '>>>' (three prefixed greater-than signs) indicates a change in news story or multiple speakers.
Styles of syntax that are used by various captioning producers:
* Capitals indicate main on-screen dialogue and the name of the speaker.
** Legacy
EIA-608 home caption decoder fonts had no
descenders on lowercase letters.
** Outside North America, capitals with background coloration indicate a song title or sound effect description.
** Outside North America, capitals with black or no background coloration indicates when a word is stressed or emphasized.
* Descenders indicate background sound description and
off-screen dialogue.
** Most modern caption producers, such as
WGBH-TV, use
mixed case for both on-screen and
off-screen dialogue.
* '-' (a prefixed dash) indicates a change in single speaker (used by
CaptionMax).
* Words in
italics indicate when a word is stressed or emphasized and when real world names are quoted.
** Italics and
bold type are only supported by
EIA-608.
** Some North American providers use this for
narrated dialogue.
** Some providers use this for
off-screen dialogue.
** Italics are also applied when a word is spoken in a foreign language.
* Text coloration indicates captioning credits and sponsorship.
** Used by
music videos in the past, but generally has declined due to system incompatibilities.
** In Ceefax/Teletext countries, it indicates a change in single speaker in place of '>>'.
** Some Teletext countries use coloration to indicate when a word is stressed or emphasized.
** Coloration is limited to white, green, blue, cyan, red, yellow and magenta.
** UK order of use for text is
white,
green,
cyan
Cyan () is the color between green and blue on the visible spectrum of light. It is evoked by light with a predominant wavelength between 490 and 520 nm, between the wavelengths of green and blue.
In the subtractive color system, or CMYK color ...
,
yellow; and backgrounds is
black,
red,
blue,
magenta,
white.
** US order of use for text is
white,
yellow,
cyan
Cyan () is the color between green and blue on the visible spectrum of light. It is evoked by light with a predominant wavelength between 490 and 520 nm, between the wavelengths of green and blue.
In the subtractive color system, or CMYK color ...
,
green; and backgrounds is
black,
blue,
red,
magenta, white.
*
Square brackets or
parentheses indicate a song title or sound effect description.
*
Parentheses indicate speaker's vocal pitch e.g., (man), (woman), (boy) or (girl).
** Outside North America,
parentheses indicate a silent on-screen action.
* A pair of
eighth notes is used to bracket a line of
lyrics to indicate singing.
** A pair of eighth notes on a line of no text are used during a section of instrumental music.
** Outside North America, a single
number sign is used on a line of
lyrics to indicate singing.
** An additional musical notation character is appended to the end of the last line of lyrics to indicate the song's end.
** As the symbol is unsupported by
Ceefax/
Teletext, a
number sign - which resembles a musical
sharp - is substituted.
Technical aspects
There were many shortcomings in the original Line 21 specification from a
typographic standpoint, since, for example, it lacked many of the characters required for captioning in languages other than English. Since that time, the core Line 21 character set has been expanded to include quite a few more characters, handling most requirements for languages common in North and South America such as
French
French (french: français(e), link=no) may refer to:
* Something of, from, or related to France
** French language, which originated in France, and its various dialects and accents
** French people, a nation and ethnic group identified with Franc ...
,
Spanish, and
Portuguese, though those extended characters are not required in all decoders and are thus unreliable in everyday use. The problem has been almost eliminated with a market specific full set of Western European characters and a private adopted
Norpak extension for
South Korean and
Japanese markets. The full
EIA-708 standard for digital television has worldwide character set support, but there has been little use of it due to
EBU
The European Broadcasting Union (EBU; french: Union européenne de radio-télévision, links=no, UER) is an alliance of public service media organisations whose countries are within the European Broadcasting Area or who are members of the Co ...
Teletext dominating
DVB countries, which has its own extended character sets.
Captions are often edited to make them easier to read and to reduce the amount of text displayed onscreen. This editing can be very minor, with only a few occasional unimportant missed lines, to severe, where virtually every line spoken by the actors is condensed. The measure used to guide this editing is words per minute, commonly varying from 180 to 300, depending on the type of program. Offensive words are also captioned, but if the program is censored for TV broadcast, the broadcaster might not have arranged for the captioning to be edited or censored also. The "TV Guardian", a television
set-top box, is available to parents who wish to censor offensive language of programs—the video signal is fed into the box and if it detects an offensive word in the captioning, the audio signal is bleeped or muted for that period of time.
Caption channels
The Line 21 data stream can consist of data from several data channels
multiplexed together. Odd field 1 can have four data channels: two separate synchronized captions (CC1, CC2) with caption-related text, such as website
URL
A Uniform Resource Locator (URL), colloquially termed as a web address, is a reference to a web resource that specifies its location on a computer network and a mechanism for retrieving it. A URL is a specific type of Uniform Resource Identifie ...
s (T1, T2). Even field 2 can have five additional data channels: two separate synchronized captions (CC3, CC4) with caption related text (T3, T4), and
Extended Data Services (XDS) for Now/Next
EPG details. XDS data structure is defined in CEA-608.
As CC1 and CC2 share bandwidth, if there is a lot of data in CC1, there will be little room for CC2 data and is generally only used for the primary audio captions. Similarly, CC3 and CC4 share the second even field of line 21. Since some early caption decoders supported only single field decoding of CC1 and CC2, captions for
SAP in a second language were often placed in CC2. This led to bandwidth problems, and the U.S.
Federal Communications Commission (FCC) recommendation is that bilingual programming should have the second caption language in CC3. Many Spanish television networks such as
Univision and
Telemundo, for example, provides
English subtitles
Subtitles and captions are lines of dialogue or other Writing, text displayed at the bottom of the screen in films, television programs, video games or other visual media. They can be transcriptions of the screenplay, translations of it, or ...
for many of its
Spanish programs in CC3.
Canadian broadcasters use CC3 for French translated SAPs, which is also a similar practice in South Korea and Japan.
Ceefax and Teletext can have a larger number of captions for other languages due to the use of multiple VBI lines. However, only
European countries used a second subtitle page for second language audio tracks where either the
NICAM dual mono or
Zweikanalton were used.
Digital television interoperability issues
The US
ATSC
Advanced Television Systems Committee (ATSC) standards are an American set of standards for digital television transmission over terrestrial, cable and satellite networks. It is largely a replacement for the analog NTSC standard and, like that ...
digital television system originally specified two different kinds of ''closed captioning'' datastream standards: the original analog-compatible (available by
Line 21) and the more modern digital-only
CEA-708
CTA-708 (formerly EIA-708 and CEA-708) is the standard for closed captioning for ATSC digital television (DTV) streams in the United States and Canada. It was developed by the Consumer Electronics sector of the Electronic Industries Alliance, wh ...
formats are delivered within the video stream.
The
US FCC mandates that broadcasters deliver (and generate, if necessary) both datastream formats with the
CEA-708
CTA-708 (formerly EIA-708 and CEA-708) is the standard for closed captioning for ATSC digital television (DTV) streams in the United States and Canada. It was developed by the Consumer Electronics sector of the Electronic Industries Alliance, wh ...
format merely a conversion of the Line 21 format.
The
Canadian CRTC has not mandated that broadcasters either broadcast both datastream formats or exclusively in one format. Most broadcasters and networks to avoid large conversion cost outlays just provide
EIA-608 captions along with a transcoded
CEA-708
CTA-708 (formerly EIA-708 and CEA-708) is the standard for closed captioning for ATSC digital television (DTV) streams in the United States and Canada. It was developed by the Consumer Electronics sector of the Electronic Industries Alliance, wh ...
version encapsulated within
CEA-708
CTA-708 (formerly EIA-708 and CEA-708) is the standard for closed captioning for ATSC digital television (DTV) streams in the United States and Canada. It was developed by the Consumer Electronics sector of the Electronic Industries Alliance, wh ...
packets.
Incompatibility issues with digital TV
Many viewers find that when they acquire a
digital television or
set-top box they are unable to view closed caption (CC) information, even though the broadcaster is sending it and the
TV is able to display it.
Originally, CC information was included in the picture ("line 21") via a
composite video input
Composite video is an Video#Analog video, analog video signal format that carries Standard-definition television, standard-definition video (typically at 525 lines or 625 lines) as a single channel. Video information is Video coding format, e ...
, but there is no equivalent capability in digital video interconnects (such as
DVI and
HDMI) between the display and a "source". A "source", in this case, can be a
DVD player or a
terrestrial or cable digital television receiver. When CC information is encoded in the
MPEG-2
MPEG-2 (a.k.a. H.222/H.262 as was defined by the ITU) is a standard for "the generic video coding format, coding of moving pictures and associated audio information". It describes a combination of Lossy compression, lossy video compression and ...
data stream, only the device that decodes the
MPEG-2 data (a source) has access to the ''closed caption'' information; there is no standard for transmitting the CC information to a display monitor separately. Thus, if there is CC information, the source device needs to overlay the CC information on the picture prior to transmitting to the display over the interconnect's video output.
The responsibility of decoding the CC information and overlaying onto the visible video image has been taken away from the TV display and put into the "source" of DVI and HDMI digital video interconnects. Because the TV handles "mute" and, when using
DVI and
HDMI, a different device handles turning on and off CC, this means the "captions come on automatically when the
TV is muted" feature no longer works. That source device—such as a
DVD player or
set-top box—must "burn" the image of the CC text into the picture data carried by the
HDMI or
DVI cable; there's no other way for the CC text to be carried over the
HDMI or
DVI cable.
Many source devices do not have the ability to overlay CC information, for controlling the CC overlay can be complicated. For example, the
Motorola DCT-5xxx and -6xxx cable set-top receivers have the ability to decode CC information located on the
MPEG-2
MPEG-2 (a.k.a. H.222/H.262 as was defined by the ITU) is a standard for "the generic video coding format, coding of moving pictures and associated audio information". It describes a combination of Lossy compression, lossy video compression and ...
stream and overlay it on the picture, but turning CC on and off requires turning off the unit and going into a special setup menu (it is not on the standard configuration menu and it cannot be controlled using the remote). Historically,
DVD players
A DVD player is a device that plays DVDs produced under both the DVD-Video and DVD-Audio technical standards, two different and incompatible standards. Some DVD players will also play audio CDs. DVD players are connected to a television to wa ...
,
VCRs and set-top tuners did not need to do this overlaying, since they simply passed this information on to the TV, and they are not mandated to perform this overlaying.
Many modern digital television receivers can be directly connected to cables, but often cannot receive scrambled channels that the user is paying for. Thus, the lack of a standard way of sending CC information between components, along with the lack of a mandate to add this information to a picture, results in CC being unavailable to many hard-of-hearing and deaf users.
The
EBU
The European Broadcasting Union (EBU; french: Union européenne de radio-télévision, links=no, UER) is an alliance of public service media organisations whose countries are within the European Broadcasting Area or who are members of the Co ...
Ceefax-based teletext systems are the source for closed captioning signals, thus when teletext is embedded into
DVB-T
DVB-T, short for Digital Video Broadcasting – Terrestrial, is the DVB European-based consortium standard for the broadcast transmission of digital terrestrial television that was first published in 1997 and first broadcast in Singapore in Febr ...
or
DVB-S
Digital Video Broadcasting – Satellite (DVB-S) is the original DVB standard for Satellite Television and dates from 1995, in its first release, while development lasted from 1993 to 1997. The first commercial applications was by Star TV in Asia ...
the closed captioning signal is included. However, for DVB-T and DVB-S, it is not necessary for a teletext page signal to also be present (
ITV1, for example, does not carry analogue teletext signals on
Sky Digital, but does carry the embedded version, accessible from the "Services" menu of the receiver, or more recently by turning them off/on from a mini menu accessible from the "help" button).
The
BBC's Subtitle (Captioning) Editorial Guidelines were born out of the capabilities of
Teletext but are now used by multiple European broadcasters as the editorial and design best practice guide
New Zealand
In New Zealand, captions use an
EBU
The European Broadcasting Union (EBU; french: Union européenne de radio-télévision, links=no, UER) is an alliance of public service media organisations whose countries are within the European Broadcasting Area or who are members of the Co ...
Ceefax-based teletext system on
DVB broadcasts via
satellite and
cable television with the exception of
MediaWorks New Zealand channels who completely switched to
DVB RLE subtitles in 2012 on both
Freeview satellite and
UHF
Ultra high frequency (UHF) is the ITU designation for radio frequencies in the range between 300 megahertz (MHz) and 3 gigahertz (GHz), also known as the decimetre band as the wavelengths range from one meter to one tenth of a meter (on ...
broadcasts, this decision was made based on the
TVNZ
, type = Crown entity
, industry = Broadcast television
, num_locations = New Zealand
, location = Auckland, New Zealand
, area_served = Nationally (New Zealand) and some Pacific Island nations such as the Cook Islands, Fiji, and the So ...
practice of using this format on only
DVB UHF
Ultra high frequency (UHF) is the ITU designation for radio frequencies in the range between 300 megahertz (MHz) and 3 gigahertz (GHz), also known as the decimetre band as the wavelengths range from one meter to one tenth of a meter (on ...
broadcasts (aka
Freeview HD). This made
composite video
Composite video is an analog video signal format that carries standard-definition video (typically at 525 lines or 625 lines) as a single channel. Video information is encoded on one channel, unlike the higher-quality S-Video (two channels) a ...
connected TVs incapable of decoding the captions on their own. Also, these
pre-rendered subtitles use classic caption style opaque backgrounds with an overly large
font size and obscure the picture more than the more modern, partially transparent backgrounds.
Digital television standard captioning improvements
The
CEA-708
CTA-708 (formerly EIA-708 and CEA-708) is the standard for closed captioning for ATSC digital television (DTV) streams in the United States and Canada. It was developed by the Consumer Electronics sector of the Electronic Industries Alliance, wh ...
specification provides for dramatically improved ''captioning''
* An enhanced character set with more
accented letters and non-Latin letters, and more special symbols
* Viewer-adjustable text size (called the "caption volume control" in the specification), allowing individuals to adjust their TVs to display small, normal, or large captions
* More text and background colors, including both transparent and translucent backgrounds to optionally replace the big
black block
* More text styles, including edged or
drop shadowed text rather than the letters on a solid background
* More text fonts, including
monospaced and proportional spaced,
serif
In typography, a serif () is a small line or stroke regularly attached to the end of a larger stroke in a letter or symbol within a particular font or family of fonts. A typeface or "font family" making use of serifs is called a serif typeface ...
and
sans-serif
In typography and lettering, a sans-serif, sans serif, gothic, or simply sans letterform is one that does not have extending features called "serifs" at the end of strokes. Sans-serif typefaces tend to have less stroke width variation than seri ...
, and some playful cursive fonts
* Higher
bandwidth, to allow more
data per minute of
video
* More language channels, to allow the encoding of more independent caption streams
As of 2009, most closed captioning for digital television environments is done using tools designed for analog captioning (working to the
CEA-608 NTSC specification rather than the
CEA-708
CTA-708 (formerly EIA-708 and CEA-708) is the standard for closed captioning for ATSC digital television (DTV) streams in the United States and Canada. It was developed by the Consumer Electronics sector of the Electronic Industries Alliance, wh ...
ATSC
Advanced Television Systems Committee (ATSC) standards are an American set of standards for digital television transmission over terrestrial, cable and satellite networks. It is largely a replacement for the analog NTSC standard and, like that ...
specification). The captions are then run through transcoders made by companies lik
EEG Enterprisesor
Evertz, which convert the analog
Line 21 caption format to the digital format. This means that none of the
CEA-708
CTA-708 (formerly EIA-708 and CEA-708) is the standard for closed captioning for ATSC digital television (DTV) streams in the United States and Canada. It was developed by the Consumer Electronics sector of the Electronic Industries Alliance, wh ...
features are used unless they were also contained in
CEA-608.
Uses in other media
DVDs and Blu-ray Discs
NTSC DVDs may carry closed captions in data packets of the MPEG-2 video streams inside of the Video-TS folder. Once played out of the analog outputs of a set top DVD player, the caption data is converted to the Line 21 format. They are output by the player to the
composite video
Composite video is an analog video signal format that carries standard-definition video (typically at 525 lines or 625 lines) as a single channel. Video information is encoded on one channel, unlike the higher-quality S-Video (two channels) a ...
(or an available
RF connector
A coaxial RF connector (radio frequency connector) is an electrical connector designed to work at radio frequencies in the multi-megahertz range.
RF connectors are typically used with coaxial cables and are designed to maintain the shielding t ...
) for a connected TV's built-in decoder or a set-top decoder as usual. They can not be output on
S-Video or
component video outputs due to the lack of a
colorburst
Colorburst is an analog video, composite video signal generated by a video-signal generator used to keep the chrominance subcarrier synchronized in a color television signal. By synchronizing an oscillator with the colorburst at the back porc ...
signal on line 21. (Actually, regardless of this, if the DVD player is in interlaced rather than progressive mode, closed captioning ''will'' be displayed on the TV over component video input if the TV captioning is turned on and set to CC1.) When viewed on a personal computer, caption data can be viewed by software that can read and decode the caption data packets in the MPEG-2 streams of the DVD-Video disc.
Windows Media Player (before
Windows 7) in Vista supported only closed caption channels 1 and 2 (not 3 or 4).
Apple's DVD Player does not have the ability to read and decode Line 21 caption data which are recorded on a DVD made from an over-the-air broadcast. It can display some movie DVD captions.
In addition to Line 21 closed captions, video DVDs may also carry subtitles, which generally rendered from the
EIA-608 captions as a bitmap overlay that can be turned on and off via a set top DVD player or DVD player software, just like the textual captions. This type of captioning is usually carried in a subtitle track labeled either "English for the hearing impaired" or, more recently, "SDH" (subtitled for the deaf and Hard of hearing). Many popular Hollywood DVD-Videos can carry both subtitles and closed captions (e.g. ''
Stepmom'' DVD by Columbia Pictures). On some DVDs, the Line 21 captions may contain the same text as the subtitles; on others, only the Line 21 captions include the additional non-speech information (even sometimes song lyrics) needed for deaf and hard-of-hearing viewers. European Region 2 DVDs do not carry Line 21 captions, and instead list the subtitle languages available-English is often listed twice, one as the representation of the dialogue alone, and a second subtitle set which carries additional information for the deaf and hard-of-hearing audience. (Many deaf/ subtitle files on DVDs are reworkings of original teletext subtitle files.)
Blu-ray media cannot carry any
VBI data such as Line 21 closed captioning due to the design of
DVI-based
High-Definition Multimedia Interface (HDMI) specifications that was only extended for synchronized digital audio replacing older analog standards, such as
VGA, S-Video, component video, and
SCART. Both Blu-ray and DVD can use either PNG bitmap subtitles or 'advanced subtitles' to carry SDH type subtitling, the latter being an XML-based textual format which includes font, styling and positioning information as well as a unicode representation of the text. Advanced subtitling can also include additional media accessibility features such as "descriptive audio".
Movies
There are several competing technologies used to provide captioning for movies in theaters. Cinema captioning falls into the categories of open and closed. The definition of "closed" captioning in this context is different from television, as it refers to any technology that allows as few as one member of the audience to view the captions.
Open captioning in a film theater can be accomplished through burned-in captions, projected text or
bitmaps, or (rarely) a display located above or below the movie screen. Typically, this display is a large LED sign. In a digital theater, open caption display capability is built into the digital projector. Closed caption capability is also available, with the ability for 3rd-party closed caption devices to plug into the digital cinema server.
Probably the best known closed captioning option for film theaters is the
Rear Window Captioning System from the
National Center for Accessible Media. Upon entering the theater, viewers requiring captions are given a panel of flat translucent glass or plastic on a gooseneck stalk, which can be mounted in front of the viewer's seat. In the back of the theater is an
LED
A light-emitting diode (LED) is a semiconductor Electronics, device that Light#Light sources, emits light when Electric current, current flows through it. Electrons in the semiconductor recombine with electron holes, releasing energy i ...
display that shows the captions in mirror image. The panel reflects captions for the viewer but is nearly invisible to surrounding patrons. The panel can be positioned so that the viewer watches the movie through the panel, and captions appear either on or near the movie image. A company called Cinematic Captioning Systems has a similar reflective system called Bounce Back. A major problem for distributors has been that these systems are each proprietary, and require separate distributions to the theater to enable them to work. Proprietary systems also incur license fees.
For film projection systems,
Digital Theater Systems, the company behind the DTS
surround sound standard, has created a digital captioning device called the DTS-CSS (Cinema Subtitling System). It is a combination of a laser projector which places the captioning (words, sounds) anywhere on the screen and a thin playback device with a
CD that holds many languages. If the Rear Window Captioning System is used, the DTS-CSS player is also required for sending caption text to the Rear Window sign located in the rear of the theater.
Special effort has been made to build accessibility features into digital projection systems (see
digital cinema). Through
SMPTE, standards now exist that dictate how open and closed captions, as well as hearing-impaired and visually impaired narrative audio, are packaged with the rest of the digital movie. This eliminates the proprietary caption distributions required for film, and the associated royalties. SMPTE has also standardized the communication of closed caption content between the digital cinema server and 3rd-party closed caption systems (the CSP/RPL protocol). As a result, new, competitive closed caption systems for digital cinema are now emerging that will work with any standards-compliant digital cinema server. These newer closed caption devices include cupholder-mounted electronic displays and wireless glasses which display caption text in front of the wearer's eyes. Bridge devices are also available to enable the use of Rear Window systems. As of mid-2010, the remaining challenge to the wide introduction of accessibility in digital cinema is the industry-wide transition to SMPTE DCP, the standardized packaging method for very high quality, secure distribution of digital movies.
Sports venues
Captioning systems have also been adopted by most major league and high-profile college
stadium
A stadium ( : stadiums or stadia) is a place or venue for (mostly) outdoor sports, concerts, or other events and consists of a field or stage either partly or completely surrounded by a tiered structure designed to allow spectators to stand o ...
s and
arenas, typically through dedicated portions of their main
scoreboards or as part of balcony
fascia
A fascia (; plural fasciae or fascias; adjective fascial; from Latin: "band") is a band or sheet of connective tissue, primarily collagen, beneath the skin that attaches to, stabilizes, encloses, and separates muscles and other internal organs. ...
LED boards. These screens display captions of the
public address
A public address system (or PA system) is an electronic system comprising microphones, amplifiers, loudspeakers, and related equipment. It increases the apparent volume (loudness) of a human voice, musical instrument, or other acoustic sound sou ...
announcer and other spoken content, such as those contained within in-game segments, public service announcements, and lyrics of songs played in-stadium. In some facilities, these systems were added as a result of discrimination lawsuits. Following a lawsuit under the
Americans with Disabilities Act
The Americans with Disabilities Act of 1990 or ADA () is a civil rights law that prohibits discrimination based on disability. It affords similar protections against discrimination to Americans with disabilities as the Civil Rights Act of 1964 ...
,
FedExField added caption screens in 2006.
Some stadiums utilize on-site captioners while others outsource them to external providers who caption remotely.
Video games
The infrequent appearance of closed captioning in
video games became a problem in the 1990s as games began to commonly feature voice tracks, which in some cases contained information which the player needed in order to know how to progress in the game. Closed captioning of video games is becoming more common. One of the first video game companies to feature closed captioning was
Bethesda Softworks
Bethesda Softworks LLC is an American video game publisher based in Rockville, Maryland. The company was founded by Christopher Weaver in 1986 as a division of Media Technology Limited, and in 1999 became a subsidiary of ZeniMax Media. In its ...
in their 1990 release of ''Hockey League Simulator'' and ''
The Terminator 2029''. Infocom also offered ''
Zork Grand Inquisitor'' in 1997. Many games since then have at least offered subtitles for spoken dialog during
cutscenes, and many include significant in-game dialog and sound effects in the captions as well; for example, with subtitles turned on in the ''
Metal Gear Solid'' series of stealth games, not only are subtitles available during cut scenes, but any dialog spoken during real-time gameplay will be captioned as well, allowing players who can't hear the dialog to know what enemy guards are saying and when the main character has been detected. Also, in many of developer
Valve's video games (such as ''
Half-Life 2'' or ''
Left 4 Dead''), when closed captions are activated, dialog and nearly all sound effects either made by the player or from other sources (e.g. gunfire, explosions) will be captioned.
Video games don't offer Line 21 captioning, decoded and displayed by the television itself but rather a built-in subtitle display, more akin to that of a DVD. The game systems themselves have no role in the captioning either; each game must have its subtitle display programmed individually.
Reid Kimball, a game designer who is hearing impaired, is attempting to educate game developers about closed captioning for games. Reid started th
Games[CC/nowiki>">C">Games
[CC
/nowiki>group to closed caption games and serve as a research and development team to aid the industry. Kimball designed the Dynamic Closed Captioning system, writes articles and speaks at developer conferences. Games[CC]'s first closed captioning project called Doom3[CC] was nominated for an award a
Best Doom3 Mod of the Year
for IGDA's Choice Awards 2006 show.
Online video streaming
Internet video streaming service YouTube offers captioning services in videos. The author of the video can upload a SubViewer (*.SUB), SubRip
SubRip is a free software program for Microsoft Windows which extracts subtitles and their timings from various video formats to a text file. It is released under the GNU GPL. Its subtitle format's file extension is .srt and is widely supported. ...
(*.SRT) or *.SBV file. As a beta feature, the site also added the ability to automatically transcribe and generate captioning on videos, with varying degrees of success based upon the content of the video. However, on August 30, 2020, the company announced that communal captions will end on September 28. The automatic captioning is often inaccurate on videos with background music or exaggerated emotion in speaking. Variations in volume can also result in nonsensical machine-generated captions. Additional problems arise with strong accents, sarcasm, differing contexts, or homonyms.
On June 30, 2010, YouTube announced a new "YouTube Ready" designation for professional caption vendors in the United States. The initial list included twelve companies who passed a caption quality evaluation administered by the Described and Captioned Media Project, have a website and a YouTube channel where customers can learn more about their services and have agreed to post rates for the range of services that they offer for YouTube content.
Flash video also supports captions using the Distribution Exchange profile (DFXP) of W3C timed text format. The latest Flash authoring software adds free player skins and caption components that enable viewers to turn captions on/off during playback from a web page. Previous versions of Flash relied on the Captionate 3rd party component and skin to caption Flash video. Custom Flash players designed in Flex can be tailored to support the timed-text exchange profile, Captionate .XML
Extensible Markup Language (XML) is a markup language and file format for storing, transmitting, and reconstructing arbitrary data. It defines a set of rules for encoding documents in a format that is both human-readable and machine-readable. ...
, or SAMI file (e.g. Hulu
Hulu () is an American subscription streaming service majority-owned by The Walt Disney Company, with Comcast's NBCUniversal holding a minority stake. It was launched on October 29, 2007 and it offers a library of films and television serie ...
captioning). This is the preferred method for most US broadcast and cable networks that are mandated by the U.S. Federal Communications Commission to provide captioned on-demand content. The media encoding firms generally use software such as MacCaption to convert EIA-608 captions to this format. The Silverlight Media Framework also includes support for the timed-text exchange profile for both download and adaptive streaming media.
Windows Media Video can support closed captions for both video on demand streaming or live streaming scenarios. Typically, Windows Media captions support the SAMI file format but can also carry embedded closed caption data.
EBU-TT-D distribution format
supports multiple players across multiple platforms.
QuickTime video supports raw EIA-608 caption data via proprietary closed caption track, which are just EIA-608 byte pairs wrapped in a QuickTime
QuickTime is an extensible multimedia framework developed by Apple Inc., capable of handling various formats of digital video, picture, sound, panoramic images, and interactivity. Created in 1991, the latest Mac version, QuickTime X, is avai ...
packet container with different IDs for both line 21 fields. These captions can be turned on and off and appear in the same style as TV closed captions, with all the standard formatting (pop-on, roll-up, paint-on), and can be positioned and split anywhere on the video screen. QuickTime closed caption tracks can be viewed in Macintosh or Windows versions of QuickTime
QuickTime is an extensible multimedia framework developed by Apple Inc., capable of handling various formats of digital video, picture, sound, panoramic images, and interactivity. Created in 1991, the latest Mac version, QuickTime X, is avai ...
Player, iTunes
iTunes () is a software program that acts as a media player, media library, mobile device management utility, and the client app for the iTunes Store. Developed by Apple Inc., it is used to purchase, play, download, and organize digital mul ...
(via QuickTime), iPod Nano, iPod Classic, iPod Touch
The iPod Touch (stylized as iPod touch) is a discontinued line of iOS-based mobile devices designed and marketed by Apple Inc. with a touchscreen-controlled user interface. As with other iPod models, the iPod Touch can be used as a music pl ...
, iPhone, and iPad.
Theatre
Live plays can be open captioned by a captioner who displays lines from the script
Script may refer to:
Writing systems
* Script, a distinctive writing system, based on a repertoire of specific elements or symbols, or that repertoire
* Script (styles of handwriting)
** Script typeface, a typeface with characteristics of ha ...
and including non-speech elements on a large display screen near the stage. Software is also now available that automatically generates the captioning and streams the captioning to individuals sitting in the theater, with that captioning being viewed using heads-up glasses or on a smartphone or computer tablet.
Telephones
A captioned telephone is a telephone that displays real-time captions of the current conversation. The captions are typically displayed on a screen embedded into the telephone base.
Video conferencing
Some online video conferencing services, such as Google Meet
Google Meet (formerly known as Hangouts Meet) is a video-communication service developed by Google. It is one of two apps that constitute the replacement for Google Hangouts, the other being Google Chat. It replaced the consumer-facing Google ...
, offer the ability to display captions in real time of the current conversation.
Media monitoring services
In the United States especially, most media monitoring services capture and index closed captioning text from news and public affairs programs, allowing them to search the text for client references. The use of closed captioning for television news monitoring was pioneered by Universal Press Clipping Bureau (Universal Information Services) in 1992, and later in 1993 by Tulsa-based NewsTrak of Oklahoma (later known as Broadcast News of Mid-America, acquired by video news release pioneer Medialink Worldwide Incorporated in 1997). US patent 7,009,657 describes a "method and system for the automatic collection and conditioning of closed caption text originating from multiple geographic locations" as used by news monitoring services.
Conversations
Software programs are available that automatically generate a closed-captioning of conversations. Examples of such conversations include discussions in conference rooms, classroom lectures, or religious services.
Non-linear video editing systems and closed captioning
In 2010, Vegas Pro, the professional non-linear editor, was updated to support importing, editing, and delivering CEA-608 closed captions. Vegas Pro 10, released on October 11, 2010, added several enhancements to the closed captioning support. TV-like CEA-608 closed captioning can now be displayed as an overlay when played back in the Preview and Trimmer windows, making it easy to check placement, edits, and timing of CC information. CEA708 style Closed Captioning is automatically created when the CEA-608 data is created. Line 21 closed captioning is now supported, as well as HD-SDI closed captioning capture and print from AJA and Blackmagic Design cards. Line 21 support provides a workflow for existing legacy media. Other improvements include increased support for multiple closed captioning file types, as well as the ability to export closed caption data for DVD Architect, YouTube, RealPlayer, QuickTime, and Windows Media Player.
In mid-2009, Apple released Final Cut Pro version 7 and began support for inserting closed caption data into SD and HD tape masters via FireWire
IEEE 1394 is an interface standard for a serial bus for high-speed communications and isochronous real-time data transfer. It was developed in the late 1980s and early 1990s by Apple in cooperation with a number of companies, primarily Sony an ...
and compatible video capture cards. Up until this time, it was not possible for video editors to insert caption data with both CEA-608 and CEA-708
CTA-708 (formerly EIA-708 and CEA-708) is the standard for closed captioning for ATSC digital television (DTV) streams in the United States and Canada. It was developed by the Consumer Electronics sector of the Electronic Industries Alliance, wh ...
to their tape masters. The typical workflow included first printing the SD or HD video to a tape and sending it to a professional closed caption service company that had a stand-alone closed caption hardware encoder.
This new closed captioning workflow known as e-Captioning involves making a proxy video from the non-linear system to import into a third-party non-linear closed captioning software. Once the closed captioning software project is completed, it must export a closed caption file compatible with the non-linear editing system. In the case of Final Cut Pro 7, three different file formats can be accepted: a .SCC file (Scenarist Closed Caption file) for Standard Definition video, a QuickTime
QuickTime is an extensible multimedia framework developed by Apple Inc., capable of handling various formats of digital video, picture, sound, panoramic images, and interactivity. Created in 1991, the latest Mac version, QuickTime X, is avai ...
608 closed caption track (a special 608 coded track in the .mov file wrapper) for standard-definition video, and finally a QuickTime 708 closed caption track (a special 708 coded track in the .mov file wrapper) for high-definition video output.
Alternatively, Matrox video systems devised another mechanism for inserting closed caption data by allowing the video editor to include CEA-608 and CEA-708 in a discrete audio channel on the video editing timeline. This allows real-time preview of the captions while editing and is compatible with Final Cut Pro 6 and 7.
Other non-linear editing systems indirectly support closed captioning only in Standard Definition line-21. Video files on the editing timeline must be composited with a line-21 VBI graphic layer known in the industry as a "blackmovie" with closed caption data. Alternately, video editors working with the DV25 and DV50 FireWire workflows must encode their DV .avi or .mov file with VAUX data which includes CEA-608 closed caption data.
Logo
The current and most familiar logo for closed captioning consists of two Cs (for "closed captioned") inside a television screen. It was created at WGBH WGBH may refer to:
* WGBH Educational Foundation, based in Boston, Massachusetts, United States
** WGBH (FM), a public radio station at Boston, Massachusetts on 89.7 MHz owned by the WGBH Educational Foundation
** WGBH-TV
WGBH-TV (channel 2), ...
. The other logo, trademarked by the National Captioning Institute, is that of a simple geometric rendering of a television set merged with the tail of a speech balloon
Speech balloons (also speech bubbles, dialogue balloons, or word balloons) are a graphic convention used most commonly in comic books, comics, and cartoons to allow words (and much less often, pictures) to be understood as representing a char ...
; two such versions exist – one with a tail on the left, the other with a tail on the right.
See also
* Speech-to-text reporter (captioner), an occupation
* Fansub
* Same Language Subtitling
* Synchronized Accessible Media Interchange (SAMI) file format
* Sign language on television
* Subtitle (captioning)
Subtitles and captions are lines of dialogue or other text displayed at the bottom of the screen in films, television programs, video games or other visual media. They can be transcriptions of the screenplay, translations of it, or informat ...
* Surtitles
Surtitles, also known as supertitles, SurCaps, OpTrans, are translated or transcribed lyrics/dialogue projected above a stage or displayed on a screen, commonly used in opera, theatre or other musical performances. The word "surtitle" comes from ...
* Synchronized Multimedia Integration Language (SMIL) file format
References
;Sources
* ''Realtime Captioning... The VITAC Way'' by Amy Bowlen and Kathy DiLorenzo (no ISBN)
* ''BBC Subtitles (Captions) Editorial Guideline
'
* ''Closed Captioning: Subtitling, Stenography, and the Digital Convergence of Text with Television'' by Gregory J. Downey ()
* ''The Closed Captioning Handbook'' by Gary D. Robson
Gary D. Robson (May 11, 1958) is an American author from Red Lodge, Montana. He is best known for his children's picture book series entitled ''Who Pooped in the Park?'', which teaches children about animal scat and tracks. The series currentl ...
()
* ''Alternative Realtime Careers: A Guide to Closed Captioning and CART for Court Reporters'' by Gary D. Robson
Gary D. Robson (May 11, 1958) is an American author from Red Lodge, Montana. He is best known for his children's picture book series entitled ''Who Pooped in the Park?'', which teaches children about animal scat and tracks. The series currentl ...
()
* ''A New Civil Right: Telecommunications Equality for Deaf and Hard of Hearing Americans'' by Karen Peltz Strauss ()
* ''Enabling The Disabled'' by Michael Karagosian (no ISBN)
External links
*
Closed Captioning of Video Programming - 47 C.F.R. 79.1
From the Federal Communications Commission Consumer & Governmental Affairs Bureau
FCC Consumer Facts on Closed Captioning
Alan Newell, Inventor of Closed Captioning, Teletext for the Deaf, 1982
*
From the Education Resources Information Center Clearinghouse for ESL Literacy Education, Washington D.C.
Bill Kastner: The Man Behind Closed Captioning
First Sears Telecaption adapter advertised in 1980 Sears catalog
BBC Best Practice Guidelines for Captioning and Subtitling (UK)
EBU-TT-D Subtitling (Captions) Distribution Format
{{Video formats
Subtitling
Assistive technology
Deafness
Television terminology
High-definition television
Transcription (linguistics)
de:Untertitel#Technische Ausführungen