Captioning Terms

A

Automated Speech Recognition (ASR), also referred to as AI Captioning:

It is the means of producing captions using voice recognition software with no human intervention. 

C

Captioning: 

The process of converting the narration, dialogue, music and sound effects of a video production into text that is displayed on a television screen. The captions are typically white upper-case letters against a black background.

Caption File: 

A computer file that stores a program’s caption information, including the text, timing and placement information. The caption file is used in conjunction with an encoder to create the captioned submaster.

Center Placement Pop-On Captions: 

Pop-on captions are centered at the bottom third of the TV screen. Their placement is similar to subtitles although they are displayed as captions in white letters in a black box. Speaker changes are noted by a dash.

Closed Captions: 

Captions that can only appear with the use of a decoder. The decoder may be either attached to a TV or built into TV’s made after July 1993. Closed captioning allows caption users to enjoy the same broadcast and recorded video materials that other television viewers enjoy. Closed-caption information is carried in Line 21 of the vertical blanking interval of the television signal.

Closed Caption Decoder: 

An electronic device or software program that decodes the captioning signal and causes caption text to appear on the screen. In the 1980’s and early 1990’s, closed caption decoders were the major means by which consumers could watch captioned television. Since July 1, 1993, all television sets with screens 13 inches or larger manufactured for sale in the United States must have a built-in decoder capability.

E

Encoder: 

A device that electronically inserts the caption data into the TV signal. 

Encoding: 

The process of inserting the caption data into the television signal. 

I

IP Delivery:  

Caption data is transmitted via IP instead of via modem. 

L

Line 21:   

Line 21 is how captions were and still and are delivered in the NTSC format. The television signal is comprised of 525 lines. The vertical blanking interval encompasses Line 1 through 21. The caption information in NTSC format resides on Line 21, and active video starts on Line 22.  However, most everyone today watching TV is viewing programs in High Definition (HD) using either the 720P or 1080i formats.  Most if not all cable networks still carry the Standard Definition 480P video feeds as well to support older Digital TV sets not supporting HD.   

Live-display Captions (punchback captions): 

Live-display captions are prepared in advance when an accurate script and/or program recording is available prior to the time a program airs. These roll-up captions are then transmitted in real time, as the program is airing, with live encoding, which is the corresponding process for pop-on captions. The captions appear line-by-line and are synchronized with the broadcast audio as closely as possible.

Open Captions: 

Captions that are visible without using a set-top decoder or a TV with a built-in decoder. When a video is open-captioned, the captions are permanently part of the picture.

P

Pop-on Captions: 

A phrase or sentence appears on the screen all at once – not line by line – stays there for a few seconds and then disappears or is replaced by another full caption. The captions are timed to synchronize with the program and placed on the screen to help identify the speaker. Pop-on captions are used for prerecorded captioning.

Prerecorded (Off-line) Captioning: 

The preparation of captions for recorded programming so that, at the time of air or media playback, the captions are a part of the media. Appearance of captions is usually “pop-on” but could also be “roll-up.” Captions are typically placed in the upper or lower third of the television screen.

R

Real-time (live) Captioning: 

Method of captioning in which captions are simultaneously prepared and transmitted at the time of origination by specially trained real-time captioners or voice recognition software.

Real-time Dictionary: 

A computerized dictionary that is comprised of the phonetics and their corresponding English that the captioner uses to build words and create punctuation. Real-time captioners write phonetically what they hear. Similar to playing chords on a piano, multiple keys are depressed on a steno machine to create different word combinations. No two captioners write exactly the same way, so each has a custom dictionary.

Reformat: 

The process of revising previously captioned programs for rebroadcast, requiring the retiming and/or editing of caption text to synchronize it to the edited video and audio.

Roll-up Captions: 

Roll-up captions roll onto and off the screen in a continuous motion. Usually two to three lines of text appear at one time. As a new line comes along, it appears on the bottom, pushing the other lines on the screen up. Roll-up captions are used for all live captioning and can also be used for prerecorded captioning.

S

Subtitles: 

Permanent on-screen text that represents the narration and dialogue of a program. Subtitles are created with a character generator; no decoding capability is required for viewing them. Subtitles are usually in upper- and lower-case letters and do not appear in a black background. Also, subtitles are typically placed at the bottom center of the screen.

T

Time-code: 

An electronic signal embedded in a videotape that discretely identifies each frame of video.

Timed Roll-Up Captions: 

For prerecorded programming, roll-up captions can be timed to be closely synchronized with the audio.

airlinecomp.jpg