ONE
A Rhetorical View of Captioning
Four New Principles of Closed Captioning
Closed captioning has been around since 1980—it’s not “new media” by any means—but you wouldn’t know it from the passionate captioning advocacy campaigns, new web accessibility laws, revised international standards, ongoing lawsuits, new and imperfect web-based captioning solutions, corporate feet dragging, and millions of uncaptioned web videos. Situated at the intersection of a number of competing discourses and perspectives, closed captioning offers a key location for exploring the rhetoric of disability in the age of digital media. Reading Sounds offers the first extended study of closed captioning from a humanistic perspective. Instead of treating closed captioning as a legal requirement, a technical problem, or a matter of simple transcription, this book considers how captioning can be a potent source of meaning in rhetorical analysis.
Reading Sounds positions closed captioning as a significant variable in multimodal analysis, questions narrow definitions that reduce captioning to the mere “display” of text on the screen, broadens current treatments of quality captioning, and explores captioning as a complex rhetorical and interpretative practice. This book argues that captioners not only select which sounds are significant, and hence which sounds are worthy of being captioned, but also rhetorically invent words for sounds. Drawing on a number of examples from a range of popular movies and television shows, Reading Sounds develops a rhetorical sensitivity to the interactions among sounds, captions, contexts, constraints, writers, and readers.
This view is founded on a number of key but rarely acknowledged and little-understood principles of closed captioning. Taken together, these principles set us on a path towards a new, more complex theory of captioning for deaf and hard-of-hearing viewers. These principles also offer an implicit rationale for the development of theoretically informed caption studies, a research program that is deeply invested in questions of meaning at the interface of sound, writing, and accessibility.
1. Every sound cannot be closed captioned.
Captioning is not mere transcription or the dutiful recording of every sound. There’s not enough space or reading time to try to provide captions for every sound, particularly when sounds are layered on top of each other in the typical big-budget flick. Multiple soundtracks create a wall of sound: foreground speech, background speech, sound effects, music with lyrics, and other ambient sounds overlap and in some cases compete with each other. Sound is simultaneous; print is linear. It’s not possible to convert the entire soundscape of a major film or TV production into a highly condensed print form. It can also be distracting and confusing to readers when the caption track is filled with references to sounds that are incidental to the main narrative. Caption readers may mistake an ambient, stock, or “keynote” sound (Schafer 1977, 9) for a significant plot sound when that sound is repeatedly captioned. A professional captioner shared the following example with me: Consider a dog barking in an establishing shot of a suburban home. When the dog’s bark is repeatedly captioned, one may begin to wonder if there’s something wrong with that dog. Is that sound relevant to this scene? (See figure 1.2.) Very few discussions of captioning acknowledge or even seem to recognize that captioning, done well, must be a selective inscription of the soundscape, even when the goal is so-called “verbatim captioning.”
2. Captioners must decide which sounds are significant.
If every sound cannot be captioned, then someone has to figure out which sounds should be. Speech sounds usually take precedence over nonspeech sounds, but it’s not that simple. What about speech sounds in the background that border on indistinct but are discernable through careful and repeated listening by a well-trained captioner? Should these sounds be captioned (1) verbatim, (2) with a short description such as (indistinct chatter), or (3) not at all? Answering this question by appealing to volume levels (under the assumption that louder sounds are more important) may downplay the important role that quieter sounds sometimes play in a narrative (see figure 1.3). What is needed is an awareness of how sounds are situated in specific contexts. Context trumps volume level. Only through a complete understanding of the entire program can the captioner effectively interpret and reconstruct it. Just as earlier scenes in a movie anticipate later ones, so too should earlier captions anticipate later ones. In the case of a television series, the captioner may need to be familiar with previous episodes (including, when applicable, the work of other captioners on those episodes) in order to identify which sounds have historical significance. The concept of significance (or “relevant” sounds [see Sydik 2007, 181]) shifts our attention away from captioning as copying and toward captioning as the creative selection and interpretation of sounds.
3. Captioners must rhetorically invent and negotiate the meaning of the text.
The caption track isn’t a simple reflection of the production script. The script is not poured wholesale into the caption file. Rather, the movie is transformed into a new text through the process of captioning it. In fact, as we will see in chapter 4, when the captioner relies too heavily on the script (for example, mistaking ambient sounds for distinct speech sounds), the results can be disastrous. In other cases, words must be rhetorically invented, which is typical for nonspeech sounds. I don’t mean that the captioner must invent neologisms—I issue a warning about neologistic onomatopoeia in chapter 8. Rather, the captioner must choose the best word(s) to convey the meaning of a sound in the context of a scene and under the constraints of space and time. The best way to understand this process, as this book argues throughout, is in terms of a rhetorical negotiation of meaning that is dependent on context, purpose, genre, and audience.
4. Captions are interpretations.
Captioning is not an objective science. The meaning is not waiting there to be written down. While the practice of captioning will present a number of simple scenarios for the captioner, the subjectivity of the captioner and the ideological pressures that shape the production of closed captions will always be close to the surface of the captioned text. The practice of captioning movies and TV shows is typically perfo...