eBook - ePub
Foundations of Voice Studies
An Interdisciplinary Approach to Voice Production and Perception
Jody Kreiman, Diana Sidtis
This is a test
Share book
- English
- ePUB (mobile friendly)
- Available on iOS & Android
eBook - ePub
Foundations of Voice Studies
An Interdisciplinary Approach to Voice Production and Perception
Jody Kreiman, Diana Sidtis
Book details
Book preview
Table of contents
Citations
About This Book
Foundations of Voice Studies provides a comprehensive description and analysis of the multifaceted role that voice quality plays in human existence.
- Offers a unique interdisciplinary perspective on all facets of voice perception, illustrating why listeners hear what they do and how they reach conclusions based on voice quality
- Integrates voice literature from a multitude of sources and disciplines
- Supplemented with practical and approachable examples, including a companion website with sound files at www.wiley.com/go/voicestudies
- Explores the choice of various voices in advertising and broadcasting, and voice perception in singing voices and forensic applications
- Provides a straightforward and thorough overview of vocal physiology and control
Frequently asked questions
How do I cancel my subscription?
Can/how do I download books?
At the moment all of our mobile-responsive ePub books are available to download via the app. Most of our PDFs are also available to download and we're working on making the final remaining ones downloadable now. Learn more here.
What is the difference between the pricing plans?
Both plans give you full access to the library and all of Perlegoâs features. The only differences are the price and subscription period: With the annual plan youâll save around 30% compared to 12 months on the monthly plan.
What is Perlego?
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, weâve got you covered! Learn more here.
Do you support text-to-speech?
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Is Foundations of Voice Studies an online PDF/ePUB?
Yes, you can access Foundations of Voice Studies by Jody Kreiman, Diana Sidtis in PDF and/or ePUB format, as well as other popular books in Languages & Linguistics & Phonetics & Phonology. We have over one million books available in our catalogue for you to explore.
Information
1
Introduction
1.1 Why Should We Care About Voice Quality?
Whenever we speak, our voices convey information about us as individuals. Speakers may sound young, or tired, or elated, or distracted. They may sound as if they are drunk, or lying, or ill, or bearing secret, exciting news. By their voices, adult speakers usually reveal whether they are male or female, and in addition, they may signal that they come from Texas, or Wisconsin, or France. Over the telephone or radio we may recognize the speaker as someone we know, or we may form a distinct impression of the physical appearance of someone we have never seen. The impressions listeners gain from voices are not necessarily accurate; for example, everyone has known the surprise of meeting a telephone acquaintance who does not match the mental picture we have previously formed of them. Despite such occasional mismatches, voice quality is one of the primary means by which speakers project their identity â their â physical, psychological, and social characteristicsâ (Laver, 1980, p. 2) or their âauditory faceâ (Belin, Fecteau, and Bedard, 2004) â to the world.
Table 1.1 non-exhaustively summarizes some of the kinds of judgments that listeners make when listening to voices. These human abilities arise from a long evolutionary process, and many animal species, including primates (Cheney and Seyfarth, 1980), wolves (Goldman, Phillips, and Fentress, 1995), penguins (Jouventin and Aubin, 2002), frogs (Bee, 2004), and bats (Balcombe and McCracken, 1992) use vocal quality to signal or perceive size, threat, and kin relationships. Human infantsâ ability to recognize their mothersâ voices is in place at birth (DeCasper and Fifer, 1980), and responses to maternal voices can be measured in utero, suggesting such abilities develop even before birth (Hepper, Scott, and Shahidullah, 1993; Kisilevsky et al., 2003). Voice conveys much of the emotion and attitude communicated by speech (Williams and Stevens, 1972; Banse and Scherer, 1996; Ellgring and Scherer, 1996; Van Lancker and Pachana, 1998; Breitenstein, Van Lancker, and Daum, 2001). Alterations in voice quality relative to the speakerâs normal vocal delivery may signal irony or sarcasm (Van Lancker, Canter, and Terbeek, 1981). Changes in rate and fundamental frequency affect the perceived âcompetenceâ (Brown, Strong, and Rencher, 1974) or credibility (Geiselman and Bellezza, 1977) of a speaker. Voice quality provides cues that indicate order of turn-taking in conversation (Schegloff, 1998; Wells and Macfarlane, 1998) and helps resolve sentential ambiguities (Kjelgaard, Titone, and Wingfield, 1999; Schafer, Speer, Warren, and White, 2000). Listeners may also judge the speakerâs sexual preference (Linville, 1998; Munson and Babel, 2007), status as native or nonnative speaker (Piske, MacKay, and Flege, 2001), and a myriad of personality factors (Scherer, 1979) based on voice quality cues.
Spoken message |
Physical characteristics of the speaker |
Age Appearance (height, weight, attractiveness) Dental/oral/nasal status Health status, fatigue Identity Intoxication Race, ethnicity Sex Sexual orientation Smoker/non-smoker |
Psychological characteristics of the speaker |
Arousal (relaxed, hurried) Competence Emotional status/mood Intelligence Personality Psychiatric status Stress Truthfulness |
Social characteristics of the speaker |
Education Occupation Regional origin Role in conversational setting Social status |
This book describes the manner in which these kinds of information are conveyed to listeners, and how listeners draw conclusions â correctly or incorrectly â about speakers from their voices. Many of the points described are illustrated by recorded examples provided on the accompanying web site.
For example, consider the voice in audio sample 1.1. As you listen to this brief speech sample, you will probably automatically gather information about the speaker. Listeners agree that the speaker is female. Although opinions differ, listeners are likely to think that the speaker is adult but not elderly, cheerful, confident, alert, and in good health. She is American, but does not have a pronounced regional, social, or ethnic accent. She sounds average or slightly above average in height and weight. She seems educated and is speaking carefully. She does not sound like a smoker. You probably do not recognize the voice, but it may remind you of someone you know.
Compare this talker to the voice in audio sample 1.2. This speaker is also female, but the voice sounds like a much older person. She has a New England accent, and the rhythm of her speech is unusual, making her sound rather upper-class or snobby (or merely self-conscious) to some listeners. She is not tired, depressed, or angry, but she is not obviously happy, either, and may be bored. Her voice is somewhat hoarse, suggesting that she is or has been a smoker, but she does not seem ill. Listeners disagree somewhat about her height and weight, but generally estimate that she is average or slightly below average in height, and slightly above average in weight.
The voice of a speaker with a vocal pathology is presented in audio sample 1.3. Even this short sample may produce complex impressions of old age, illness, and unattractiveness, along with a sense of the speakerâs emotions or mood, intelligence, and competence. Patients who develop a voice disorder often complain that the disordered voice is not really their voice, and does not convey who they are. In some cases, patients dislike the image they portray so much that they avoid speaking, resulting in significant social and work-related difficulties. Severe voice quality problems may also interfere with speech intelligibility, creating a handicap in the communication of verbal information (Kempler and Van Lancker, 2002).
The strong impressions conveyed by voice quality are often manipulated by the media for multiple purposes. For example, in the classic film Singinâ in the Rain (Freed, Kelly, and Donen, 1952), the shrill, loud voice of the character Lina Lamont (played by actress Jean Hagen) surprises and amuses because it does not fit her appearance (a beautiful, smiling blonde) or the elegant, poised, sophisticated personality she visually projects. This contrast â a prototypically silly voice in an elegant physique â forms a running joke throughout the film, playing off such lines as, âWhat do you think I am, dumb or something?â spoken in the abrasive voice stereotypically associated with a vulgar, uneducated, shrewish female. More often, voices are selected to fit the intended message. Documentary films enhance credibility through the use of a male narrator whose voice carries the stereotype of an authoritative figure who is solid, mature, calm, highly intelligent, and dignified. In the field of advertising, impressions conveyed by voice quality are integral to establishing a product image. Consider the characteristics projected by the voices typically used in advertisements for luxury automobiles. Low pitch, breathy quality, and a fairly rapid speaking rate produce the image of an intimate message from a mature but energetic male who possesses authority, sex appeal, social status, and âcoolness.â These vocal attributes are appropriate to the economic niche for the product and imply that its owners are powerful, sexy, and affluent.
Given the wide range of information listeners derive from voices, it is not surprising that scholars from many different disciplines have studied the production and perception of voice. Table 1.2 lists some of these disciplines, along with a sampling of typical research questions. These research questions encompass much of human existence, and indicate how central voice quality is to human life.
Discipline | Some typical research questions |
Acoustics | Deriving reliable and meaningful acoustic measures of voices |
Animal behavior | Vocal recognition of kin and social information by nonhuman animals |
Biology | Biological and evolutionary significance of vocalization |
Computer science, signal processing, information | Transmission, measurement, and synthesis of voice |
Forensic science, law enforcement | Reliability and verification of âearwitnessâ testimony; assessment of truthfulness from voice |
Linguistics, phonetics | Meanings of vocal quality in speech |
Medicine: | |
Developmental biology | Infant voice recognition |
Gerontology | Voice quality changes in aging |
Neurology | Brain function underlying vocal behaviors |
Obstetrics | Prenatal voice perception |
Otolaryngology | Voice disorders |
Pediatrics | Childrensâ processing of vocal information |
Physiology | Control of phonation |
Respiration | Role of breathing in vocalization |
Surgery | Effects of surgical interventions in the vocal tract on voice; cosmetic changes for transgendered voices |
Music: | |
Singing | The singing voice: many questions |
Vocal coaching | The effects of training on the voice |
Physics | Vibrating laryngeal tissues; relation of vibration to sound; patterns of airflow through the glottis |
Psychology: | |
Cognitive psychology | Speaker recognition and its causes; interaction between speech recognition and voice quality |
Clinical psychology | Detecting depression, psychopathology, and personality in the human voice |
Social psychology | Voices as signals of social relationships including conversational turn taking, sarcasm, and successful con-artistry |
Neuropsychology | Brain mechanisms underlying the perception and production of voice cuing personal identity as well as mood and motivation |
Psychophysics | Relevant acoustic voice features for perception |
Psycholinguistics | Voice information in meaning comprehension for grammatical structure and nonliteral meanings |
Sociology | Voice types associated with social groups and their development |
Speech science | Normal voice and speech production |
Speech pathology | Effects of vocal pathologies on voice quality |
Theater arts | Voice as artistic instrument |
1.2 What is Voice? What is Voice Quality? The Definitional Dilemma
The terms âvoiceâ and âvoice qualityâ are variously used, sometimes apparently interchangeably, and deriving consistent definitions has not proven easy. Adding to the confusion, authors also discuss a range of specific voice qualities (a creaky voice, a breathy voice), qualities associated with a speakerâs internal or physical state (a sad voice, a tired voice; a sexy voice), and so on, without benefit of a theoretical framework linking all these usages. We attempt to distinguish these meanings usefully by discussing the terms here.
Although a clear definition of voice is a prerequisite to its study, the broad range of functions subserved by voice has made it difficult to provide a single, all-purpose definition that is valid and useful across disciplines, scholarly traditions, and research applications. As voice scientist Johann Sundberg has noted (1987), everyone knows what voice is until they try to pin it down, and several senses of the term are in common use. In scientific usage (and throughout this book), the term âvoiceâ has a physical and physiological base that refers to the acoustic signal (as generated by the voice production system), while âvoice qualityâ refers to the perceptual impression that occurs as a result of that signal, analogous to the distinction between âfrequencyâ (a physical property of vibration) and âpitchâ (a listenerâs sensation). Definitions of voice fall into two general classes. In the first, voice can be defined very narrowly in physiological terms as âsound produced by vibration of the vocal folds.â Were this definition applied, voice would include only those aspects of the signal that are attributable to the action of the vocal folds, and would exclude the acoustic effects of vocal tract resonances, vocal tract excitation from turbulent noise, or anything else that occurs during speech production other than the action of the vocal folds. (Chapter 2 describes the voice production process in detail.) This definition corresponds approximately to the linguistic voicing feature that phonetically...