In a captioned scene from Loudermilk, a salesclerk says to Loudermilk, "I can't help it. This is my voice."

That is not your voice

This is the fifth post in a series inspired by Lake Bell’s audiobook chapter “Sexy Baby Voice.” In previous posts I’ve covered the three key features she uses to define this vocal style – bright resonance (which Bell refers to as “high pitch”), creaky voice (“vocal fry”) and legato articulation (“slurring”), and discussed the various ways that we can manipulate our vocal tracts to create or amplify bright or dark resonances. Now I want to talk about your voice.

Not your voice, but what people mean when they say “your voice.” A friend who’s a vocal coach and read my earlier posts sent me a not-very-funny opening scene from a sitcom called Loudermilk, where the title character (played by Ron Livingstone of Office Space) mocks and insults a young woman who takes his order at a coffee bar. The salesclerk is friendly, prompt and thorough; Loudermilk has no cause for complaint. His abuse is entirely based on his dislike for the sound of her voice.

Anyone who’s read this series or listened to the “Sexy Baby Voice” chapter will recognize three particular features of salesclerk’s voice: bright resonance, creaky voice, legato articulation. The Loudermilk scene could have been inspired by the scenes about “sexy baby voice” in Lake Bell’s 2013 film about the voice-over industry, In a World…

Loudermilk mocks the salesclerk’s creaky voice by using creaky voice in his own responses, and the salesclerk asks “why are you talking like that?” Loudermilk responds, “This is my voice,” and she says, “No, it’s not.” After mocking her voice more and ranting a bit, he says, “just stop doing that.” Her response mirrors the earlier exchange: “I can’t help it, this is my voice,” to which he responds, “No, it’s not.”

As Loudermilk receives his coffee and leaves, the salesclerk, infuriated by his abuse, shouts at his back, “You’re a total dick!” Surprise! She doesn’t use legato articulation or creaky voice – because it’s really fucking hard to shout with either of those features. He turns back and says, “There, there you go, you’re talking!” as though she’d proven his point.

Loudermilk’s insistence that the salesclerk’s use of creaky voice is not “your voice” echoes a deleted scene from In a World… that Lake Bell includes in the audiobook chapter. In the scene, Bell’s character conducts “a vocal experiment” on another character who habitually uses “sexy baby voice.” She asks the other character to count to ten, alternating “the lowest point in your register” (i.e. with dark resonance) on odd numbers with “the highest point in your register” (bright resonance) on even numbers, and then say “Here’s my voice.”

Of course, “Here’s my voice” is the eleventh utterance in the sequence. As an odd-numbered utterance, Bell’s character pronounces it with relatively dark resonance, and the other character follows suit. As with the Loudermilk scene, we’re meant to marvel at the transformation: this woman’s True Voice, stripped of all that sexy baby junk! The message of both scenes is the same: that “sexy baby voice” is fake and women only use it because they’re insecure, but maybe they can be tricked into experiencing the power of their True Voices.

I don’t know about you, but when I first heard the deleted scene with the “vocal experiment,” the first thing I thought of was Elizabeth Holmes, the business executive who is currently in prison for selling a fake technology to investors. In addition to amassing wealth and power through lies and hype, Holmes is famous for having an unusually low voice for a woman – not just dark resonances, but when she speaks publicly, her fundamental frequency is in the range more typically used by American men.

During the height of Holmes’s success, several people felt that her claims were too good to be true, and they suspected her voice of being fake too. When recordings surfaced of Holmes speaking in a more typical pitch range for an American woman, that was presented as casting doubt on her honesty in general. Is her voice as big a fraud as her company?

I’ll have more to say about the notion of “your voice” and what it means to accuse someone of habitually using a fake voice, but astute observers may note that this double bind – don’t talk too “high-pitched,” but don’t talk too low-pitched either! – is an echo to the double-binds put on women in all kinds of areas – be assertive but not bossy! be attractive but not slutty!

Slurring sexy babies

Recently I’ve written a few posts in response to the notion of “sexy baby voice” in Lake Bell’s latest audiobook. Bell identifies “sexy baby voice” with three characteristic features: “high pitch” (which I argue is actually bright resonance), “vocal fry” (what phoneticians call creaky voice) and “slurring.” I’ve argued that while bright resonance can be controlled to some degree, it is characteristic of youth and femininity, and that creaky voice is the only way that some young woman can add darker resonance (and hence a bit of gravitas) without sounding tomboyish or fussy.

I wanted to write a quick post about Bell’s third criterion, “slurring,” which Gladwell summaries as “running some words together,” and “sentences without spaces.” Bell’s caricature of slurring gets to the point where she sounds like she’s doing an impression of a drunk sorority girl, but in moderation this is a well-documented pattern of speech variation: some people are noted for short, quick transitions from one speech segment to the next and from one intonational pitch to the next, known as “staccato” articulation, while others take these transitions more gradually, designated by the Italian word “legato.”

Guess what the legato vs. staccato articulation patterns are associated with? Gender. I learned it from my voice teachers, Kristy Bissell and Erin Carney, as part of lessons on developing gender expression in the voice. I’m not familiar with research on this in phonetics, if any has been done.

Basically, staccato articulation is stereotypically associated with men barking orders, while legato articulation is associated with women discussing things in soft, flowing ways. Yes, these are stereotypes, and we can all think of women who bark orders and men with soft, legato articulation. But those women are perceived as acting masculine when they speak with staccato articulation, and men speaking legato are perceived as speaking in feminine ways.

It’s understandable why the use of legato articulation bothers Lake Bell so much: it’s the antithesis of a particular voice-over style that she admires. In her chapter she includes an audio clip of a film she made in 2013, In a World… Before listening to this chapter I had never heard of her or the film, but I discovered that it was seen by a fairly large number of people, and generally well appreciated. That film introduced the general public to her idea of “sexy baby voice,” and was discussed by Mark Liberman in a series of LanguageLog posts.

The name of the film references the famous phrase “In a world…” used in voice-over tracks to introduce trailers for science-fiction action films. In the film, Bell’s character is competing to be the first woman to voice these kinds of macho trailers. The thesis of the film is that women are just as capable as men of delivering this punchy, aggressive style of speech, and are being held back from that success by what else? “Sexy baby voice.”

Even without going to the hypermasculine extent of action film voice-overs, Bell is implicitly endorsing the management-consultant approach to voice and gender that treats any bias against women’s speech as evidence of a deficiency in the women’s speech itself, a deficiency that can be remedied with enough courses in proper speaking. This is extensively debunked by linguists like Deborah Cameron and Lisa Davidson in articles that I linked from previous posts.

So there we have the three features of “sexy baby voice”: bright resonance, which is an indicator of youth and femininity; creaky voice, which is one of a handful of strategies available to young women to darken their resonance, and legato articulation, which is also an indicator of femininity. If we find this in women who are actually young, it basically means that they want to get away from girlish voices without sounding like tomboys or fussy older women. Judging young women for this strikes me as unfair and mean-spirited.

I have to point out, however, that young women are not the main target of Bell’s “sexy baby voice” tirades. Her ire is directed at older women who, she argues, have other ways of accessing dark resonance but use bright resonance with creaky voice anyway. I’ll address that in another post!

Youth, authority, gender and creaky voice

Recently I’ve written two posts about bright resonance in response to Lake Bell’s audiobook chapter, “Sexy Baby Voice.” Bell describes “sexy baby voice” as having three characteristic features: “high pitch”, “vocal fry” and “slurring.” My first post supported Byron Ahn’s analysis that found that Bell’s “sexy baby voice” samples didn’t have reliably higher pitch than the non-“sexy baby voice” samples, and suggested that she’s probably talking about bright resonance. My second post drew on phonetic and pedagogical research to confirm Bell’s claim that while resonance is constrained by the size and shape of our vocal tracts, it can be consciously controlled to a certain degree.

In this post I want to connect bright resonance (what Bell calls “high pitch”) with creaky voice (“vocal fry”). The original reason they’re used together is youth.

Bell’s argument is that “sexy baby voice” keeps women from being taken seriously, so let’s imagine a young woman who wants to be taken seriously when she talks. Let’s say it’s 1990, and this woman is named Heather, and she has important things to say, whether it’s in a speech or in conversation. And importantly for our purposes, Heather is trendy and feminine.

On some level Heather is aware that dark resonance adds gravitas to speech. But she’s young, she’s petite, she hasn’t given birth and she doesn’t smoke, so she has a relatively short vocal tract and thin vocal folds. This means that without using any of the vocal habits I described in my last post, Heather’s voice will sound girlish, and will risk being prejudged as immature and unserious.

Heather may try some of those habits and find them wanting. She’s already avoiding twang and nasal resonance, which would make her voice sound even brighter. She could try rounding and protruding her lips and using the furthest-back tongue articulations, the time-honored strategy of boys and tomboys. But here’s the thing: she doesn’t want to sound too masculine. She wants to be feminine, but taken seriously. And maybe even sexy.

Another strategy, lowering the larynx, also clashes with the style she wants. It sounds too formal, too grand dame, too fussy. Not at all trendy or stylish.

Let’s imagine that after trying all these strategies, Heather’s a little tired and resigned. She relaxes her voice and it drops into creak. And it doesn’t sound fussy or tomboyish, but it has dark resonance. Maybe it even sounds a bit fashionably blasé!

And from a completely personal view, I just want to say that I do find creaky voice adds a bit of gravitas, and it can be very sexy. When I hear a woman with creaky voice combined with bright overtones, I get an impression of smallness in bigness. I think of creaky voice as the oversize sweater, boyfriend shirt or even mom jeans of the voice.

So Heather starts using creak whenever she wants to be taken seriously. And because she’s trendy, other young women imitate her. Heather is Creaker Zero of late twentieth century “vocal fry.”

Is that the way it actually happened? I have no idea. But it’s a possible scenario. And the scorn that’s been heaped on “vocal fry” over the past thirty plus years has been a potent example of the double bind that women are placed in so many times. Not enough dark resonance? Girlish. Rounded lips? Transgressing gender. Lowered larynx? Fussy. Creaky voice? You’re destroying your voice!

A lot of the politics of women’s voices has been covered by linguists I respect and admire, so for most of this I’ll just refer you to the responses of Deborah Cameron, Penny Eckert and Lisa Davidson to the 2015 “vocal fry” panic, and radio producer Katie Mingle’s all-purpose response to criticism of women’s voices.

This is one area where Malcolm Gladwell failed in this chapter. Gladwell is the producer of Bell’s audiobook and a friend of Bell, and in the chapter she turns to him for feedback. His biggest strength is the ability to find experts and present their ideas in ways that engage a broader audience, but in this chapter he doesn’t talk to Cameron, Eckert, Davidson or even Mingle. He just sits there and gives his own opinions, even conflating “high pitch” with “uptalk.” In his defense, it is possible that he tried to refer Bell to experts, but we don’t hear about it.

Controlling the brightness of the voice

A few weeks ago I posted about “Sexy baby voice,” the topic of a chapter in Lake Bell’s audiobook about the culture and politics of voices. Bell identified three characteristics of “sexy baby voice” in women: high pitch, “vocal fry” (creaky voice) and “slurring.”

In phonetics, “pitch” is generally understood to refer to the fundamental frequency of the speech signal, but on Twitter the phonetician Byron Ahn posted the results of a computer analysis of some of the examples Bell gave for “high pitch” and pointed out that their fundamental frequencies weren’t much higher than the examples she gave for “normal” speech. In my post, I suggested that Bell is probably referring to the frequencies of harmonics in the speech, also called “resonance” or “formants.” It sounds like the most salient feature of “sexy baby voice” is bright resonance.

As I discussed in my post, bright resonance is generally associated with youth and femininity, because it’s usually caused by small vocal folds in small vocal tracts, and women and children tend to be smaller and have smaller vocal folds. Even younger women tend to have brighter resonance than older women, primarily because of the affect of hormonal changes during childbirth and menopause.

Of course, as Bell demonstrates repeatedly in her chapter, bright resonance can also be controlled, either consciously in the moment or subconsciously through habit and training. I’ve learned about these ways over the years, as a linguistics doctoral student, as a transgender woman and as an amateur singer. I’ll go through all the ways I know to do this.

Diagram of the vocal tract produced by the U.S. Centers for Disease Control and Prevention, and distributed by Wikimedia Commons.

As with the previous post in this series, my knowledge comes from training, not reading, so I don’t know who to credit for figuring all this out about the vocal tract. For now I will credit my primary teachers: the vocal coaches Kristy Bissell and Erin Carney, and the phoneticians Jacques Filliolet, Karen Landahl, Alex Francis and Doug Honorof.

For people who haven’t studied the anatomy of the vocal tract, this will get a little technical. In this blog post I’m going to use all the technical language, but if there’s a particular area that you feel could use more explanation for a general audience, please let me know.

Let’s start with the larynx and move through the vocal tract with the breath. The vocal folds generate sound through their vibration. When closing completely they generate a relatively coherent sound wave, but they can add dark resonance by maintaining gaps of particular sizes to allow low-frequency vibrations, what we call creaky voice or “vocal fry.” Similarly, they can add bright resonance by allowing turbulent air to flow through, causing breathy voice.

Just above the larynx is a tube called the pharynx. We can add bright resonance by constricting the pharynx, a practice that vocal coaches call “twang.” The name confused me for a while, because I associate the word “twang” with the Southern vowel shift, but in this case it refers to the narrowing of the pharynx.

The velum is a flap of muscle that we open to allow air to flow through the nose. When we allow air to flow through the nose and mouth at the same time it produces nasal resonance, which adds brighter resonance.

We use our tongues to produce consonants and vowels, raising a part of the tongue towards the roof of our mouths, so a /d/ sound is formed by touching the front of the mouth, and a /g/ sound by touching further back. For each of these sounds there is a range of positions along the roof of our mouth. When we raise our tongues further forward within the range for that sound, we generate brighter resonance. We can also generate bright resonance by flattening the tongue, allowing it to be raised higher. There is extensive research showing that women and gay men tend to have brighter resonance on their /s/ phonemes, and that people who make brighter /s/ sounds tend to be heard as women or gay men, even if they aren’t.

The lips are the gates that release our voices to the air outside. Rounded and protruded lips can produce darker resonance, and spread lips (in a smile or similar shape) can produce lighter resonance. I remember hearing about a study showing that even before puberty, boys tend to round their lips to sound more masculine.

One thing that makes this confusing is that all these vocal tract configurations have other functions. Creaky voice can be a sign of fatigue. Breathy voice can be a sign of relaxation. Pharyngeal constriction, nasal resonance, place of articulation and lip rounding can each change one word into another word with a completely different meaning, in Arabic, French, English and other languages.

These articulations can also interact with each other and with the fundamental frequency of the voice in different ways. At low frequencies, breathy voice can sound sympathetic or sexy, but at high frequencies it can sound weak and vulnerable. This may be what you want to project, or it may not. Nasal resonance and pharyngeal constriction can sound forced or strident, obnoxious or insensitive.

The bottom line is that these aspects of the voice are all under some degree of conscious control. How much control a speaker has, and how conscious they are, depends on a lot of factors, but the takeaway for Bell’s chapter is that people with smaller vocal tracts can use these techniques to speak with darker resonance than they would without them, and people with larger vocal tracts can use them to speak with brighter resonance than they otherwise would.

Note that I’m using the term “otherwise.” The terms I want to avoid, for this post at least, are “natural,” “authentic,” “real” and “your/my/their voice.” The tension between biological constraints, habit and conscious control is what makes resonance so fraught, politically, culturally and socially, which is why Bell and others have such intense feelings about it. That’s for another post.