WASHINGTON, DC - OCTOBER 20: Actress and model Paris Hilton speaks during a news conference outside the U.S. Capitol October 20, 2021 in Washington, DC. Congressional Democrats held a news conference with Hilton to discuss child abuse and legislation to establish a “bill of rights” to protect children placed in congregate care facilities. (Photo by Alex Wong/Getty Images)

Listen to the voices of the sexy babies

Leave a comment November 13, 2022 Angus Andrea Grieve-Smith

A few days ago, Byron Ahn drew our attention to an excerpt from a new, six-hour audiobook, Inside Voice by Lake Bell, credited as an “actress/writer/director/producer.” Bell is a friend of author and podcaster Malcolm Gladwell, and Gladwell agreed to serve as a kind of sounding board for Bell’s ideas about something she calls “sexy baby voice,” pointing to the voices of Paris Hilton and Kim Kardashian as paradigm examples of it. Gladwell, whose company is publishing Inside Voice, also published this excerpt as a free bonus episode of his podcast Revisionist History, which I listen to regularly, although I’m almost two years behind.

Bell argues for a few points: that what she calls “sexy baby voice” is a distinct speech style with specific audible features, that it is particularly inauthentic (she claims several times that it requires effort to speak that way, and describes a coaching technique for helping women to find their “true” voices) and that it makes them sound stupider than Bell knows them to be. She repeatedly assures us that she is not passing judgment, and then uses extremely judgmental language to describe “sexy baby voice,” which I interpret as an application of “love the sinner, hate the sin.”

Ahn posted a series of Twitter threads about the excerpt. He notes that it’s problematic for Bell to criticize women as a self-identified feminist, but he focuses on the terminology that she uses to describe the features of “sexy baby voice,” particularly the word “pitch.” He concludes, “we should encourage public figures talking about voices to consult linguists who have the training.”

I’ve got a lot of thoughts and feelings about this excerpt and Bell’s idea of “sexy baby voice.” I could probably write several blog posts on the practical, cultural and social angles to this. For this post I’m going to keep with Ahn’s focus on what “sexy baby voice” is, phonetically. I sketched some of this out on Ahn’s Twitter thread, and I’ll synthesize and expand that here.

Bell says that the primary feature that defines “sexy baby voice” is “pitch,” and as linguists, we’re trained to interpret “pitch” as the fundamental frequency of the voice – essentially, the lowest pitch produced by the voice at any given time. I’ve been taking singing lessons, and all the singers and singing teachers I’ve talked to use “pitch” in the same way.

Ahn introduces his discussion of the “sexy baby voice” excerpt with a graph of the fundamental frequency of a segment of the recording – throughout the excerpt, Bell uses her own voice to demonstrate the “sexy baby voice” style, even though she says she does not use it in everyday conversation. In the graph he posts, the floor and ceiling of Bell’s fundamental frequency range are not particularly higher when she is using “sexy baby voice” than at other times.

Bell mentions two other factors: “vocal fry” (the linguistic term is “creaky voice”) and “slurring” speech. Ahn speculates that she may be picking up on other factors as well, like “SoCal vowels” or laryngeal constriction. He also acknowledges that “pitch” may refer to other pitch-related features besides fundamental frequency range, such as “uptalk,” a pattern of rising in fundamental frequency at the ends of phrases. Gladwell uses the word “uptalk” when echoing Bell’s explanations, but it’s not clear that he’s referring to phrase-final pitch rise.

So here’s where I come in: my gender expression is fluid, so I’ve been studying differences in vocal quality. When I listen to the samples in the chapter of “sexy baby voice” and … not-sexy-baby-voice (that’s for another post!) given by Bell, both in recordings and her own mimicry, I hear some creaky voice (“vocal fry”), but the main difference I hear is resonance.

This section is going to be a bit of a departure from my normal linguistics blogging, because I have not studied any of the literature on this. My understanding of it comes from practical training, so I don’t know who to cite or credit for any of this besides my teachers, Kristy Bissell and Erin Carney.? Of course, any inaccuracies are most likely due to my misunderstanding of what they’ve tried to teach me!

Resonance is about the pitch of speech, but it’s not about the fundamental frequency. It’s about everything else: the harmonics that result from the way the tones from our vocal folds echo around our bodies and are filtered through different parts of our vocal tracts and nasal passages. Just as plucking a string on an acoustic guitar produces overtones from the guitar body, whenever we arrange our vocal folds to talk or sing we produce overtones: higher pitched frequencies that can harmonize or clash with the fundamental frequency.

There are a ton of things you can do with resonance and it can get really complicated, so let’s focus on the primary resonance difference I’m hearing between Lake Bell’s “sexy baby voice” and the other examples. To me, the “sexy baby voice” examples sound brighter.

Bright and dark are useful terms to evoke the quality of resonance while distinguishing it from fundamental frequency. Bright sounds are ones where we hear more of the higher-pitched harmonics, while in dark sounds the lower harmonics dominate.

As I’ve learned from my teachers, and as Bell demonstrates, there’s a lot we can do with our voices to shift the balance of harmonics towards light or dark, but a substantial part of resonance comes form the structure of our bones, cartilage, muscles and fat. Higher-pitched harmonics tend to come from shorter vocal tracts, smaller nasal cavities, and in general, from smaller bodies. As a result, the voices of smaller people tend to sound brighter.

Testosterone during the teenage years also changes the configuration of our vocal tracts: thickening the vocal folds, making the larynx larger and shifting it lower in the throat. This is why men’s and trans women’s voices tend to sound darker than those of women, girls and prepubescent boys, even when singing the same pitch.

Bodies that see an increase in testosterone after puberty do not get larger or lower larynxes, but do tend to develop thicker vocal folds. This is why many trans men’s voices change, but often sound different from typical men’s voices. It is also, as Bell mentions, why women’s voices often change when they give birth or go through menopause.

As you might have guessed, this is where the “baby” in “sexy baby voice” comes from. Children are smaller than adults and tend to have brighter resonances. It’s also why Bell sees “sexy baby voice” as an exaggerated expression of femininity: women tend to be smaller than men and therefore have brighter voices. Women who haven’t given birth or gone through menopause tend to have brighter voices. Bright resonance suggests youth, femininity and immaturity.

As I mentioned above, there are several things that people can do, consciously or unconsciously, to shift their resonances, and I want to talk about them. I would also love to get into a discussion of the sociopolitical issues that Bell identifies around “sexy baby voice” and women’s voices in general. But this is already pretty long for a blog post, so I’ll save those for another time.

(imit.: dez may be slightly bent spread 5) v type, N typewriter, typist with or without suffix -|| [/BB/v,.

Fonts for Stokoe notation

Leave a comment May 8, 2022 Angus Andrea Grieve-Smith

You may be familiar with the International Phonetic Alphabet, the global standard for representing speech sounds, ideally independent of the way those speech sounds may be represented in a writing system. Did you know that sign languages have similar standards for representing hand and body gestures?

Unfortunately, we haven’t settled on a single notation system for sign languages the way linguists have mostly chosen the IPA for speech. There are compelling arguments that none of the existing systems are complete enough for all sign languages, and different systems have different strengths.

Another difference is that signers, by and large, do not read and write their languages. Several writing systems have been developed and promoted, but to my knowledge, there is no community that sends written messages to each other in any sign language, or that writes works of fiction or nonfiction for other signers to read.

One of the oldest and best-known notation system is the one developed by Gallaudet University professor William Stokoe (u5"_tx) for his pioneering analysis of American Sign Language in the 1960s, which succeeded in convincing many people that ASL is, in ways that matter, a language like English or Japanese or Navajo. Among other things, with his co-authors Dorothy Casterline and Carl Cronenberg Stokoe used this system for the entries in their 1965 Dictionary of American Sign Language (available from SignMedia).? In the dictonary entry above, the sign C_bC_b^r~ is given the English translation of “type.”

Stokoe notation is incomplete in a number of ways. Chiefly, it is optimized for the lexical signs of American Sign Language. It does not account for the wide range of handshapes used in American fingerspelling, or the wide range of locations, orientations and movements used in ASL depicting gestures. It only describes what a signer’s hands are doing, with none of the face and body gestures that have come to be recognized as essential to the grammar of sign languages. Some researchers have produced modifications for other languages, but those are not always well-documented.

Stokoe created a number of symbols, some of which bore a general resemblance to Roman letters, and some that didn’t. This made it impossible to type with existing technology; I believe all the transcriptions in the Dictionary of ASL were written by hand. In 1993 another linguist, Mark Mandel, developed a system for encoding Stokoe notation into the American Standard Code for Information Interchange (ASCII) character set, which by then could be used on almost all American computers.

In September 1995 I was in the middle of a year-long course in ASL at the ASL Institute in Manhattan. I used some Stokoe notation for my notes, but I wanted to be able to type it on the computer, not just using Mandel’s ASCII encoding. I also happened to be working as a trainer at Userfriendly, a small chain of computer labs with a variety of software available, including Altsys Fontographer, and as an employee I could use the workstations whenever customers weren’t paying for them.

One day I sat down in a Userfriendly lab and started modifying an existing public domain TrueType font (Tempo by David Rakowski) to make the Stokoe symbols. The symbols were not in Unicode, and still are not, despite a proposal to that effect on file. I arranged it so that the symbols used the ASCII-Stokoe mappings: if you typed something in ASCII-Stokoe and applied my font, the appropriate Stokoe symbols would appear. StokoeTempo was born. It wasn’t elegant, but it worked.

I made the font available for download from my website, where it’s been for the past 26-plus years. I wound up not using it for much, other than to create materials for the linguistics courses I taught at Saint John’s University, but others have downloaded it and put it to use. It is linked from the Wikipedia article on Stokoe notation.

A few years later I developed SignSynth, a web-based prototype sign language synthesis application. At the time web browsers did not offer much flexibility in terms of fonts, so I could not use Stokoe symbols and had to rely on ASCII-Stokoe, and later Don Newkirk’s (1986) Literal Orthography, along with custom extensions for fingerspelling and nonmanual gestures.

Recently, as part of a project to bring SignSynth (another project of mine) into the 21st Century I decided to explore using fonts on the Web. I discovered a free service, FontSquirrel, that creates Web Open Font Format (WOFF and WOFF2) wrappers for TrueType fonts. I created WOFF and WOFF2 files for StokoeTempo and posted them on my site.

I also discovered a different standard, Typeface.js, which actually uses a JSON format. This is of particular relevance to SignSynth, because it can be used with the 3D web library Three.js. There’s another free service, Facetype.js, that converts TrueType fonts to Typeface.js fonts.

To demonstrate the use of StokoeTempo web fonts, above is a scan of the definition of C_bC_b^r~ from page 51 of the Dictionary of American Sign Language. Below I have reproduced it using HTML and StokoeTempo:

C_bC_b^r~ (imit.: dez may be slightly bent spread 5) v type, r typewriter, typist with or without suffix _____ ?[BB^v.

StokoeTempo is free to download and use by individuals and educational institutions.

Screenshot of LanguageLab displaying the exercise "J'étais certain que j'aillais écrire à quinze ans"

Imagining an alternate language service

Leave a comment April 6, 2022 Angus Andrea Grieve-Smith

It’s well known that some languages have multiple national standards, to the point where you can take courses in either Brazilian or European Portuguese, for example. Most language instruction services seem to choose one variety per language: when I studied Portuguese at the University of Paris X-Nanterre it was the European variety, but the online service Duolingo only offers the Brazilian one.

I looked into some of Duolingo’s offerings for this post, because they’re the most talked about language instruction service these days. I was surprised to discover that they use no recordings of human speakers; all their speech samples are synthesized using an Amazon speech synthesis service named Polly. Interestingly, even though Duolingo only offers one variety of each language, Amazon Polly offers multiple varieties of English, Spanish, Portuguese and French.

As an aside, when I first tried Duolingo years ago I had the thought, “Wait, is this synthesized?” but it just seemed too outrageous to think that someone would make a business out of teaching humans to talk like statistical models of corpus speech. It turns out it wasn’t too outrageous, and I’m still thinking through the implications of that.

Synthesized or not, it makes sense for a company with finite resources to focus on one variety. But if that one company controls a commanding market share, or if there’s a significant amount of collusion or groupthink among language instruction services, they can wind up shutting out whole swathes of the world, even while claiming to be inclusive.

This is one of the reasons I created an open LanguageLab platform: to make it easier for people to build their own exercises and lessons, focusing on any variety they choose. You can set up your own LanguageLab server with exercises exclusively based on recordings of the English spoken on Smith Island, Maryland (population 149), if you like.

So what about excluded varieties with a few more speakers? I made a table of all the Duolingo language offerings according to their number of English learners, along with the Amazon Polly dialect that is used on Duolingo. If the variety is only vaguely specified, I made a guess.

For each of these languages I picked another variety, one with a large number of speakers. I tried to find the variety with the largest number of speakers, but these counts are always very imprecise. The result is an imagined alternate language service, one that does not automatically privilege the speakers of the most influential variety. Here are the top ten:

Language	Duolingo dialect	Alternate dialect
English	Midwestern US	India
Spanish	Mexico	Argentina
French	Paris	Quebec
Japanese	Tokyo	Kagoshima
German	Berlin	Bavarian
Korean	Seoul	Pyongyang
Italian	Florence	Rome
Mandarin Chinese	Beijing	Taipei
Hindi	Delhi	Chhatisgarhi
Russian	Moscow	Almaty

To show what could be done with a little volunteer work, I created a sample lesson for a language that I know, the third-most popular language on Duolingo, French. After France, the country with the next largest number of French speakers is Canada. Canadian French is distinct in pronunciation, vocabulary and to some degree grammar.

Canadian French is stigmatized outside Canada, to the point where I’m not aware of any program in the US that teaches it, but it is omnipresent in all forms of media in Canada, and there is quite a bit of local pride. These days at least, it would be as odd for a Canadian to speak French like a Parisian as for an American to speak English like a Londoner. There are upper and lower class accents, but they all share certain features, notably the ranges of the nasal vowels.

I chose a bestselling author and television anchor, Michel Jean, who has one grandmother from the indigenous Innu people and three presumably descended from white French settlers. I took a small excerpt from an interview with Jean about his latest novel where he responds spontaneously to the questions of a librarian, Josianne Binette.

The sample lesson in Canadian French based on Michel Jean’s speech is available on the LanguageLab demo site. You are welcome to try it! Just log in with the username demo and the password LanguageLab.

The gesture location symbols of Stokoe notation, mapped onto a chart of the upper torso, arm and head

Teaching intro sign phonetics

Leave a comment October 18, 2021 Angus Andrea Grieve-Smith

A few years ago I wrote about incorporating sign linguistics when I taught Introduction to Linguistics at Saint John’s University. The other course I taught most often was Introduction to Phonology. This course was required for our majors in Speech Pathology and Audiology, and they often filled up the class. I never had a Deaf student, but almost all of my students expressed some level of interest in signed languages, and many had taken several semesters of American Sign Language.

The texts I used tended to devote a chapter to sign linguistics here or there, but not present it systematically or include it in general discussions. I always included those chapters, and any mention of signed languages was received enthusiastically by my students, so having a love of sign linguistics myself, I was happy to teach more.

The first thing I did was to add sign phonetics. I had previously found that I needed to start Introduction to Phonology with a comprehensive review of spoken phonetics, so I just followed that with a section on the systematic description of hand, face and upper body gestures. A lot of the spoken phonetics review was focused on phonetic transcription, and the students needed some way to keep track of the gestures they were studying, so I taught them Stokoe notation.

Some of you may be remembering negative things you’ve read, or heard, or said, about Stokoe notation. It’s not perfect. But it’s granular enough for an intro phonology course, and it’s straightforward and relatively transparent. My students had no problem with it. Remember that the appropriate level of granularity depends on what you’re trying to communicate about the language.

I developed charts for the Stokoe symbols for locations, orientations and movements (“tab” and “sig” in Stokoe’s terminology), corresponding to the vowel quadrilateral charts developed by Pierre Delattre and others for spoken languages. To create the charts I used the StokoeTempo font that I developed back in 1995.

The next step was to find data for students to analyze. I instructed my students to watch videos of jokes in American Sign Language posted to YouTube and Facebook by two Deaf storytellers and ASL teachers, Greg “NorthTrue” Eyben and Joseph Wheeler.

Deaf YouTuber NorthTrue makes the ASL sign for “mail”

The first exercise I gave my students was a scavenger hunt. I had previously found them to be useful in studying spoken language features at all levels of analysis. Here is a list of items I asked my students to find in one two-minute video:

A lexical sign
A point
A gesture depicting movement or location
An iconic gesture miming a person’s hand movement
A nonmanual miming a person’s emotion
A grammatical nonmanual indicating question, role shifting or topic

The students did well on the exercises, whether in class, for homework or for exams. Unfortunately that was pretty much all that I was able to develop during the years I taught Introduction to Phonology.

There is one more exercise I created using sign phonology; I will write about that in a future post.

How to set up your own LanguageLab

Leave a comment March 24, 2021 Angus Andrea Grieve-Smith

I’ve got great news! I have now released LanguageLab, my free, open-source software for learning languages and music, to the public on GitHub.

I wish I could tell you I’ve got a public site up that you can all use for free. Unfortunately, the features that would make LanguageLab easy for multiple users to share one server are later in the roadmap. There are a few other issues that also stand in the way of a massive public service. But you can set up your own server!

I’ve documented the steps in the README file, but here’s an overview. You don’t need to know how to program, but you will need to know how to set up web services, retrieve files from GitHub, edit configuration files, and run a few commands at a Linux/MacOS/DOS prompt.

LanguageLab uses Django, one of the most popular web frameworks for Python, and React, one of the most popular frameworks for Javascript. All you need is a server that can run Django and host some Javascript files! I’ve been doing my development and testing on Pythonanywhere, but I’ve also set it up on Amazon Web Services, and you should be able to run it on Google Cloud, Microsoft Azure, a University web server or even your personal computer.

There are guides online for setting up Django in all those environments. Once you’ve got a basic Django setup installed, you’ll need to clone the LanguageLab repo from GitHub to a place where it can be read by your web server. Then you’ll configure it to access the database, and configure the web server to load it. You’ll use Pip and NPM to download the Python and Javascript libraries you need, like the Django REST Framework, React and the Open Iconic font. Finally, you’ll copy all the files into the right places for the web server to read them and restart the server.

Once you’ve got everything in place, you should be able to log in! You can make multiple accounts, but keep in mind that at this point we do not have account-level access, so all accounts have full access to all the data. You can then start building your library of languages, media, exercises and lessons. LanguageLab comes with the most widely used languages, but it’s easy to set up new ones if yours are not on the list.

Media can be a bit tricky, because LanguageLab is not a media server. You can upload your media to another place on your server, or any other server – as long as it’s got an HTTPS URL you should be able to use it. If the media you’re using is copyrighted you may want to set up some basic password protection to avoid any accusations of piracy. I use a simple .htaccess password. I have to log in every time, but it works.

With the URL of your media file, you can create a media entry. Just paste that URL into the form and add metadata to keep track of the file and what it can be used for. You can then set up one or more exercises based on particular segments of that media file. It may take a little trial and error to get the exercises right.

You can then create one or more lessons to organize your exercises. You can choose to have a lesson for all the exercises in a particular media file, or you can combine exercises from multiple media files in a lesson. It’s up to you how to organize the lessons. You can edit the queues for each lesson to reorder or remove exercises.

Once you’ve got exercises, you can start practicing! The principle is simple: listen to the model, repeat into the microphone, then listen to the model again, followed by your recording. Set yourself a goal of a.certain number of repetitions per session.

After you’ve created your language and media entries, exercises and lessons, you can export the data. Importing the data is not yet implemented, but the data is exported to a human-readable JSON format that you can then recreate if necessary.

In the near future I will go on Twitch to demonstrate how to set up exercises and lessons, and how to practice with them. I will also try to find time to demonstrate the installation process. I will record each demonstration and put it on YouTube for your future reference. You can follow me on Twitter to find out when I’m doing the demos and posting the videos.

If you try setting up a LanguageLab, please let me know how it goes! You can report bugs by creating incidents on GitHub, or you can send me an email. I’m happy to hear about problems, but I’d also like to hear success stories! And if you know some Python or Javascript, please consider writing a little code to help me add one of the features in the roadmap!

A free, open source language lab app

1 Comment February 5, 2021 Angus Andrea Grieve-Smith

Viewers of the Crown may have noticed a brief scene where Prince Charles practices Welsh by sitting in a glass cubicle wearing a headset.? Some viewers may recognize that as a language lab. Some may have even used language labs themselves.

The core of the language lab technique is language drills, which are based on the bedrock of all skills training: mimicry, feedback and repetition.? An instructor can identify areas for the learner to focus on.

Because it’s hard for us to hear our own speech, the instructor also can observe things in the learner’s voice that the learner may not perceive.? Recording technology enabled the learner to take on some of the role of observer more directly.

When I used a language lab to learn Portuguese in college, it ran on cassette tapes.? The lab station played the model (I can still remember “Elena, estudante francesa, vai passar as ferias em Portugal?“), then it recorded my attempted mimicry onto a blank cassette.? Once I was done recording it played back the model, followed by my own recording.

Hearing my voice repeated back to me after the model helped me judge for myself how well I had mimicked the model.? It wasn’t enough by itself, so the lab instructor had a master station where he could listen in on any of us and provide additional feedback.? We also had classroom lessons with an instructor, and weekly lectures on culture and grammar.

There are several companies that have brought language lab technology into the digital age, on CD-ROM and then over the internet.? Many online language learning providers rely on proprietary software and closed platforms to generate revenue, which is fine for them but doesn’t allow teachers the flexibility to add new language varieties.

People have petitioned these language learning companies to offer new languages, but developing offerings for a new language is expensive.? If a language has a small user base it may never generate enough revenue to offset the cost of developing the lessons.? It would effectively be a donation to people who want to promote these languages, and these companies are for profit entities.

Duolingo has offered a work-around to this closed system: they will accept materials developed by volunteers according to their specifications and freely donated.? Anyone who remembers the Internet Movie Database before it was sold to Amazon can identify the problems with this arrangement: what happens to those submissions if Duolingo goes bankrupt, or simply decides not to support them anymore?

Closed systems raise another issue: who decides what it means to learn French, or Hindi?? This has been discussed in the context of Duolingo, which chose to teach the artificial Modern Standard Arabic rather than a colloquial dialect or the classical language of the Qur’an.? Similarly, activists for the Hawai’ian language wanted the company to focus on lessons to encourage Hawai’ians to speak the language, rather than tourists who might visit for a few weeks at most.

Years ago I realized that we could make a free, open-source language lab application.? It wouldn’t have to replicate all the features of the commercial apps, especially not initially.? An app would be valuable if it offers the basic language lab functionality: play a model, record the learner’s mimicry, play the model again and finally play the recording of the learner.

An open system would be able to use any recording that the device can play.? This would allow learners to choose the models they practice with, or allow an instructor to choose models for their students.? The lessons don’t have to be professionally produced.? They can be created for a single student, or even for a single occasion.? I am not a lawyer, but I believe they can even use copyrighted materials.

I have created a language lab app using the Django Rest Framework and ReactJS that provides basic language lab functionality.? It runs in a web browser using responsive layout, and I have successfully tested it in Chrome and Firefox, on Windows and Android.

This openness and flexibility drastically reduces the cost of producing a lesson.? The initial code can be installed in an hour, on any server that can host Django.? The monthly cost of hosting code and media can be under $25.? Once this is set up, a media item and several exercises based on it can be added in five minutes.

This reduced cost means that a language does not have to bring in enough learners to recoup a heavy investment.? That in turn means that teachers can create lessons for every dialect of Arabic, or in fact for every dialect of English.? They can create Hawai’ian lessons for both tourists and heritage speakers.? They could even create lessons for actors to learn dialects, or master impressions of celebrities.

As a transgender person I’ve long been interested in developing a feminine voice to match my feminine visual image.? Gender differences in language include voice quality, pitch contour, rhythm and word choice – areas that can only be changed through experience.? I have used the alpha and beta versions of my app to create exercises for practicing these differences.

Another area where it helps a learner to hear a recording of their own voice is singing.? This could be used by professional singers or amateurs.? It could even be used for instrument practice.? I use it to improve my karaoke!

This week I was proud to present my work at the QueensJS meetup.? My slides from that talk contain more technical details about how to record audio through the web browser.? I’ll be pushing my source to GitHub soon. You can read more details about how to set up and use LanguageLab.? In the meantime, if you’d like to contribute, or to help with beta testing, please get in touch!

Angus Grieve-Smith wears a mask of his own design, featuring IPA vowel quadrilaterals on each cheek

Show your vowels and support Doctors Without Borders!

Leave a comment July 16, 2020 Angus Andrea Grieve-Smith

I’m very excited about a new face mask I designed.? You can order it online!

I was inspired by two tweets I saw within minutes of each other on July Fourth.? First, M?d?ric Gasquet-Cyrus, a professor at Aix-Marseille, posted a picture of? his colleague Pascal Rom?as wearing a “triangle vocalique” T-shirt designed by the linguistics YouTuber Romain Filstroff, known as Linguisticae. Gasquet-Cyrus’s tweet translates to “When you eat out with a phonetician colleague, you get a chance to practice your vowel quadrilateral!”

Quand tu manges avec un copain phonéticien ?.
Toute la soirée, tu révises ton trapèze vocalique ! @PascalRomeas pic.twitter.com/klPIwjPe4X
— Médéric Gasquet-Cyrus (@MedericGC) July 4, 2020

The vowel quadrilateral is one of the great data visualizations of linguistics: a two-dimensional diagram of the tongue height and position assigned to the vowel symbols of the Interneational Phonetic Alphabet, as viewed from the left side of the face.?? It is also known as the vowel triangle, depending on how much wiggle room you think people have for their tongues when their mouths are fully open.? It can even be plotted based on the formant frequencies extracted from acoustic analysis.

The second was a tweet by Emily Bender, a professor at the University of Washington, about face masks with a random grid of IPA symbols on them.? These are designed by the Lingthusiasm podcast team of author Gretchen McCulloch and professor Lauren Gawne, using the same pattern as in their popular IPA scarves.

Important update! @lingthusiasm has masks (in the same patterns as their scarves). https://t.co/tG97NZWebw
— Emily M. Bender (@emilymbender) July 4, 2020

Seeing the two pictures one after the other, I realized that rather than a random grid, I could put a vowel quadrilateral on an IPA mask.? Then I realized that if I placed the quadrilateral on one side, I could get it to line up with the wearer’s mouth.? I also had to make a corresponding chart for the right side.

I decided that I wanted the money to go to a charity that was helping with COVID-19.? Doctors Without Borders has been doing good work around the world for years, and with COVID they’ve really stepped up.? Here in New York they provided support to several local organizations and operated two shower trailers in Manhattan at the height of the outbreak.

From July 16 through 29, and then from November 27 through December 28, I ran a fundraiser through Custom Ink where we raised $430 in profits for Doctors Without Borders, and masks were sent to 32 supporters.

There’s another way to get masks!? I have made a slightly different mask design available at RedBubble.com.? You can even get a mug or a phone case.? This is the same store where I’ve been selling Existential Black Swan T-shirts for years.? You can get a mask with the swan on it, if that’s your style.? None of these part of a fundraiser, but you can still donate directly to Doctors Without Borders!

Update, February 1, 2021: There are more virulent strains of COVID spreading, so medical experts are recommending that people wear three-layer masks, or wear a single or double layer mask over a disposable surgical mask.? You should know that the white-on-black Custom Ink masks sold in the fundraisers in 2020 are single layer, and the RedBubble masks sold in 2020 are double layer.? They can both be worn over surgical masks.? Both services are now offering triple-layer masks, so I’ve updated the RedBubble links to the three-layer masks, and will use three-layer masks for any future fundraisers.? Stay safe, everyone!

Seeing the Star Wars movies does not make you a Star Wars fan. Actual Star Wars fans have done some of the following: * Read the novelizations * Read books in the EU * Read new canon books * Read some comics * Watched the animated shows * Participated in SW discussion groups.

Coercing with categories

Leave a comment January 21, 2020 Angus Andrea Grieve-Smith

Recently some guy tweeted “Seeing the Star Wars movies does not make you a Star Wars fan. Actual Star Wars fans have done some of the following…”? This is a great opportunity for me to talk about a particular kind of category fight: coercion.

Over the past several years I’ve written about some things people try to do with categories: watchdogging, gatekeeping, pedantry, eclipsing and splitting. Coercion is similar to gatekeeping, which is where someone highlights category boundaries with the goal of preventing free riders from accessing benefits that they are not entitled to: the example I gave was of Dr. Nerdlove defending the category of “socially awkward men” from incursion by genuinely abusive men. He argues that these abusive men do not deserve the accommodation that is sometimes extended to men who are simply socially awkward.

Coercion is different from gatekeeping in that the person making the accusation is shifting the category boundaries. Ed Powell knows quite well that most people’s definition of “Star Wars fan” includes people who have not done any of the six things he lists. So why is he insisting that “Actual Star Wars fans” have all done some of those things? Because he wants to control the behavior of people who care about whether they are considered Star Wars fans.

Why would someone care about being considered a Star Wars fan? Because fandom is often a communal affair. Fans go to movies and conventions together, and bond over their shared appreciation for Star wars. As Powell says, they may participate in discussion groups. There’s a satisfaction people get in talking about Wookiees or midichlorians with people who share background knowledge and don’t have to ask what a protocol droid is.

I’ve also heard that some people get a sense of belonging from participating in these groups. They may have been teased – and rejected from other groups – for being one of the few Star Wars fans in their high school, especially in the seventies and eighties. There’s a satisfaction and relief in finally finding a group that you share so much with.

Of course, these groups are vulnerable to the dark side. They contain people, and people aren’t necessarily nice just because they’ve been treated badly by other people. Sometimes not even if they’re Star Wars fans. Sometimes people discover they can wield power within a group like that, and they’re not always interested in using that power for good.

One way to wield power is to be able to give people something they want – or to deny it to them. And if people want the sense of belonging to a group, or the enjoyment of participating in group activities, it’s a source of power to be able to control who belongs to the group – and who doesn’t. Some groups are arbitrary: in theory, the only person who gets to decide who belongs to “Brenda’s friends” is Brenda, and the only person who gets to decide who’s invited to Kevin’s party is Kevin.

Other groups are based on categories, like these Meetup groups that are hosting events tomorrow: the New York Haskell Users Group, Black Baby Boomers Just Want to have Fun, or First Time Upper West Side Moms. Or like Star Wars fans. These groups are much less arbitrary: if a woman lives on the Upper West Side with her only child, it’s going to be hard to throw her out.

It’s hard to exclude people from a category-based group, but not impossible. What if our First Time Upper West Side Mom is trans, or a stepmother? Or if she’s a stepmother and a first-time biological mother? Or if she lives on 107th Street? Or if her kid is in college? Because categories are fuzzy, the power to draw category boundaries can be the power to exclude people from group membership. If the group leader doesn’t like our hypothetical mom, all she has to do is draw the boundary of the Upper West Side at 106th Street. Sorry honey, there is no First Time Morningside Heights Moms? Oh gee, what a shame.

The power to exclude doesn’t even need to be exercised. It doesn’t even need to have any direct force to have a chilling effect. Even if the head of your local Star Wars fan club totally owns Ed Powell on Twitter, you still may be wondering if people at the next regional convention are going to look at you funny because you haven’t read Dark Force Rising.

But if you’re not actually going to use this power to exclude people, what do you use it for? This is where the coercion comes in. You can use the threat of exclusion to bully people into doing things. And the easiest way to do that is simply to make doing those things the criteria for inclusion.

So here’s what I think happened: Ed Powell got tired of going to conferences and not having anyone to talk about novelizations and animated series with. All they wanted to talk about was the movies (I can’t imagine why!). So how does Powell get people to read these books? He changes the criteria for what counts as an Actual Star Wars fan. Now they have to read them, or watch the series, if they want to be Actual Star Wars fans.

Now as far as I can tell, Ed Powell is just some guy on Twitter, and has no authority to exclude anyone from any fan club. And he seems to be getting owned by everyone. I doubt that his shaming will have an effect on the general population of Star Wars fans. It may serve as advertising to encourage people who have read these books and watched the animated series to talk with him about them. If it doesn’t turn them off too.

Flu. What is Thisby? A wandring knight? Quin. It is the Lady, that Pyramus must loue. Fl. Nay faith: let not me play a wom?: I haue a beard c?-(ming. Quin. Thats all one: you shall play it in a Maske: and you may speake as small as you will. Bott. And I may hide my face, let me play Thisby to: Ile speake in a monstrous little voice; Thisne, Thisne, ah Py-, ramus my louer deare, thy Thysby deare, & Lady deare. Qu. No, no: you must play Pyramus: & Flute, you Thysby.

The History of English through SparkNotes

1 Comment November 10, 2019 Angus Andrea Grieve-Smith

Language change has been the focus of my research for over twenty years now, so when I taught second semester linguistics at Saint John’s University, I was very much looking forward to teaching a unit focused on change. I had been working to replace constructed examples with real data, so I took a tip from my natural language processing colleague Dr. Wei Xu and turned to SparkNotes.

I first encountered SparkNotes when I was teaching French Language and Culture, and I assigned all of my students to write a book report on a work of French literature, or a book about French language or culture. I don’t remember the details, but at times I had reason to suspect that one or another of my students was copying summary or commentary information about their chosen book from SparkNotes rather than writing their own.

When I was in high school, my classmates would make use of similar information for their book reports. The rule was that you could consult the Cliffs Notes for help understanding the text, but you weren’t allowed to simply copy the Cliffs Notes.

Modern Text

FLUTE
Who?s Thisbe? A knight on a quest?

QUINCE
Thisbe is the lady Pyramus is in love with.

FLUTE
No, come on, don?t make me play a woman. I?m growing a beard.

QUINCE
That doesn?t matter. You?ll wear a mask, and you can make your voice as high as you want to.

BOTTOM
In that case, if I can wear a mask, let me play Thisbe too! I?ll be Pyramus first: ?Thisne, Thisne!??And then in falsetto: ?Ah, Pyramus, my dear lover! I?m your dear Thisbe, your dear lady!?

QUINCE
No, no. Bottom, you?re Pyramus.?And Flute, you?re Thisbe.

When I discovered SparkNotes I noticed that for some older authors – Shakespeare, of course, but even Dickens – they not only offered summaries and commentary, but translations of the text into contemporary English. It was this feature I drew on for the unit on language change.

While I was developing and teaching this second semester intro linguistics course at Saint John’s, I was also working as a linguistic annotator for an information extraction project in the NYU Computer Science Department. I met a doctoral student, Wei Xu, who was studying a number of interesting corpora, including Twitter, hip-hop and SparkNotes. Wei graduated in 2014, and is now Assistant Professor of Computer Science and Engineering at Ohio State.

Wei had realized that the modern translations on SparkNotes and eNotes, combined with the original Shakespearean text, formed a parallel corpus, a collection of texts in one language variety that are paired with translations in another language variety. Parallel corpora, like the Canadian Hansard Corpus of French and English parliamentary debates, are used in translation studies, including for training machine translation software. Wei used the SparkNotes/eNotes parallel Shakespeare corpus to generate Shakespearean-style paraphrases of contemporary movie lines, among other things.

When it came time to teach the unit on language change at Saint John’s, I found a few small exercises that asked students to compare older literary excerpts with modern translations. Given the constraints of this being one unit in a survey course, it made sense to focus on the language of instruction, English. The Language Files had one such exercise featuring a short Chaucer passage. In general, when working with corpora I prefer to look at larger segments, ideally an entire text but at minimum a full page.

I realized that I could cover all the major areas of language change – phonological, morphological, syntactic, semantic and pragmatic – with these texts. Linguists have been able to identify phonological changes from changes in spelling, for example that Chaucer’s spelling of “when” as “whan” indicates that we typically put our tongues in a higher place in our mouths when pronouncing the vowel of that word than people did in the fourteenth century.

When teaching Shakespeare to college students it is common to use texts with standardized spelling, but we now have access to scans of Shakespeare’s work as it was first published in his lifetime or shortly after his death, with the spellings chosen by those printers. This spelling modernization is even practiced with some nineteenth century authors, and similarly we have access to the first editions of most words through digitization projects like Google Books.

With this in mind, I created exercises to explore language change. For a second semester intro course the students learned a lot from a simple scavenger hunt: compare a passage from the SparkNotes translation of Shakespeare with the Quarto, find five differences, and specify whether they are phonological, morphological, syntactic, semantic or pragmatic. In more advanced courses stufents could compare differences more systematically.

This comparison is the kind of thing that we always do when we read an old text: compare older spellings and wordings with the forms we would expect from a more modern text. Wei Xu showed us that the translations and spelling changes in SparkNotes and eNotes can be used for a more explicit comparison, because they are written down based on the translators’ and editors’ understanding of what modern students will find difficult to read.

As I have detailed in my forthcoming book, Building a Representative Theater Corpus, we must be careful not to generalize universal statements, including statements about prevalence, to the language as a whole. This is especially problematic when we are looking at authors who appealed to elite audiences, but it applies to Shakespeare and Dickens as well. Existential observations, such as that Shakespeare used bare not (“let me not”) in one instance where SparkNotes used do-support (“don’t let me”) are much safer.

My students seemed to learn a lot from this technique. I hope some of you find it useful in your classrooms!

What is “text” for a sign language?

3 Comments September 26, 2018 Angus Andrea Grieve-Smith

I started writing this post back in August, and I hurried it a little because of a Limping Chicken article guest written by researchers at the Deafness, Cognition and Language Research Centre at University College London. I’ve known the DCAL folks for years, and they graciously acknowledged some of my previous writings on this issue. I know they don’t think the textual form of British Sign Language is written English, so I was surprised that they used the term “sign-to-text” in the title of their article and in a tweet announcing the article. After I brought it up, Dr. Kearsy Cormier acknowledged that there was potential for confusion in that term.

So, what does “sign-to-text” mean, and why do I find it problematic in this context? “Sign-to-text” is an analogy with “speech-to-text,” also known as speech recognition, the technology that enables dictation software like DragonSpeak. Speech recognition is also used by agents like Siri to interpret words we say so that they can act on them.

There are other computer technologies that rely on the concept of text. Speech synthesis is also known as text-to-speech. It’s the technology that enables a computer to read a text aloud. It can also be used by agents like Siri and Alexa to produce sounds we understand as words. Machine translation is another one: it typically proceeds from text in one language to text in another language. When the DCAL researchers wrote “sign-to-text” they meant a sign recognition system hooked up to a BSL-to-English machine translation system.

Years ago I became interested in the possibility of applying these technologies to sign languages, and created a prototype sign synthesis system, SignSynth, and an experimental English-to-American Sign Language system.

I realized that all these technologies make heavy use of text. If we want automated audiobooks or virtual assistants or machine translation with sign languages, we need some kind of text, or we need to figure out a new way of accomplishing these things without text. So what does text mean for a sign language?

One big thing I discovered when working on SignSynth is that (unlike the DCAL researchers) many people really think that the written form of ASL (or BSL) is written English. On one level that makes a certain sense, because when we train ASL signers for literacy we typically teach them to read and write English. On another level, it’s completely nuts if you know anything about sign languages. The syntax of ASL is completely different from that of English, and in some ways resembles Mandarin Chinese or Swahili more than English.

It’s bad enough that we have speakers of languages like Moroccan Arabic and Fujianese that have to write in a related language (written Arabic and written Chinese, respectively) that is different in non-trivial ways that take years of schooling to master. ASL and English are so totally different that it’s like writing Korean or Japanese with Chinese characters. People actually did this for centuries until someone smart invented hangul and katakana, which enabled huge jumps in literacy.

There are real costs to this, serious costs. I spent some time volunteering with Deaf and hard-of-hearing fifth graders in an elementary school, and after years of drills they were able to put English words on paper and pronounce them when they saw them. But it became clear to me that despite their obvious intelligence and curiosity, they had no idea that they could use words on paper to send a message, or that some of the words they saw might have a message for them.

There are a number of Deaf people who are able to master English early on. But from extensive reading and discussions with Deaf people, it is clear to me that the experience of these kids is typical of that for the vast majority of Deaf people.

It is a tremendous injustice to a child, and a tremendous waste of that child’s time and attention, for them to get to the age of twelve, at normal intelligence, without being able to use writing. This is the result of portraying English as the written form of ASL or BSL.

So what is the written form of ASL? Simply put, it doesn’t have one, despite several writing systems that have been invented, and it won’t have one until Deaf people adopt one. There will be no sign-to-text until signers have text, in their language.

I can say more about that, but I’ll leave it for another post.

Author: Angus Andrea Grieve-Smith