Part I: Getting to Know the Face
Before we start animating, building, or rigging anything, letâs be sure weâre speaking the same language. In Chapter 1, I talk about talking, pointing out the things that are important in speech visually and isolating the things that are not. Narrowing our focus to lip sync gives a good base from which to build the more complicated aspects of the work later. In Chapter 2, I define and outline, in the same focused way, the top half of the face. In Chapter 3, we zoom back to the entire faceâthe tilt of the head, wrinkles being a good thing, and even parts of the face you didnât know were important.
Each chapter in this part is expanded into a detailed explanation in a later part of the book: Chapter 1 in Part II, Chapter 2 in Part III, and Chapter 3 in Part IV.
Chapter 1: Learning the Basics of Lip Sync
Chapter 2: What the Eyes and Brows Tell Us
Chapter 3: Facial Landmarking
Chapter 1
Learning the Basics of Lip Sync
In modeling for facial animation, mix and match is the name of the game. Instead of building individual specialized shapes for every phoneme and expression, like for an F or a T, weâll build shapes that are broader in their application, like wide or narrow, and use combinations of them to create all those other specialized shapes. On the animation front, itâs all about efficiency. You want to spend your time being creative and animating, not fighting with the complexities that often emerge from having a face with great range. It doesnât sound like thereâs much to these concepts for modeling and animating, and, yeah, they really are small and simpleâbut theyâre huge in their details, so letâs get into them.
Before we can jump into re-creating the things we see and understand on faces, we need to first identify those things we see and understand. Starting on the ground floor, this chapter breaks down the essentials of lip sync. Next, weâll go into how basic speech can be broken into two basic cycles of movement, which is what makes the sync portion of this book so simple. Finally, at the end of this chapter, weâll take those two thingsâwhatâs essential and the two cyclesâand build them into a technique for animating.
- The bare-bones essentials of lip sync
- The two speech cycles
- Starting with whatâs most important: visemes
- Building the simplest sync
The Essentials of Lip Sync
People overcomplicate things. Itâs easy to assume that anything that looks good must also be complex. In the world of 3D animation, where programs are packed with mile after mile of options, tools, and dialog boxes, overcomplication can be an especially easy trap to fall into. Not using every feature available to you is a good start in refining any technique in 3D, and not always using the recommended tools is when youâre really advancing and thinking outside the box. Many programs have controls and systems geared for facial animation, but you can usually find better tools for the job in their arsenals.
If youâre fairly new to 3D, and have dabbled with lip sync, it has probably been frustrating, complicated, difficult, and unrewarding. In the end, most people are just glad to be done with it and regret deciding to involve sync in their project. Weâre starting to see some amazing results come from facial motion capture techniques, but at least for now, thatâs probably beyond the cost range for readers of this book. Automated techniques are always improving too, but so far, they arenât keeping up with what a good animator or capture technique can deliver.
Donât despair. I will get you set up for the sync part of things quickly and painlessly so you can spend your time on performance (the fun stuff!). If your bag is automation, thereâs still a lot of information in here you can use to bump the quality of that up too.
When teased apart properly, the lip sync portion of facial animation is the easiest to understand because itâs the simplest. You see, peopleâs mouths donât do that much during speech. Things like smiles and frowns and all sorts of neat gooey faces are cool, and weâll get to them later, but for now weâre just talking sync. Plain old speech. Deadpan and emotionless and, well, boring, is where our base will be. Now, youâre probably thinking, âHey! My face can do all sorts of stuff! I donât want to create boring animation!â Well, youâre right on both counts: Your face can do all sorts of things, and who really wants to do boring animation? Nobody! For the basics, however, this is a case of learning to walk before you can run. For now, weâre not going to complicate it. If we jumped right into a world with hundreds or even thousands of verbal and emotional poses (which is how they do it in the movies), weâd never get anywhere. So, to make sure youâre ready for the advanced hands-on work later, weâre focusing on the most basic concept now: bare-bones lip sync. When dealing with the essentials of lip sync and studying people, there are just two basic motions. The mouth goes Open/Closed, and it goes Wide/Narrow, as illustrated in Figure 1-1.
At its core, thatâs really all that speech entails. When lip-syncing a character with a plain circle for a mouth (which weâll do in just a minute), the shapes in Figure 1-2 are all thatâs needed to create the illusion of speech.
Your reaction to this very short list of two motions might be, âWhat about poses like F where I bite my lip, or L where I roll up my tongue?â Ignoring that kind of specificity is precisely the point right now. Weâre ignoring those highly specialized shapes and stripping the building blocks down to what is absolutely necessary to be understood visually. If these two rangesâfrom Open to Closed and Wide to Narrowâare all you have to draw on, you become creative with how to utilize them. Things like F get pared back to âsort-of closed.â When you animate this way and stop the animation on the frame where the âsort of closedâ is standing in for an F, it is easy to say, âThatâs not an F!â But in motion, you hardly notice the lack of the specific shapeâand motion is what Iâm really talking about here. You should be less concerned with the individual frames and more concerned with the motion and the impression that it creates. For most animators, there is a strong instinct to add more and more complexity too early in the lip-sync process, but too much detail in the sync can actually detract from the acting.
Animating lip sync is all illusion. What would really be happening isnât nearly as relevant as the impression of what is happening. How about M? You may be thinking, âI need to roll my lips in together to say M, and I canât do that with a wide-narrow-mouth-thingamajig.â Sure you can, or at least you can give the impression in motion that the lips are rolled inâjust close the mouth all the wayâand thatâs usually going to be good enough. When you get the lip sync good enough to create an impression of speech and then focus your energies on the acting, others will also focus on the acting, which is precisely what you want them to do.
Analyzing the Right Things
Let me take you on a small real-world tutorial of what is and what is not important in speech.
Animators have a tendency to slow things down to a super-slow-mo or frame-by-frame level and analyze in excruciating detail what happens so as to re-create it. This is not necessarily a bad thing, but hereâs an example of how that can break down as a method: Look in the mirror, and then slowly and deliberately overenunciate the word pebble: PEH-BULL. Youâre trying to see exactly what happens with your face. Watch all the details of what your lips are doing: the little puff in your cheeks after the B; the way the pursing of your lips for P is different than for B; how your tongue starts its way to the roof of your mouth early in the B sound and stays there until just a split second after the end of the word. Youâd think that all these details give you a better idea ...