Perceiving Systems, Computer Vision

Emotion Driven Monocular Face Capture and Animation

2022

Conference Paper

As 3D facial avatars become more widely used for communication, it is critical that they faithfully convey emotion. Unfortunately, the best recent methods that regress parametric 3D face models from monocular images are unable to capture the full spectrum of facial expression, such as subtle or extreme emotions. We find the standard reconstruction metrics used for training (landmark reprojection error, photometric error, and face recognition loss) are insufficient to capture high-fidelity expressions. The result is facial geometries that do not match the emotional content of the input image. We address this with EMOCA (EMOtion Capture and Animation), by introducing a novel deep perceptual emotion consistency loss during training, which helps ensure that the reconstructed 3D expression matches the expression depicted in the input image. While EMOCA achieves 3D reconstruction errors that are on par with the current best methods, it significantly outperforms them in terms of the quality of the reconstructed expression and the perceived emotional content. We also directly regress levels of valence and arousal and classify basic expressions from the estimated 3D face parameters. On the task of in-the-wild emotion recognition, our purely geometric approach is on par with the best image-based methods, highlighting the value of 3D geometry in analyzing human behavior. The model and code are publicly available at https://emoca.is.tue.mpg.de.
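The emotion consistency idea described above can be illustrated with a minimal sketch. It assumes the loss is the Euclidean distance between emotion features extracted from the input image and from a rendering of the reconstructed face; the abstract does not spell out the exact form, so this is an assumption rather than the authors' implementation. The names `emotion_consistency_loss` and `toy_emotion_net` are hypothetical, and the toy feature extractor merely stands in for a trained emotion recognition network.

```python
import math
from typing import Callable, Sequence

Features = Sequence[float]


def emotion_consistency_loss(
    emotion_net: Callable[[Sequence[float]], Features],
    input_image: Sequence[float],
    rendered_image: Sequence[float],
) -> float:
    """Euclidean distance between the emotion features of two images.

    Assumption: the perceptual loss compares features in L2 distance.
    `emotion_net` is any callable mapping an image to a feature vector.
    """
    f_in = emotion_net(input_image)
    f_out = emotion_net(rendered_image)
    if len(f_in) != len(f_out):
        raise ValueError("feature vectors must have the same length")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(f_in, f_out)))


def toy_emotion_net(image: Sequence[float]) -> Features:
    """Hypothetical stand-in for a trained emotion network.

    Returns the mean and maximum pixel intensity as a 2-D 'feature'.
    """
    return [sum(image) / len(image), max(image)]


if __name__ == "__main__":
    photo = [0.2, 0.8, 0.5]    # flattened toy "input image"
    render = [0.2, 0.7, 0.5]   # flattened toy "rendered reconstruction"
    print(emotion_consistency_loss(toy_emotion_net, photo, render))
```

In a real training setup the feature extractor would be a differentiable network and the distance would be computed in an autodiff framework, so that gradients can flow back to the regressed expression parameters; the pure-Python version here only shows the shape of the computation.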

Author(s): Radek Daněček and Michael J. Black and Timo Bolkart
Book Title: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022)
Pages: 20279--20290
Year: 2022
Month: June
Publisher: IEEE

Department(s): Perceiving Systems
Bibtex Type: Conference Paper (inproceedings)
Paper Type: Conference

DOI: 10.1109/CVPR52688.2022.01967
Event Name: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022)
Event Place: New Orleans, Louisiana, USA

Address: Piscataway, NJ
ISBN: 978-1-6654-6947-0
State: Published

Links: code, project
Attachments: pdf, supplemental

BibTeX

@inproceedings{EMOCA:CVPR:2022,
  title = {Emotion Driven Monocular Face Capture and Animation},
  author = {Daněček, Radek and Black, Michael J. and Bolkart, Timo},
  booktitle = {2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022)},
  pages = {20279--20290},
  publisher = {IEEE},
  address = {Piscataway, NJ},
  month = jun,
  year = {2022},
  doi = {10.1109/CVPR52688.2022.01967},
  month_numeric = {6}
}