Localization and recognition of human action in 3D using transformers
2024
Article
ps
Understanding a person’s behavior from their 3D motion sequence is a fundamental problem in computer vision with many applications. An important component of this problem is 3D action localization, which involves recognizing what actions a person is performing, and when the actions occur in the sequence. To promote the progress of the 3D action localization community, we introduce a new, challenging, and more complex benchmark dataset, BABEL-TAL (BT), for 3D action localization. Important baselines and evaluating metrics, as well as human evaluations, are carefully established on this benchmark. We also propose a strong baseline model, i.e., Localizing Actions with Transformers (LocATe), that jointly localizes and recognizes actions in a 3D sequence. The proposed LocATe shows superior performance on BABEL-TAL as well as on the large-scale PKU-MMD dataset, achieving state-of-the-art performance by using only 10% of the labeled training data. Our research could advance the development of more accurate and efficient systems for human behavior analysis, with potential applications in areas such as human-computer interaction and healthcare.
Author(s): | Jiankai Sun and Linjiang Huang and Hongsong Wang, Chuanyang Zheng, Jianing Qiu and Md Tauhidul Islam and Enze Xie and Bolei Zhou and Lei Xing and Arjun Chandrasekaran and Michael J. Black |
Journal: | Nature Communications Engineering |
Volume: | 13 |
Number (issue): | 125 |
Year: | 2024 |
Month: | September |
Department(s): | Perceiving Systems |
Bibtex Type: | Article (article) |
Paper Type: | Journal |
DOI: | https://www.nature.com/articles/s44172-024-00272-7 |
Links: |
paper
|
BibTex @article{locate:2024, title = {Localization and recognition of human action in {3D} using transformers}, author = {Sun, Jiankai and Huang, Linjiang and Hongsong Wang, Chuanyang Zheng, Jianing Qiu and Islam, Md Tauhidul and Xie, Enze and Zhou, Bolei and Xing, Lei and Chandrasekaran, Arjun and Black, Michael J.}, journal = {Nature Communications Engineering }, volume = {13}, number = {125}, month = sep, year = {2024}, doi = {https://www.nature.com/articles/s44172-024-00272-7}, month_numeric = {9} } |