Pose-DWT-Former: An Improved Transformer-Based 3D Human Pose Estimation Model

Yajuan Wei, Chuan Dai, Zhijie Xu, Minsi Chen, Ying Liu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Human pose estimation (HPE) is the basis of a wide variety of computer vision tasks. However, existing approaches are designed mainly for static 2D images, ignoring the temporal continuity and geometric consistency between video frames. With the aim of resolving the aforementioned issues, Pose-DWT-Former, an enhanced transformer-based approach, was proposed in this paper. The principle is to employ the 2D skeleton-based pose sequences extracted from the video frames and the discrete wavelet domain (DWT) information of those sequences as the network input for the purposes of 3D HPE. The evaluations were performed on two datasets, with good results in speed, accuracy, and robustness against noise, laying a solid foundation for the subsequent use of 3D human posture information for action recognition.

Original languageEnglish
Title of host publicationProceedings of the UNIfied Conference of DAMAS, IncoME and TEPEN Conferences (UNIfied 2023) - Volume 2
EditorsAndrew D. Ball, Huajiang Ouyang, Jyoti K. Sinha, Zuolu Wang
PublisherSpringer, Cham
Pages41-49
Number of pages9
Volume152
ISBN (Electronic)9783031494215
ISBN (Print)9783031494208, 9783031494239
DOIs
Publication statusPublished - 29 May 2024
EventThe UNIfied Conference of DAMAS, InCoME and TEPEN Conferences - Huddersfield, United Kingdom, Huddersfield, United Kingdom
Duration: 29 Aug 20231 Sep 2023
https://unified2023.org/

Publication series

NameMechanisms and Machine Science
PublisherSpringer
Volume152 MMS
ISSN (Print)2211-0984
ISSN (Electronic)2211-0992

Conference

ConferenceThe UNIfied Conference of DAMAS, InCoME and TEPEN Conferences
Abbreviated titleUNIfied 2023
Country/TerritoryUnited Kingdom
CityHuddersfield
Period29/08/231/09/23
Internet address

Cite this