Abstract
Human pose estimation (HPE) is the basis of a wide variety of computer vision tasks. However, existing approaches are designed mainly for static 2D images, ignoring the temporal continuity and geometric consistency between video frames. With the aim of resolving the aforementioned issues, Pose-DWT-Former, an enhanced transformer-based approach, was proposed in this paper. The principle is to employ the 2D skeleton-based pose sequences extracted from the video frames and the discrete wavelet domain (DWT) information of those sequences as the network input for the purposes of 3D HPE. The evaluations were performed on two datasets, with good results in speed, accuracy, and robustness against noise, laying a solid foundation for the subsequent use of 3D human posture information for action recognition.
Original language | English |
---|---|
Title of host publication | Proceedings of the UNIfied Conference of DAMAS, IncoME and TEPEN Conferences (UNIfied 2023) - Volume 2 |
Editors | Andrew D. Ball, Huajiang Ouyang, Jyoti K. Sinha, Zuolu Wang |
Publisher | Springer, Cham |
Pages | 41-49 |
Number of pages | 9 |
Volume | 152 |
ISBN (Electronic) | 9783031494215 |
ISBN (Print) | 9783031494208, 9783031494239 |
DOIs | |
Publication status | Published - 29 May 2024 |
Event | The UNIfied Conference of DAMAS, InCoME and TEPEN Conferences - Huddersfield, United Kingdom, Huddersfield, United Kingdom Duration: 29 Aug 2023 → 1 Sep 2023 https://unified2023.org/ |
Publication series
Name | Mechanisms and Machine Science |
---|---|
Publisher | Springer |
Volume | 152 MMS |
ISSN (Print) | 2211-0984 |
ISSN (Electronic) | 2211-0992 |
Conference
Conference | The UNIfied Conference of DAMAS, InCoME and TEPEN Conferences |
---|---|
Abbreviated title | UNIfied 2023 |
Country/Territory | United Kingdom |
City | Huddersfield |
Period | 29/08/23 → 1/09/23 |
Internet address |