27 March 2024 Multiscale structure aware semantic GCN for occluded human pose estimation
Linhao Xu, Junsheng Wang
Proceedings Volume 13105, International Conference on Computer Graphics, Artificial Intelligence, and Data Processing (ICCAID 2023); 131052I (2024) https://doi.org/10.1117/12.3026386
Event: 3rd International Conference on Computer Graphics, Artificial Intelligence, and Data Processing (ICCAID 2023), 2023, Qingdao, China
Abstract
Human pose estimation has advanced substantially in recent years, but occlusion remains a persistent challenge that leads to missing keypoints, ambiguous predictions, and abnormal poses. This paper introduces a framework with two core components: the Initial Pose Net and the Multi-Scale Structure-Aware Semantic GCN (MSS-GCN). The Initial Pose Net predicts an initial pose, using attention mechanisms to improve feature precision. The MSS-GCN then refines this initial pose to yield the final pose. The MSS-GCN consists of multiple parallel multi-scale subgraphs informed by the human body’s geometric constraints. It uses prior knowledge of human anatomy to learn constraints among keypoints, and it uses feature information to compensate for the lack of semantic information in raw coordinates. The MSS-GCN captures the structure of the human body and handles long-distance keypoint prediction effectively. The module is also designed for easy integration into other networks. Extensive experiments on standard benchmark datasets for occluded human pose estimation show that the method surpasses existing state-of-the-art approaches, improving pose estimation accuracy in occlusion scenarios.
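The abstract does not give the MSS-GCN's equations, so the following is only a minimal sketch of the general idea of parallel multi-scale graph convolutions over a skeleton graph, not the authors' implementation. The 5-joint chain skeleton, the use of k-hop neighbourhoods as "scales", the symmetric normalization, and every name in the code (`EDGES`, `k_hop_adjacency`, `multi_scale_gcn_layer`) are assumptions made for illustration.

```python
import numpy as np

# Hypothetical toy skeleton: 5 joints connected in a chain.
# A real pose model would use a full body graph (an assumption here).
EDGES = [(0, 1), (1, 2), (2, 3), (3, 4)]
NUM_JOINTS = 5


def adjacency(edges, n):
    """Build a symmetric binary adjacency matrix from an edge list."""
    a = np.zeros((n, n))
    for i, j in edges:
        a[i, j] = a[j, i] = 1.0
    return a


def k_hop_adjacency(a, k):
    """Binary matrix linking joints at most k hops apart (self-loops included)."""
    n = a.shape[0]
    reach = np.linalg.matrix_power(a + np.eye(n), k)
    return (reach > 0).astype(float)


def normalize(a):
    """Symmetric normalization D^{-1/2} A D^{-1/2}."""
    d_inv_sqrt = 1.0 / np.sqrt(a.sum(axis=1))
    return a * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def multi_scale_gcn_layer(x, a, weights):
    """Sum parallel graph convolutions, one branch per scale k = 1..len(weights)."""
    out = 0.0
    for k, w in enumerate(weights, start=1):
        a_k = normalize(k_hop_adjacency(a, k))
        out = out + a_k @ x @ w  # aggregate k-hop neighbours, then project
    return np.maximum(out, 0.0)  # ReLU


rng = np.random.default_rng(0)
a = adjacency(EDGES, NUM_JOINTS)
x = rng.normal(size=(NUM_JOINTS, 2))  # stand-in for initial (x, y) joint coordinates
weights = [rng.normal(size=(2, 8)) for _ in range(2)]  # one weight matrix per scale
h = multi_scale_gcn_layer(x, a, weights)
print(h.shape)
```

In this sketch the larger-scale branch lets a joint aggregate information from joints two hops away in a single layer, which is one plausible way to address distant keypoints. Each joint's input could also be extended with image features alongside its coordinates, but how the paper does this is not stated in the abstract.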
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Linhao Xu and Junsheng Wang "Multiscale structure aware semantic GCN for occluded human pose estimation", Proc. SPIE 13105, International Conference on Computer Graphics, Artificial Intelligence, and Data Processing (ICCAID 2023), 131052I (27 March 2024); https://doi.org/10.1117/12.3026386
KEYWORDS
Pose estimation

Semantics

Education and training

Data modeling

Image enhancement

Prior knowledge

Ablation
