GaitRef: Gait Recognition with Refined Sequential Skeletons
Haidong Zhu*,  Wanrong Zheng*,  Zhaoheng ZhengRam Nevatia
University of Southern California
Overview
This work combines the silhouettes and skeletons and refines the framewise joint predictions for gait recognition. With temporal information from the silhouette sequences, we show that the refined skeletons can improve gait recognition performance without extra annotations. We compare our methods on four public datasets, CASIA-B, OUMVLP, Gait3D and GREW, and show state-of-the-art performance.
Approach
Our proposed architecture for GaitRef and GaitMix. Trapezoids are trainable modules, and modules of the same color in the same model share the weight. Dashed lines are the operation of feature copying. S and J are the input silhouettes and skeletons. FS represents silhouette features, while FJ and FJ∗ represent skeleton features from input and refined skeletons, respectively.
Architecture of the skeleton correction network. FJP is the skeleton features after average pooling. We concatenate the joint position J with its feature FJ along with the global feature after pooling FJP and the silhouette feature FS before sending it into the decoder for calculating the position difference ∆J for each frame. Decoders at different timestamps share weights.
Results
Visualization of successful and failure refined skeletons with GaitRef. For each example, from left to right, we have original skeletons, silhouette of the nearby timestamp and corrected skeletons from skeleton correction network.
BibTeX
  @misc{zhu2023gaitref,
    title={GaitRef: Gait Recognition with Refined Sequential Skeletons}, 
    author={Haidong Zhu and Wanrong Zheng and Zhaoheng Zheng and Ram Nevatia},
    year={2023},
    eprint={2304.07916},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}
Related Work
T. Teepe, A. Khan, J. Gilg, F. Herzog, S. H¨ormann, and G. Rigoll. GaitGraph: Graph Convolutional Network for Skeleton-Based Gait Recognition In ICIP, pages 2314–2318, 2021.
Comment: Combines skeleton poses with Graph Convolutional Network (GCN) to obtain a modern model-based approach for gait recognition.
S. Yan, Y. Xiong, and D. Lin. Spatial temporal graph convolutional networks for skeleton-based action recognition AAAI, 2018.
Comment: Proposed a novel model of dynamic skeletons called SpatialTemporal Graph Convolutional Networks (ST-GCN).