GaitRef

Haidong Zhu*, Wanrong Zheng*, Zhaoheng Zheng, Ram Nevatia,

University of Southern California

Overview

This work combines the silhouettes and skeletons and refines the framewise joint predictions for gait recognition. With temporal information from the silhouette sequences, we show that the refined skeletons can improve gait recognition performance without extra annotations. We compare our methods on four public datasets, CASIA-B, OUMVLP, Gait3D and GREW, and show state-of-the-art performance.

Approach

Our proposed architecture for GaitRef and GaitMix. Trapezoids are trainable modules, and modules of the same color in the same model share the weight. Dashed lines are the operation of feature copying. S and J are the input silhouettes and skeletons. F_S represents silhouette features, while F_J and F_J∗ represent skeleton features from input and refined skeletons, respectively.

Architecture of the skeleton correction network. F_J^P is the skeleton features after average pooling. We concatenate the joint position J with its feature F_J along with the global feature after pooling F_J^P and the silhouette feature F_S before sending it into the decoder for calculating the position difference ∆J for each frame. Decoders at different timestamps share weights.

Results

Visualization of successful and failure refined skeletons with GaitRef. For each example, from left to right, we have original skeletons, silhouette of the nearby timestamp and corrected skeletons from skeleton correction network.

BibTeX

  @misc{zhu2023gaitref,
    title={GaitRef: Gait Recognition with Refined Sequential Skeletons}, 
    author={Haidong Zhu and Wanrong Zheng and Zhaoheng Zheng and Ram Nevatia},
    year={2023},
    eprint={2304.07916},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

Related Work

T. Teepe, A. Khan, J. Gilg, F. Herzog, S. H¨ormann, and G. Rigoll. GaitGraph: Graph Convolutional Network for Skeleton-Based Gait Recognition In ICIP, pages 2314–2318, 2021.
Comment: Combines skeleton poses with Graph Convolutional Network (GCN) to obtain a modern model-based approach for gait recognition.

S. Yan, Y. Xiong, and D. Lin. Spatial temporal graph convolutional networks for skeleton-based action recognition AAAI, 2018.
Comment: Proposed a novel model of dynamic skeletons called SpatialTemporal Graph Convolutional Networks (ST-GCN).