Late Fusion of Multiple Convolutional Layers for Pedestrian Detection
- Others:
- Spatio-Temporal Activity Recognition Systems (STARS) ; Inria Sophia Antipolis - Méditerranée (CRISAM) ; Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)
- Institut Pascal (IP) ; Université Blaise Pascal - Clermont-Ferrand 2 (UBP)-SIGMA Clermont (SIGMA Clermont)-Centre National de la Recherche Scientifique (CNRS)
- VEhicule DEcarboné et COmmuniquant et sa Mobilité (VeDeCom)
Description
We propose a system design for pedestrian detection by leveraging the power of multiple convolutional layers explicitly. We quantify the effect of different convolutional layers on the detection of pedestrians of varying scales and occlusion level. We show that earlier convolutional layers are better at handling small-scale and partially oc-cluded pedestrians. We take cue from these conclusions and propose a pedestrian detection system design based on Faster-RCNN which leverages multiple convolutional layers by late fusion. In our design, we introduce height-awareness in the loss function to make the network emphasize on pedestrian heights which are misclassified during the training process. The proposed system design achieves a log-average miss-rate of 9.25% on the caltech-reasonable dataset. This is within 1.5% of the current state-of-art approach , while being a more compact system.
Abstract
International audience
Additional details
- URL
- https://hal.inria.fr/hal-01926073
- URN
- urn:oai:HAL:hal-01926073v1
- Origin repository
- UNICA