Published August 26, 2014 | Version v1
Conference paper

Representing Visual Appearance by Video Brownian Covariance Descriptor for Human Action Recognition

Description

This paper addresses a problem of recognizing human actions in video sequences. Recent studies have shown that methods which use bag-of-features and space-time features achieve high recognition accuracy. Such methods extract both appearance-based and motion-based features. This paper focuses only on appearance features. We proposeto model relationships between different pixel-level appearance features such as intensity and gradient using Brownian covariance, which is a natural extension of classical covariance measure. While classical covariance can model only linear relationships, Brownian covariance models all kinds of possible relationships. We propose a method to compute Brownian covariance on space-time volume of a video sequence. We show that proposed Video Brownian Covariance (VBC) descriptor carries complementary information to the Histogram of Oriented Gradients (HOG) descriptor. The fusion of these two descriptors gives a significant improvement in performance on three challenging action recognition datasets.

Abstract

International audience

Additional details

Identifiers

URL
https://inria.hal.science/hal-01054943
URN
urn:oai:HAL:hal-01054943v2

Origin repository

Origin repository
UNICA