Second-order Temporal Pooling for Action Recognition
-
Altmetric Citations
Cherian, Anoop; Gould, Stephen
Description
Deep learning models for video-based action recognition usually generate features for short clips (consisting of a few frames); such clip-level features are aggregated to video-level representations by computing statistics on these features. Typically zero-th (max) or the first-order (average) statistics are used. In this paper, we explore the benefits of using second-order statistics.Specifically, we propose a novel end-to-end learnable feature aggregation scheme, dubbed temporal correlation...[Show more]
Collections | ANU Research Publications |
---|---|
Date published: | 2018-08-19 |
Type: | Journal article |
URI: | http://hdl.handle.net/1885/238332 |
Source: | International Journal of Computer Vision |
DOI: | 10.1007/s11263-018-1111-5 |
Access Rights: | Open Access |
Download
File | Description | Size | Format | Image |
---|---|---|---|---|
1704.06925.pdf | Author Accepted Manuscript | 1.91 MB | Adobe PDF | ![]() Request a copy |
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.
Updated: 17 November 2022/ Responsible Officer: University Librarian/ Page Contact: Library Systems & Web Coordinator