Halin, Alfian Abdul; Rajeswari, Mandava; Abbasnejad, Ehsan
This paper presents a framework for soccer event detection through collaborative analysis of the textual, visual and aural modalities. The basic notion is to decompose a match video into smaller segments until ultimately the desired eventful segment is identified. Simple features are considered namely the minute-by-minute reports from sports websites (i.e. text), the semantic shot classes of far and closeup-views (i.e. visual), and the low-level features of pitch and log-energy (i.e. audio)....[Show more]
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.