Unimore logo AImageLab

Action Categorization in Soccer Videos using String Kernels

Abstract: Action recognition is a crucial task to provide high-level semantic description of the video content, particularly in the case of sports videos. The bag-of-words (BoW) approach has proven to be successful for the categorization of objects and scenes in images, but it's unable to model temporal information between consecutive frames for video event recognition. In this paper, we present an approach to model actions as a sequence of histograms (one for each frame) represented using a traditional bag-of-words model. Actions are so described by a string (phrase) of variable size, depending on the clip's length, where each frame's representation is considered as a character. To compare these strings we use Needlemann-Wunsch distance, a metrics defined in the information theory, that deal with strings of different length. Finally, SVMs with a string kernel that includes this distance are used to perform classification. Experimental results demonstrate the validity of the proposed approach and they show that it outperforms baseline kNN classifiers.


Citation:

Lamberto, Ballan; Marco, Bertini; Alberto Del, Bimbo; Serra, Giuseppe "Action Categorization in Soccer Videos using String Kernels" Proc. of IEEE International Workshop on Content-Based Multimedia Indexing (CBMI), Chania, Crete, grc, pp. 13 -18 , June 3-5, 2009, 2009 DOI: 10.1109/CBMI.2009.10

 not available