Detection of unique people in news programs using multimodal shot clustering
Download
Author
CM Taskiran, A Albiol, L Torres, EJ Delp
Entry type
article
Abstract
In this paper, we describe an approach that uses a combination of visual and audio features to cluster shots belonging to the same person in video programs. We use color histograms extracted from keyframes and faces, as well as cepstral coefficients derived from audio to calculate pairwise shot distances. These distances are then normalized and combined to a single confidence value which reflects our certainty that two shots contain the same person. We then use an agglomerative clustering algorithm to cluster shots based on these confidence values. We report the results of our system on a data set of approximately 8 hours of programming.
Download
Date
2004 – 10
Journal
Image Processing, 2004. ICIP '04. 2004 International Conference on
Key alpha
Delp
Pages
697-700
Volume
1
Publication Date
2004-10-01

