A Video Similarity Measure Combining Alignment, Graphical and Speech Features
Description
A large volume of video content on the web is available today, which demands efficient management. To effectively manage, search, retrieve and copy detection, similarity methods play a critica] role. In this paper, a novel video similarity measure using visual fcatures, alignment distances and speech transcripts is proposed. Video files are represented by a sequence of segments set where each segment contains color histograms, start time and a set of syllables extracted from the speech in the audio track. In a first step, textual, alignment and visual features are extracted. They complement each other and can be further combined to boost the segment similarity. The second step describes how the Maximum Bipaitite Matching and sorne statistical features are. applied to find segments crnTespondences and calculate a global similarity value respectively. Experiments far video similarity were performed on a dataset and promising results were achieved to demonstrate the effectiveness of this method.
Abstract
Ministerio de Ciencia e Innovación TIN2009-14378- C02-01
Additional details
- URL
- https://idus.us.es/handle//11441/146582
- URN
- urn:oai:idus.us.es:11441/146582
- Origin repository
- USE