LivingKnowledge goal is to bring a new quality into search and knowledge management technology for more concise, complete and contextualised search results.
The paper “Content Redundancy in YouTube and its Application to Video Tagging” co-written by J. San Pedro, M. Sanderson and M. Sanderson has been published in ACM Transactions on Information Systems, TOIS 2011, on July, 2011.
The emergence of large scale social web communities has enabled users to share online vast amounts of multimedia content. An analysis of the leading social video sharing platform YouTube reveals a high amount of redundancy, in the form of videos with overlapping or duplicated content. In this paper, we use robust content-based video analysis techniques to detect overlapping sequences between videos in YouTube. Based on the output of these techniques, we present an in-depth study of duplication and content overlap in YouTube, and analyze various dependencies between content overlap and meta data such as video titles, views, video ratings, and tags.
As an application, we show that content-based links in YouTube videos can provide useful information for generating new tag assignments. To this end, we propose diﬀerent tag propagation methods for automatically obtaining richer video annotations. Our techniques provide the user with additional information about videos, and lead to enhanced feature representations for applications such as automatic data organization and search. Experiments on video clustering and classiﬁcation as well as a user evaluation demonstrate the viability of our approach.