Abstract
The explosive expansion of the social web makes overwhelming amounts of web videos available, among which there are a large number of near-duplicate videos. Current web video search results rely exclusively on text keywords or usersupplied tags. A search on the keywords of a typical popular video often returns many duplicate and near-duplicate videos in the top results. Efficient near-duplicate web video detection is essential for effective search, retrieval, browsing and annotation. Due to the large variety of near-duplicate web video types ranging from simple formatting to complex editing, accurate detection generally comes at the cost of time complexity, particularly for web scale video applications. On the other hand, timely response to user queries is one important factor that fuels the popularity of the social web. This chapter will review approaches for near-duplicate web video detection from different technical viewpoints: combining global features and local features, integrating content and contextual information, and visual-word based scalable retrieval.
Keywords: Near-Duplicates, Video Copy Detection, Web Video, Content, Context, Local Points, Novelty and Redundancy Detection, Similarity Measure, Data-driven, Video Retreival, Social Web, Bag of Word