Video Scene Detection Using Compact Bag of Visual Word Models

المؤلفون المشاركون

Haroon, Muhammad
Baber, Junaid
Ullah, Ihsan
Daudpota, Sher Muhammad
Bakhtyar, Maheen
Devi, Varsha

المصدر

Advances in Multimedia

العدد

المجلد 2018، العدد 2018 (31 ديسمبر/كانون الأول 2018)، ص ص. 1-9، 9ص.

الناشر

Hindawi Publishing Corporation

تاريخ النشر

2018-11-08

دولة النشر

مصر

عدد الصفحات

9

التخصصات الرئيسية

تكنولوجيا المعلومات وعلم الحاسوب

الملخص EN

Video segmentation into shots is the first step for video indexing and searching.

Videos shots are mostly very small in duration and do not give meaningful insight of the visual contents.

However, grouping of shots based on similar visual contents gives a better understanding of the video scene; grouping of similar shots is known as scene boundary detection or video segmentation into scenes.

In this paper, we propose a model for video segmentation into visual scenes using bag of visual word (BoVW) model.

Initially, the video is divided into the shots which are later represented by a set of key frames.

Key frames are further represented by BoVW feature vectors which are quite short and compact compared to classical BoVW model implementations.

Two variations of BoVW model are used: (1) classical BoVW model and (2) Vector of Linearly Aggregated Descriptors (VLAD) which is an extension of classical BoVW model.

The similarity of the shots is computed by the distances between their key frames feature vectors within the sliding window of length L, rather comparing each shot with very long lists of shots which has been previously practiced, and the value of L is 4.

Experiments on cinematic and drama videos show the effectiveness of our proposed framework.

The BoVW is 25000-dimensional vector and VLAD is only 2048-dimensional vector in the proposed model.

The BoVW achieves 0.90 segmentation accuracy, whereas VLAD achieves 0.83.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Haroon, Muhammad& Baber, Junaid& Ullah, Ihsan& Daudpota, Sher Muhammad& Bakhtyar, Maheen& Devi, Varsha. 2018. Video Scene Detection Using Compact Bag of Visual Word Models. Advances in Multimedia،Vol. 2018, no. 2018, pp.1-9.
https://search.emarefa.net/detail/BIM-1118403

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Haroon, Muhammad…[et al.]. Video Scene Detection Using Compact Bag of Visual Word Models. Advances in Multimedia No. 2018 (2018), pp.1-9.
https://search.emarefa.net/detail/BIM-1118403

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Haroon, Muhammad& Baber, Junaid& Ullah, Ihsan& Daudpota, Sher Muhammad& Bakhtyar, Maheen& Devi, Varsha. Video Scene Detection Using Compact Bag of Visual Word Models. Advances in Multimedia. 2018. Vol. 2018, no. 2018, pp.1-9.
https://search.emarefa.net/detail/BIM-1118403

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-1118403