Multimodal Semantics Extraction from User-Generated Videos

Publication date

2012-05-17

Country of publication

Egypt

Number of pages

Main subjects

Electrical Engineering
Information Technology and Computer Science

Topics

Video recordings

Abstract (EN)

User-generated video content has grown tremendously fast to the point of outpacing professional content creation.

In this work we develop methods that analyze contextual information of multiple user-generated videos in order to obtain semantic information about public happenings (e.g., sport and live music events) being recorded in these videos.

One of the key contributions of this work is the joint use of different data modalities, including data captured by auxiliary sensors during each user's video recording.

In particular, we analyze GPS data, magnetometer data, accelerometer data, and video and audio content data.

We use these data modalities to infer information about the event being recorded, in terms of layout (e.g., stadium), genre, indoor versus outdoor scene, and the main area of interest of the event.

Furthermore, we propose a method that automatically identifies the optimal set of cameras to be used in a multicamera video production.

Finally, we detect the camera users who fall within the field of view of other cameras recording at the same public happening.

We show that the proposed multimodal analysis methods perform well on various recordings obtained in real sport events and live music performances.

American Psychological Association (APA) citation style

Cricri, Francesco & Dabov, Kostadin & Roininen, Mikko J. & Mate, Sujeet & Curcio, Igor D. D. & Gabbouj, Moncef. 2012. Multimodal Semantics Extraction from User-Generated Videos. Advances in Multimedia, Vol. 2012, no. 2012, pp. 1-17.
https://search.emarefa.net/detail/BIM-460945

Modern Language Association (MLA) citation style

Cricri, Francesco…[et al.]. Multimodal Semantics Extraction from User-Generated Videos. Advances in Multimedia No. 2012 (2012), pp. 1-17.
https://search.emarefa.net/detail/BIM-460945

American Medical Association (AMA) citation style

Cricri, Francesco & Dabov, Kostadin & Roininen, Mikko J. & Mate, Sujeet & Curcio, Igor D. D. & Gabbouj, Moncef. Multimodal Semantics Extraction from User-Generated Videos. Advances in Multimedia. 2012. Vol. 2012, no. 2012, pp. 1-17.
https://search.emarefa.net/detail/BIM-460945

Data type

Articles

Text language

English

Notes

Includes bibliographical references

Record number

BIM-460945
