Real-Time Audio-Visual Analysis for Multiperson Videoconferencing

المؤلفون المشاركون

Bourlard, Hervé
Scheffler, Carl
Kallinger, Markus
Odobez, Jean-Marc
Motlicek, Petr
Korchagin, Danil
Thiergart, Oliver
Del Galdo, Giovanni
Duffner, Stefan

المصدر

Advances in Multimedia

العدد

المجلد 2013، العدد 2013 (31 ديسمبر/كانون الأول 2013)، ص ص. 1-21، 21ص.

الناشر

Hindawi Publishing Corporation

تاريخ النشر

2013-08-26

دولة النشر

مصر

عدد الصفحات

21

التخصصات الرئيسية

تكنولوجيا المعلومات وعلم الحاسوب

الموضوعات

الملخص EN

We describe the design of a system consisting of several state-of-the-art real-time audio and video processing components enabling multimodal stream manipulation (e.g., automatic online editing for multiparty videoconferencing applications) in open, unconstrained environments.

The underlying algorithms are designed to allow multiple people to enter, interact, and leave the observable scene with no constraints.

They comprise continuous localisation of audio objects and its application for spatial audio object coding, detection, and tracking of faces, estimation of head poses and visual focus of attention, detection and localisation of verbal and paralinguistic events, and the association and fusion of these different events.

Combined all together, they represent multimodal streams with audio objects and semantic video objects and provide semantic information for stream manipulation systems (like a virtual director).

Various experiments have been performed to evaluate the performance of the system.

The obtained results demonstrate the effectiveness of the proposed design, the various algorithms, and the benefit of fusing different modalities in this scenario.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Motlicek, Petr& Duffner, Stefan& Korchagin, Danil& Bourlard, Hervé& Scheffler, Carl& Odobez, Jean-Marc…[et al.]. 2013. Real-Time Audio-Visual Analysis for Multiperson Videoconferencing. Advances in Multimedia،Vol. 2013, no. 2013, pp.1-21.
https://search.emarefa.net/detail/BIM-451975

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Motlicek, Petr…[et al.]. Real-Time Audio-Visual Analysis for Multiperson Videoconferencing. Advances in Multimedia No. 2013 (2013), pp.1-21.
https://search.emarefa.net/detail/BIM-451975

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Motlicek, Petr& Duffner, Stefan& Korchagin, Danil& Bourlard, Hervé& Scheffler, Carl& Odobez, Jean-Marc…[et al.]. Real-Time Audio-Visual Analysis for Multiperson Videoconferencing. Advances in Multimedia. 2013. Vol. 2013, no. 2013, pp.1-21.
https://search.emarefa.net/detail/BIM-451975

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-451975