Hierarchical Multimodal Adaptive Fusion (HMAF)‎ Network for Prediction of RGB-D Saliency

Joint Authors

Lv, Ying
Zhou, Wujie

Source

Computational Intelligence and Neuroscience

Issue

Vol. 2020, Issue 2020 (31 Dec. 2020), pp.1-9, 9 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2020-11-21

Country of Publication

Egypt

No. of Pages

9

Main Subjects

Biology

Abstract EN

Visual saliency prediction for RGB-D images is more challenging than that for their RGB counterparts.

Additionally, very few investigations have been undertaken concerning RGB-D-saliency prediction.

The proposed study presents a method based on a hierarchical multimodal adaptive fusion (HMAF) network to facilitate end-to-end prediction of RGB-D saliency.

In the proposed method, hierarchical (multilevel) multimodal features are first extracted from an RGB image and depth map using a VGG-16-based two-stream network.

Subsequently, the most significant hierarchical features of the said RGB image and depth map are predicted using three two-input attention modules.

Furthermore, adaptive fusion of saliencies concerning the above-mentioned fused saliency features of different levels (hierarchical fusion saliency features) can be accomplished using a three-input attention module to facilitate high-accuracy RGB-D visual saliency prediction.

Comparisons based on the application of the proposed HMAF-based approach against those of other state-of-the-art techniques on two challenging RGB-D datasets demonstrate that the proposed method outperforms other competing approaches consistently by a considerable margin.

American Psychological Association (APA)

Lv, Ying& Zhou, Wujie. 2020. Hierarchical Multimodal Adaptive Fusion (HMAF) Network for Prediction of RGB-D Saliency. Computational Intelligence and Neuroscience،Vol. 2020, no. 2020, pp.1-9.
https://search.emarefa.net/detail/BIM-1138885

Modern Language Association (MLA)

Lv, Ying& Zhou, Wujie. Hierarchical Multimodal Adaptive Fusion (HMAF) Network for Prediction of RGB-D Saliency. Computational Intelligence and Neuroscience No. 2020 (2020), pp.1-9.
https://search.emarefa.net/detail/BIM-1138885

American Medical Association (AMA)

Lv, Ying& Zhou, Wujie. Hierarchical Multimodal Adaptive Fusion (HMAF) Network for Prediction of RGB-D Saliency. Computational Intelligence and Neuroscience. 2020. Vol. 2020, no. 2020, pp.1-9.
https://search.emarefa.net/detail/BIM-1138885

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1138885