Monocular weakly supervised depth and pose estimation method based on multi-information fusion

المؤلفون المشاركون

Qiao, Jianzhong
Zhang, Zhimin
Lin, Shukuan

المصدر

Kuwait Journal of Science

العدد

المجلد 49، العدد 3 (31 يوليو/تموز 2022)، ص ص. 1-23، 23ص.

الناشر

جامعة الكويت مجلس النشر العلمي

تاريخ النشر

2022-07-31

دولة النشر

الكويت

عدد الصفحات

23

التخصصات الرئيسية

تكنولوجيا المعلومات وعلم الحاسوب

الملخص EN

The depth and pose information are the basic issues in the field of robotics, autonomous driving, and virtual reality, and are also the focus and difficult issues of computer vision research.

The supervised monocular depth and pose estimation learning are not feasible in environments where labeled data is not abundant.

Self-supervised monocular video methods can learn effectively only by applying photometric constraints without expensive ground true depth label constraints, which results in an inefficient training process and suboptimal estimation accuracy.

To solve these problems, a monocular weakly supervised depth and pose estimation method based on multi-information fusion is proposed in this paper.

First, we design a high-precision stereo vision method to generate a depth and pose data as the "Ground Truth" labels to solve the problem that the ground truth labels are difficult to obtain.

Then, we construct a multi-information fusion network model based on the "Ground truth" labels, video sequence, and IMU information to improve the estimation accuracy.

Finally, we design the loss function of supervised cues based on "Ground Truth" labels cues and selfsupervised cues to optimize our model.

In the testing phase, the network model can separately output high-precision depth and pose data from a monocular video sequence.

The resulting model outperforms mainstream monocular depth and poses estimation methods as well as the partial stereo matching method in the challenging KITTI dataset by only using a small number of real training data(200 pairs)

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Zhang, Zhimin& Qiao, Jianzhong& Lin, Shukuan. 2022. Monocular weakly supervised depth and pose estimation method based on multi-information fusion. Kuwait Journal of Science،Vol. 49, no. 3, pp.1-23.
https://search.emarefa.net/detail/BIM-1502634

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Zhang, Zhimin…[et al.]. Monocular weakly supervised depth and pose estimation method based on multi-information fusion. Kuwait Journal of Science Vol. 49, no. 3 (Jul. 2022), pp.1-23.
https://search.emarefa.net/detail/BIM-1502634

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Zhang, Zhimin& Qiao, Jianzhong& Lin, Shukuan. Monocular weakly supervised depth and pose estimation method based on multi-information fusion. Kuwait Journal of Science. 2022. Vol. 49, no. 3, pp.1-23.
https://search.emarefa.net/detail/BIM-1502634

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Text in English ; abstracts in .

رقم السجل

BIM-1502634