Deep Ensemble Reinforcement Learning with Multiple Deep Deterministic Policy Gradient Algorithm

تاريخ النشر

2020-01-22

دولة النشر

مصر

عدد الصفحات

التخصصات الرئيسية

هندسة مدنية

الملخص EN

Deep deterministic policy gradient algorithm operating over continuous space of actions has attracted great attention for reinforcement learning.

However, the exploration strategy through dynamic programming within the Bayesian belief state space is rather inefficient even for simple systems.

Another problem is the sequential and iterative training data with autonomous vehicles subject to the law of causality, which is against the i.i.d.

(independent identically distributed) data assumption of the training samples.

This usually results in failure of the standard bootstrap when learning an optimal policy.

In this paper, we propose a framework of m-out-of-n bootstrapped and aggregated multiple deep deterministic policy gradient to accelerate the training process and increase the performance.

Experiment results on the 2D robot arm game show that the reward gained by the aggregated policy is 10%–50% better than those gained by subpolicies.

Experiment results on the open racing car simulator (TORCS) demonstrate that the new algorithm can learn successful control policies with less training time by 56.7%.

Analysis on convergence is also given from the perspective of probability and statistics.

These results verify that the proposed method outperforms the existing algorithms in both efficiency and performance.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Wu, Junta& Li, Huiyun. 2020. Deep Ensemble Reinforcement Learning with Multiple Deep Deterministic Policy Gradient Algorithm. Mathematical Problems in Engineering،Vol. 2020, no. 2020, pp.1-12.
https://search.emarefa.net/detail/BIM-1195068

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Wu, Junta& Li, Huiyun. Deep Ensemble Reinforcement Learning with Multiple Deep Deterministic Policy Gradient Algorithm. Mathematical Problems in Engineering No. 2020 (2020), pp.1-12.
https://search.emarefa.net/detail/BIM-1195068

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Wu, Junta& Li, Huiyun. Deep Ensemble Reinforcement Learning with Multiple Deep Deterministic Policy Gradient Algorithm. Mathematical Problems in Engineering. 2020. Vol. 2020, no. 2020, pp.1-12.
https://search.emarefa.net/detail/BIM-1195068

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-1195068

حفظتم الحفظ طباعة

قاعدة معامل التأثير والاستشهادات المرجعية العربي "ارسيف Arcif"

أضخم قاعدة بيانات عربية للاستشهادات المرجعية للمجلات العلمية المحكمة الصادرة في العالم العربي

مرصد "معرفة"
لقياس الإنتاج العلمي العربي

تقوم هذه الخدمة بالتحقق من التشابه أو الانتحال في الأبحاث والمقالات العلمية والأطروحات الجامعية والكتب والأبحاث باللغة العربية، وتحديد درجة التشابه أو أصالة الأعمال البحثية وحماية ملكيتها الفكرية. تعرف اكثر