Two-Phase Iteration for Value Function Approximation and Hyperparameter Optimization in Gaussian-Kernel-Based Adaptive Critic Design
المؤلفون المشاركون
Chen, Xin
Xie, Penghuan
Xiong, Yonghua
He, Yong
Wu, Min
المصدر
Mathematical Problems in Engineering
العدد
المجلد 2015، العدد 2015 (31 ديسمبر/كانون الأول 2015)، ص ص. 1-14، 14ص.
الناشر
Hindawi Publishing Corporation
تاريخ النشر
2015-09-16
دولة النشر
مصر
عدد الصفحات
14
التخصصات الرئيسية
الملخص EN
Adaptive Dynamic Programming (ADP) with critic-actor architecture is an effective way to perform online learning control.
To avoid the subjectivity in the design of a neural network that serves as a critic network, kernel-based adaptive critic design (ACD) was developed recently.
There are two essential issues for a static kernel-based model: how to determine proper hyperparameters in advance and how to select right samples to describe the value function.
They all rely on the assessment of sample values.
Based on the theoretical analysis, this paper presents a two-phase simultaneous learning method for a Gaussian-kernel-based critic network.
It is able to estimate the values of samples without infinitively revisiting them.
And the hyperparameters of the kernel model are optimized simultaneously.
Based on the estimated sample values, the sample set can be refined by adding alternatives or deleting redundances.
Combining this critic design with actor network, we present a Gaussian-kernel-based Adaptive Dynamic Programming (GK-ADP) approach.
Simulations are used to verify its feasibility, particularly the necessity of two-phase learning, the convergence characteristics, and the improvement of the system performance by using a varying sample set.
نمط استشهاد جمعية علماء النفس الأمريكية (APA)
Chen, Xin& Xie, Penghuan& Xiong, Yonghua& He, Yong& Wu, Min. 2015. Two-Phase Iteration for Value Function Approximation and Hyperparameter Optimization in Gaussian-Kernel-Based Adaptive Critic Design. Mathematical Problems in Engineering،Vol. 2015, no. 2015, pp.1-14.
https://search.emarefa.net/detail/BIM-1074661
نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)
Chen, Xin…[et al.]. Two-Phase Iteration for Value Function Approximation and Hyperparameter Optimization in Gaussian-Kernel-Based Adaptive Critic Design. Mathematical Problems in Engineering No. 2015 (2015), pp.1-14.
https://search.emarefa.net/detail/BIM-1074661
نمط استشهاد الجمعية الطبية الأمريكية (AMA)
Chen, Xin& Xie, Penghuan& Xiong, Yonghua& He, Yong& Wu, Min. Two-Phase Iteration for Value Function Approximation and Hyperparameter Optimization in Gaussian-Kernel-Based Adaptive Critic Design. Mathematical Problems in Engineering. 2015. Vol. 2015, no. 2015, pp.1-14.
https://search.emarefa.net/detail/BIM-1074661
نوع البيانات
مقالات
لغة النص
الإنجليزية
الملاحظات
Includes bibliographical references
رقم السجل
BIM-1074661
قاعدة معامل التأثير والاستشهادات المرجعية العربي "ارسيف Arcif"
أضخم قاعدة بيانات عربية للاستشهادات المرجعية للمجلات العلمية المحكمة الصادرة في العالم العربي
تقوم هذه الخدمة بالتحقق من التشابه أو الانتحال في الأبحاث والمقالات العلمية والأطروحات الجامعية والكتب والأبحاث باللغة العربية، وتحديد درجة التشابه أو أصالة الأعمال البحثية وحماية ملكيتها الفكرية. تعرف اكثر