Accelerated decomposition techniques for large discounted Markov decision processes

Larach, Abdelhadi; Chafik, S.; Daoui, C.

رقم المقالة : 676717 زيارة : 121 الصفحة: 0 - 0

نوع المخطوط: ابحاث

Accelerated decomposition techniques for large discounted Markov decision processes

الموضوعات :

Abdelhadi Larach ¹ , S. Chafik ² , C. Daoui ³

1 - Faculty of Sciences and Techniques, Laboratory of Information Processing and Decision Support, Sultan Moulay Slimane University, B.P. 523, Benimellal, Morocco
2 - Faculty of Sciences and Techniques, Laboratory of Information Processing and Decision Support, Sultan Moulay Slimane University, B.P. 523, Benimellal, Morocco
3 - Faculty of Sciences and Techniques, Laboratory of Information Processing and Decision Support, Sultan Moulay Slimane University, B.P. 523, Benimellal, Morocco

تاريخ الإرسال : 18 الإثنين , صفر, 1442 تاريخ التأكيد : 18 الإثنين , صفر, 1442 تاريخ الإصدار : 13 الجمعة , ربيع الأول, 1439

الکلمات المفتاحية: Decomposition, Markov decision process Graph theory , Tarjan’s algorithm Strongly connected components ,

ملخص المقالة :

Many hierarchical techniques to solve large Markov decision processes (MDPs) are based on the partition of the state space into strongly connected components (SCCs) that can be classified into some levels. In each level, smaller problems named restricted MDPs are solved, and then these partial solutions are combined to obtain the global solution. In this paper, we first propose a novel algorithm, which is a variant of Tarjan’s algorithm that simultaneously finds the SCCs and their belonging levels. Second, a new definition of the restricted MDPs is presented to ameliorate some hierarchical solutions in discounted MDPs using value iteration (VI) algorithm based on a list of state-action successors. Finally, a robotic motion-planning example and the experiment results are presented to illustrate the benefit of the proposed decomposition algorithms.

المصادر:

شارک

عنوان URL للمقالة

Accelerated decomposition techniques for large discounted Markov decision processes

سند

الروابط

المراكز ذات الصلة

دعامة

الصفحات الرسمية