HS-OCPA学习系统设计及其在机器人 姿态平衡控制中的应用

Design of HS-OCPA Bionic Learning System and Applied on Posture Balance Control of Robot

  • 摘要: 针对单层操作条件反射概率自动机的操作行为个数较多的问题,构造了一个层次结构的操作条件反射自动机,简称HS-OCPA仿生自主学习系统. 该系统主要基于Skinner操作条件反射机理和概率自动机进行设计,学习控制不需要系统的模型,在操作行为和系统性能的基础上,采用操作条件反射学习机制实现寻优学习,并利用操作行为的取向信息对操作条件反射学习机制进行调整,最终实现在线搜索最优的控制策略. 理论证明设计的操作条件反射学习机制可以确保学习系统依概率1收敛于最优的行为路径. 应用于两轮机器人姿态平衡控制的仿真和实验结果

     

    Abstract: Aiming at the problem that the numbers of operant actions are more in operant conditioning probabilistic automaton, this paper constructs a hierarchical structural operant conditioning probabilistic automaton, which is called as HS-OCPA bionic autonomous learning system. The HS-OCPA learning system which doesnt require the system model is designed mainly based on Skinner operant conditioning (Skinner OC) mechanism and probabilistic automata (PA). The HS-OCPA learning system uses OC learning mechanism to realize optimizing learning based on operant actions and system performance, and the OC learning mechanism is adjusted by the reorientation information of operant actions. Finally the optimal control strategy is searched on line. The simulation and experiment applied in two-wheeled robot poster balance control both show that the designed HS-OCPA learning system not only has quickly learning velocity but also has strong adaptive ability when numbers of operant actions are more.

     

/

返回文章
返回
Baidu
map