Multi-Agent Collaborative Target Search Based on the Multi-Agent Deep Deterministic Policy Gradient with Emotional Intrinsic Motivation

Multi-agent collaborative target search is one of the main challenges in the multi-agent field, and deep reinforcement learning (DRL) is a good way to learn such a task. However, DRL always faces the problem of sparse reward, which to some extent reduces its efficiency in task learning. Introducing intrinsic motivation has proved to be a useful way to make the sparse reward in DRL. So, based on the multi-agent deep deterministic policy gradient (MADDPG) structure, a new MADDPG algorithm with the emotional intrinsic motivation name MADDPG-E is proposed in this paper for the multi-agent collaborative target search. In MADDPG-E, a new emotional intrinsic motivation module with three emotions, joy, sadness, and fear, is designed. The three emotions are defined by corresponding psychological knowledge to the multi-agent embodied situations in an environment. An emotional steady-state variable function H is then designed to help judge the goodness of the emotions. Based on H, an emotion-based intrinsic reward function is finally proposed. With the designed emotional intrinsic motivation module, the multi-agent system always tries to make itself joy, which means it always learns to search the target. To show the effectiveness of the proposed MADDPG-E algorithm, two kinds of simulation experiments with a determined initial position and random initial position, respectively, are carried out, and comparisons are performed with MADDPG as well as MADDPG-ICM (MADDPG with an intrinsic curiosity module). The results show that with the designed emotional intrinsic motivation module, MADDPG-E has a higher learning speed and better learning stability, and the advantage is more obvious when facing complex situations.

Description

Peer reviewed: True

Publication status: Published

Keywords

multi-agent collaboration, intrinsic motivation, MADDPG, emotion, deep reinforcement learning

Journal Title

Applied Sciences Switzerland

Journal ISSN

2076-3417
2076-3417

Volume Title

13

Publisher

MDPI AG

Publisher DOI

https://doi.org/10.3390/app132111951

Rights and licensing

Except where otherwised noted, this item's license is described as Attribution 4.0 International

Sponsorship

Natural Science Foundation of China (61903006)
Beijing Natural Science Foundation (4202022, 4204096)
R&D Program of Beijing Municipal Education Commission (KM202210009012)
Beijing Municipal Great Wall Scholar Program (CIT&TCD 20190304)
China Scholarship Council and Beijing Association for Science and Technology Young Talent Promotion Project (2023YZZKYO3)

Collections

Jisc Publications Router