HELSA: Hierarchical Reinforcement Learning with Spatiotemporal Abstraction for Large-Scale Multi-Agent Path Finding

Zhaoyi Song, Rongqing Zhang, Xiang Cheng

October, 2023

IROS 2023 Poster

Abstract

The Multi-Agent Path Finding (MAPF) problem is a critical challenge in dynamic multi-robot systems. Recent studies have shown that multi-agent reinforcement learning (MARL) is a promising approach to solving MAPF problems in a fully decentralized manner. However, as the size of the multi-robot system increases, sample inefficiency becomes a major impediment to learning-based methods. This paper presents a hierarchical reinforcement learning (HRL) framework for large-scale multi-agent path finding that applies spatial and temporal abstraction to capture intermediate rewards and thus encourage efficient exploration. Specifically, we introduce a meta controller that partitions the map into interconnected regions and optimizes agents’ region-wise paths towards globally better solutions. Additionally, we design a lower-level controller that efficiently solves each sub-problem by combining heuristic guidance and an inter-agent communication mechanism with RL-based policies. Our empirical results on test instances of various scales demonstrate that our method outperforms existing approaches in terms of both success rate and makespan.
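The spatial abstraction described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the fixed-size square blocks, the function names (`partition`, `region_graph`, `region_path`), and the use of breadth-first search in place of the learned meta controller are all assumptions made for illustration. The sketch only shows what a region-wise path over a partitioned grid map might look like.

```python
from collections import deque


def partition(grid, block):
    """Map each free cell (row, col) to a region id using square blocks.

    Assumption: regions are fixed-size blocks; 0 marks a free cell, 1 an obstacle.
    """
    cols = (len(grid[0]) + block - 1) // block  # number of blocks per row
    return {
        (r, c): (r // block) * cols + (c // block)
        for r, row in enumerate(grid)
        for c, cell in enumerate(row)
        if cell == 0
    }


def region_graph(region_of):
    """Connect two regions when they share a pair of adjacent free cells."""
    graph = {rid: set() for rid in region_of.values()}
    for (r, c), rid in region_of.items():
        for neighbor in ((r + 1, c), (r, c + 1)):
            other = region_of.get(neighbor)  # None for obstacles or off-map cells
            if other is not None and other != rid:
                graph[rid].add(other)
                graph[other].add(rid)
    return graph


def region_path(graph, start, goal):
    """Breadth-first search over regions (a stand-in for a learned meta policy).

    Returns a list of region ids from start to goal, or None if unreachable.
    """
    parent = {start: None}
    queue = deque([start])
    while queue:
        current = queue.popleft()
        if current == goal:
            path = []
            while current is not None:
                path.append(current)
                current = parent[current]
            return path[::-1]
        for nxt in sorted(graph[current]):
            if nxt not in parent:
                parent[nxt] = current
                queue.append(nxt)
    return None
```

In such a setup, a lower-level controller would then only need to move each agent from its current region to the next region on the returned path, which is the kind of sub-problem the abstract refers to.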

Type

Conference paper

Publication

In The 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2023)

Zhaoyi Song

Master’s Student in Software Engineering