We will be undertaking essential maintenance work on Apollo's infrastructure on Thursday 14 August and Friday 15 August, therefore expect intermittent access to Apollo's content and search interface during that time. Please also note that Apollo's "Request a copy" service will be temporarily disabled while we undertake this work.
Repository logo
 

Towards a Competitive 3-Player Mahjong AI using Deep Reinforcement Learning

Accepted version
Peer-reviewed

Loading...
Thumbnail Image

Change log

Abstract

Mahjong is a multi-player imperfect-information game with challenging features for AI research. Sanma, being a 3-player variant of Japanese Riichi Mahjong, possesses unique characteristics and a more aggressive playing style than the 4- player game. It is thus challenging and of research interest in its own right, but has not been explored. We present Meowjong, the first ever AI for Sanma using deep reinforcement learning (RL). We define a 2-dimensional data structure for encoding the observable information in a game. We pre-train 5 convolutional neural networks (CNNs) for Sanma’s 5 actions—discard, Pon, Kan, Kita and Riichi, and enhance the major (discard) action’s model via self-play reinforcement learning. Meowjong demon- strates potential for becoming the state-of-the-art in Sanma, by achieving test accuracies comparable with AIs for 4-player Mahjong through supervised learning, and gaining a significant further enhancement from reinforcement learning.

Description

Keywords

Journal Title

Conference Name

2022 IEEE Conference on Games

Journal ISSN

Volume Title

Publisher

Rights and licensing

Except where otherwised noted, this item's license is described as Attribution 4.0 International