Repository logo
 

Democratizing cost-effective, agentic artificial intelligence to multilingual medical summarization through knowledge distillation

Accepted version
Peer-reviewed

Loading...
Thumbnail Image

Change log

Abstract

The increasing demand for multilingual capabilities in healthcare technology highlights the critical need for AI solutions capable of handling underrepresented languages, such as Arabic, in clinical documentation. Arabic’s unique linguistic complexities—morphological richness, syntactic variations, and diglossia—present significant challenges for foundational large language models (LLMs), especially in domain-specific tasks like medical summarization. This study introduces AraSum, a domain-specific AI agent built using a novel knowledge distillation framework that transforms large multilingual LLMs into lightweight, task-optimized small language models (SLMs). Leveraging a synthetic dataset of Arabic medical dialogues, AraSum demonstrates superior performance over JAIS-30B, a foundational Arabic LLM, across key evaluation metrics, including BLEU and ROUGE scores. AraSum also outperforms JAIS in Arabic-speaking evaluator assessments of accuracy, comprehensiveness, and clinical utility while maintaining comparable linguistic performance as measured by a modified PDQI-9 inventory. Beyond accuracy, AraSum achieves these results with significantly lower computational and environmental costs, demonstrating the feasibility of deploying resource-efficient AI models in low-resource settings for domain-specific tasks. This work underscores the potential of SLM-based agentic architectures for advancing multilingual healthcare, encouraging sustainable artificial intelligence, and fostering equity in access to care.

Description

Keywords

Journal Title

Scientific Reports

Conference Name

Journal ISSN

2045-2322
2045-2322

Volume Title

Publisher

Nature Portfolio

Rights and licensing

Except where otherwised noted, this item's license is described as Attribution 4.0 International
Sponsorship
This project was supported computationally by Microsoft for Startups, Google, and Amazon Web Services.