arxiv:2506.06091

MIRIAD: Augmenting LLMs with millions of medical query-response pairs

Published on Jun 6

· Submitted by

salman-abdullah on Jun 9

Upvote

Authors:

Qinyue Zheng ,

Salman Abdullah ,

Cyril Zakka ,

Abstract

MIRIAD, a large-scale, curated medical QA corpus, enhances LLM accuracy and hallucination detection in healthcare applications.

AI-generated summary

LLMs are bound to transform healthcare with advanced decision support and flexible chat assistants. However, LLMs are prone to generate inaccurate medical content. To ground LLMs in high-quality medical knowledge, LLMs have been equipped with external knowledge via RAG, where unstructured medical knowledge is split into small text chunks that can be selectively retrieved and integrated into the LLMs context. Yet, existing RAG pipelines rely on raw, unstructured medical text, which can be noisy, uncurated and difficult for LLMs to effectively leverage. Systematic approaches to organize medical knowledge to best surface it to LLMs are generally lacking. To address these challenges, we introduce MIRIAD, a large-scale, curated corpus of 5,821,948 medical QA pairs, each rephrased from and grounded in a passage from peer-reviewed medical literature using a semi-automated pipeline combining LLM generation, filtering, grounding, and human annotation. Unlike prior medical corpora, which rely on unstructured text, MIRIAD encapsulates web-scale medical knowledge in an operationalized query-response format, which enables more targeted retrieval. Experiments on challenging medical QA benchmarks show that augmenting LLMs with MIRIAD improves accuracy up to 6.7% compared to unstructured RAG baselines with the same source corpus and with the same amount of retrieved text. Moreover, MIRIAD improved the ability of LLMs to detect medical hallucinations by 22.5 to 37% (increase in F1 score). We further introduce MIRIAD-Atlas, an interactive map of MIRIAD spanning 56 medical disciplines, enabling clinical users to visually explore, search, and refine medical knowledge. MIRIAD promises to unlock a wealth of down-stream applications, including medical information retrievers, enhanced RAG applications, and knowledge-grounded chat interfaces, which ultimately enables more reliable LLM applications in healthcare.

View arXiv page View PDF Project page GitHub repository Add to collection

Community

salman-abdullah

Paper author Paper submitter 6 days ago

Massive-scale dataset of 5.8M+ synthetic, medical QA pairs that unlocks a myriad downstream uses: medical information retrievers, enhanced RAG applications, and knowledge-grounded chat interfaces.

librarian-bot

5 days ago

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

The following papers were recommended by the Semantic Scholar API

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any Paper on Hugging Face checkout this Space

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2506.06091 in a model README.md to link it from this page.

MIRIAD: Augmenting LLMs with millions of medical query-response pairs

Abstract

Community

Models citing this paper 0

Datasets citing this paper 2

Spaces citing this paper 1

Collections including this paper 2