Upload README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,105 @@
|
|
1 |
-
---
|
2 |
-
license: apache-2.0
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
pipeline_tag: image-text-to-text
|
4 |
+
base_model:
|
5 |
+
- epfl-llm/meditron-7b
|
6 |
+
- microsoft/rad-dino
|
7 |
+
base_model_relation: merge
|
8 |
+
library_name: transformers
|
9 |
+
tags:
|
10 |
+
- RRG
|
11 |
+
- Radiology Report Generation
|
12 |
+
- Chest X-ray
|
13 |
+
- Multimodal Large Language Models
|
14 |
+
---
|
15 |
+
<br>
|
16 |
+
|
17 |
+
# **Libra Model Card**
|
18 |
+
|
19 |
+
**Version**: Libra-v1.0
|
20 |
+
|
21 |
+
## Overview
|
22 |
+
|
23 |
+
**Libra** is a multimodal Large Language Model (LLM) specialized in **radiology report generation**, particularly **chest X-ray** interpretations. It can produce detailed _Findings_ sections with **temporal comparisons** (e.g., comparing a current chest X-ray with prior ones). Libra integrates the following key components:
|
24 |
+
|
25 |
+
- **RAD-DINO**: A vision encoder pre-trained on medical imaging datasets for robust feature extraction from chest X-rays.
|
26 |
+
- **Meditron-7B**: A 7B-parameter large language model (based on Llama-2) specialized in medical text generation.
|
27 |
+
- **Temporal Alignment Connector (TAC)**: A custom adapter that fuses features across multiple time points to enable temporal comparisons.
|
28 |
+
|
29 |
+
This model card provides an overview of Libra’s architecture, training methodology, limitations, and recommended usage guidelines.
|
30 |
+
|
31 |
+
---
|
32 |
+
|
33 |
+
## Paper and Resources
|
34 |
+
|
35 |
+
For more detailed information regarding Libra’s methodology, theoretical foundation, and performance benchmarks, please refer to the following resources:
|
36 |
+
|
37 |
+
- **Project Website**: [Libra v1.0](https://x-izhang.github.io/Libra_v1.0/)
|
38 |
+
- **Paper**: [arXiv:2411.19378](https://arxiv.org/abs/2411.19378)
|
39 |
+
- **Code Repository**: [X-iZhang/Libra (GitHub)](https://github.com/X-iZhang/Libra)
|
40 |
+
|
41 |
+
Or check out our Spaces demo! [](https://huggingface.co/spaces/X-iZhang/Libra)
|
42 |
+
|
43 |
+
|
44 |
+
---
|
45 |
+
|
46 |
+
## Training Strategy
|
47 |
+
|
48 |
+
Libra is trained in a **two-stage process**:
|
49 |
+
|
50 |
+
1. **Temporal Feature Alignment**
|
51 |
+
- Trains TAC to effectively fuse and align features from different time points (current and previous chest X-rays).
|
52 |
+
- Focuses on capturing notable changes (e.g., appearance or progression of opacities, devices, and lines).
|
53 |
+
|
54 |
+
2. **Fine-Tuning for Radiology Report Generation**
|
55 |
+
- The language model part is fine-tuned on a large dataset of paired chest X-ray images and radiology reports.
|
56 |
+
- Emphasizes the generation of the _Findings_ section, especially incorporating temporal descriptors.
|
57 |
+
|
58 |
+
---
|
59 |
+
|
60 |
+
## Intended Use
|
61 |
+
|
62 |
+
Libra is primarily designed to **assist** clinical practitioners, researchers, and medical students in generating chest X-ray reports. Key applications include:
|
63 |
+
|
64 |
+
- **Clinical Decision Support**: Providing draft findings that can be refined by a radiologist.
|
65 |
+
- **Educational Tool**: Demonstrating example interpretations and temporal changes for training radiology residents.
|
66 |
+
- **Research**: Facilitating studies on automated report generation and temporal feature learning in medical imaging.
|
67 |
+
|
68 |
+
> **Important**: Outputs should be reviewed by qualified radiologists or medical professionals before final clinical decisions are made.
|
69 |
+
|
70 |
+
---
|
71 |
+
|
72 |
+
## Limitations and Recommendations
|
73 |
+
|
74 |
+
1. **Data Bias**: The model’s performance may be less reliable for underrepresented demographics or rare pathologies.
|
75 |
+
2. **Clinical Oversight**: Always involve a medical professional to verify the results—Libra is not a substitute for professional judgment.
|
76 |
+
3. **Temporal Inaccuracies**: Despite TAC’s focus on temporal alignment, subtle or uncommon changes may go unrecognized.
|
77 |
+
4. **Generalization**: Libra’s performance on chest X-ray types or conditions not seen during training may be limited.
|
78 |
+
|
79 |
+
---
|
80 |
+
|
81 |
+
## Ethical Considerations
|
82 |
+
|
83 |
+
- **Patient Privacy**: Ensure the data is fully de-identified and compliant with HIPAA/GDPR (or relevant privacy regulations).
|
84 |
+
- **Responsible Use**: Deploy Libra’s outputs carefully; they are not guaranteed to be error-free.
|
85 |
+
- **Accountability**: Users and organizations must assume responsibility for verifying clinical accuracy and safety.
|
86 |
+
|
87 |
+
---
|
88 |
+
|
89 |
+
## How to Cite ✒️
|
90 |
+
|
91 |
+
If you use Libra in academic or research contexts, please cite:
|
92 |
+
|
93 |
+
```bibtex
|
94 |
+
@misc{zhang2024libraleveragingtemporalimages,
|
95 |
+
title={Libra: Leveraging Temporal Images for Biomedical Radiology Analysis},
|
96 |
+
author={Xi Zhang and Zaiqiao Meng and Jake Lever and Edmond S. L. Ho},
|
97 |
+
year={2024},
|
98 |
+
eprint={2411.19378},
|
99 |
+
archivePrefix={arXiv},
|
100 |
+
primaryClass={cs.CV},
|
101 |
+
url={https://arxiv.org/abs/2411.19378},
|
102 |
+
}
|
103 |
+
```
|
104 |
+
## Disclaimer:
|
105 |
+
This tool is for research and educational purposes only. It is not FDA-approved or CE-marked for clinical use. Users should consult qualified healthcare professionals for any clinical decisions.
|