X-iZhang commited on
Commit
92460a9
·
verified ·
1 Parent(s): ea8f7b3

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +105 -3
README.md CHANGED
@@ -1,3 +1,105 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ pipeline_tag: image-text-to-text
4
+ base_model:
5
+ - epfl-llm/meditron-7b
6
+ - microsoft/rad-dino
7
+ base_model_relation: merge
8
+ library_name: transformers
9
+ tags:
10
+ - RRG
11
+ - Radiology Report Generation
12
+ - Chest X-ray
13
+ - Multimodal Large Language Models
14
+ ---
15
+ <br>
16
+
17
+ # **Libra Model Card**
18
+
19
+ **Version**: Libra-v1.0
20
+
21
+ ## Overview
22
+
23
+ **Libra** is a multimodal Large Language Model (LLM) specialized in **radiology report generation**, particularly **chest X-ray** interpretations. It can produce detailed _Findings_ sections with **temporal comparisons** (e.g., comparing a current chest X-ray with prior ones). Libra integrates the following key components:
24
+
25
+ - **RAD-DINO**: A vision encoder pre-trained on medical imaging datasets for robust feature extraction from chest X-rays.
26
+ - **Meditron-7B**: A 7B-parameter large language model (based on Llama-2) specialized in medical text generation.
27
+ - **Temporal Alignment Connector (TAC)**: A custom adapter that fuses features across multiple time points to enable temporal comparisons.
28
+
29
+ This model card provides an overview of Libra’s architecture, training methodology, limitations, and recommended usage guidelines.
30
+
31
+ ---
32
+
33
+ ## Paper and Resources
34
+
35
+ For more detailed information regarding Libra’s methodology, theoretical foundation, and performance benchmarks, please refer to the following resources:
36
+
37
+ - **Project Website**: [Libra v1.0](https://x-izhang.github.io/Libra_v1.0/)
38
+ - **Paper**: [arXiv:2411.19378](https://arxiv.org/abs/2411.19378)
39
+ - **Code Repository**: [X-iZhang/Libra (GitHub)](https://github.com/X-iZhang/Libra)
40
+
41
+ Or check out our Spaces demo! [![Open in Spaces](https://huggingface.co/datasets/huggingface/badges/resolve/main/open-in-hf-spaces-md-dark.svg)](https://huggingface.co/spaces/X-iZhang/Libra)
42
+
43
+
44
+ ---
45
+
46
+ ## Training Strategy
47
+
48
+ Libra is trained in a **two-stage process**:
49
+
50
+ 1. **Temporal Feature Alignment**
51
+ - Trains TAC to effectively fuse and align features from different time points (current and previous chest X-rays).
52
+ - Focuses on capturing notable changes (e.g., appearance or progression of opacities, devices, and lines).
53
+
54
+ 2. **Fine-Tuning for Radiology Report Generation**
55
+ - The language model part is fine-tuned on a large dataset of paired chest X-ray images and radiology reports.
56
+ - Emphasizes the generation of the _Findings_ section, especially incorporating temporal descriptors.
57
+
58
+ ---
59
+
60
+ ## Intended Use
61
+
62
+ Libra is primarily designed to **assist** clinical practitioners, researchers, and medical students in generating chest X-ray reports. Key applications include:
63
+
64
+ - **Clinical Decision Support**: Providing draft findings that can be refined by a radiologist.
65
+ - **Educational Tool**: Demonstrating example interpretations and temporal changes for training radiology residents.
66
+ - **Research**: Facilitating studies on automated report generation and temporal feature learning in medical imaging.
67
+
68
+ > **Important**: Outputs should be reviewed by qualified radiologists or medical professionals before final clinical decisions are made.
69
+
70
+ ---
71
+
72
+ ## Limitations and Recommendations
73
+
74
+ 1. **Data Bias**: The model’s performance may be less reliable for underrepresented demographics or rare pathologies.
75
+ 2. **Clinical Oversight**: Always involve a medical professional to verify the results—Libra is not a substitute for professional judgment.
76
+ 3. **Temporal Inaccuracies**: Despite TAC’s focus on temporal alignment, subtle or uncommon changes may go unrecognized.
77
+ 4. **Generalization**: Libra’s performance on chest X-ray types or conditions not seen during training may be limited.
78
+
79
+ ---
80
+
81
+ ## Ethical Considerations
82
+
83
+ - **Patient Privacy**: Ensure the data is fully de-identified and compliant with HIPAA/GDPR (or relevant privacy regulations).
84
+ - **Responsible Use**: Deploy Libra’s outputs carefully; they are not guaranteed to be error-free.
85
+ - **Accountability**: Users and organizations must assume responsibility for verifying clinical accuracy and safety.
86
+
87
+ ---
88
+
89
+ ## How to Cite ✒️
90
+
91
+ If you use Libra in academic or research contexts, please cite:
92
+
93
+ ```bibtex
94
+ @misc{zhang2024libraleveragingtemporalimages,
95
+ title={Libra: Leveraging Temporal Images for Biomedical Radiology Analysis},
96
+ author={Xi Zhang and Zaiqiao Meng and Jake Lever and Edmond S. L. Ho},
97
+ year={2024},
98
+ eprint={2411.19378},
99
+ archivePrefix={arXiv},
100
+ primaryClass={cs.CV},
101
+ url={https://arxiv.org/abs/2411.19378},
102
+ }
103
+ ```
104
+ ## Disclaimer:
105
+ This tool is for research and educational purposes only. It is not FDA-approved or CE-marked for clinical use. Users should consult qualified healthcare professionals for any clinical decisions.