Chest X-Ray Pneumonia Detection Model
A robust deep learning system for automated pneumonia detection in chest radiographs, featuring comprehensive external validation and clinical-grade performance metrics.
π― Model Overview
This model implements a binary classification system designed to identify pneumonia in chest X-ray images. Built on MobileNetV2 architecture with transfer learning, the system has undergone rigorous external validation on 485 independent samples, demonstrating strong clinical applicability and generalization capabilities.
Key Performance Highlights
- External Validation Accuracy: 86.0% on 485 independent samples
- Clinical Sensitivity: 96.4% - optimal for screening applications
- Robust Generalization: Validated on completely unseen data from independent sources
- Production Ready: Comprehensive evaluation with detailed performance analysis
π Performance Metrics
Validation Results Comparison
Performance Metric | Internal Validation | External Validation | Clinical Assessment |
---|---|---|---|
Accuracy | 94.8% | 86.0% | Excellent generalization (8.8% variance) |
Sensitivity (Recall) | 89.6% | 96.4% | Outstanding screening capability |
Specificity | 100.0% | 74.8% | Acceptable false positive management |
Precision (PPV) | 100.0% | 80.4% | Strong positive predictive value |
F1-Score | 94.5% | 87.7% | Well-balanced performance profile |
External Validation Dataset
- Sample Size: 485 radiographs (234 normal, 251 pneumonia cases)
- Data Source: Independent pneumonia radiography dataset
- Validation Method: Complete external testing on previously unseen data
- Statistical Significance: Large sample size ensures reliable performance estimates
π¬ Clinical Significance
Screening Applications
The model's 96.4% sensitivity makes it particularly suitable for pneumonia screening workflows, where missing positive cases carries high clinical risk. The balanced performance profile supports its use as a clinical decision support tool.
Generalization Capability
With only an 8.8% accuracy decrease from internal to external validation, the model demonstrates robust learning patterns that generalize well across different data sources and imaging protocols.
π Implementation Guide
Quick Start Example
import tensorflow as tf
from tensorflow.keras.preprocessing import image
import numpy as np
# Load the pre-trained model from Hugging Face Hub
from huggingface_hub import hf_hub_download
# Download model from Hugging Face Hub
model_path = hf_hub_download(
repo_id="ayushirathour/chest-xray-pneumonia-detection",
filename="best_chest_xray_model.h5"
)
model = tf.keras.models.load_model(model_path)
def predict_pneumonia(img_path):
"""
Predict pneumonia from chest X-ray image
Args:
img_path (str): Path to chest X-ray image
Returns:
dict: Prediction results with confidence scores
"""
# Load and preprocess image
img = image.load_img(img_path, target_size=(224, 224))
img_array = image.img_to_array(img) / 255.0
img_array = np.expand_dims(img_array, axis=0)
# Generate prediction
prediction = model.predict(img_array)[0][0]
# Interpret results
if prediction > 0.5:
result = {
'diagnosis': 'PNEUMONIA',
'confidence': f"{prediction:.1%}",
'recommendation': 'Clinical review recommended'
}
else:
result = {
'diagnosis': 'NORMAL',
'confidence': f"{1-prediction:.1%}",
'recommendation': 'No pneumonia indicators detected'
}
return result
# Example usage
results = predict_pneumonia("chest_xray_sample.jpg")
print(f"Diagnosis: {results['diagnosis']}")
print(f"Confidence: {results['confidence']}")
print(f"Recommendation: {results['recommendation']}")
Model Architecture Details
- Base Architecture: MobileNetV2 with transfer learning optimization
- Input Specifications: 224Γ224 pixel RGB chest X-ray images
- Output Format: Binary classification probabilities (Normal/Pneumonia)
- Framework: TensorFlow 2.x / Keras
- Model Size: Optimized for clinical deployment scenarios
π Performance Visualizations
External Validation Results
Detailed classification results with percentage breakdown
Internal vs External validation performance comparison
Clinical balance optimization for screening applications
Balanced external validation dataset distribution
π Clinical Applications
Primary Use Cases
- Pneumonia Screening Programs: High-sensitivity detection for population screening
- Clinical Decision Support: Augmenting radiologist workflow with AI insights
- Triage Optimization: Prioritizing cases requiring urgent clinical attention
- Medical Education: Demonstrating AI validation methodologies in healthcare
Implementation Considerations
- Screening Focus: Optimized for high sensitivity to minimize missed diagnoses
- Clinical Oversight: Designed to support, not replace, professional medical judgment
- Quality Assurance: Comprehensive validation ensures reliable performance metrics
β οΈ Usage Guidelines & Limitations
Clinical Limitations
- Diagnostic Support Only: Not intended as a standalone diagnostic tool
- Professional Supervision Required: All results require clinical interpretation
- False Positive Management: 25.2% false positive rate necessitates clinical review
- Population Considerations: Performance may vary across different demographic groups
Technical Considerations
- Dataset Scope: Trained on specific chest X-ray imaging protocols
- Input Requirements: Optimal performance requires standard posteroanterior chest radiographs
- Quality Dependencies: Image quality significantly impacts prediction accuracy
π Dataset & Training Information
Training Dataset
- Primary Source: Kaggle Chest X-ray Dataset (carefully balanced subset)
- Preprocessing Pipeline: Standardized resizing, normalization, and augmentation
- Quality Control: Systematic filtering for optimal training data quality
External Validation Protocol
- Independent Dataset: 485 samples from completely separate data source
- Balanced Composition: 234 normal cases, 251 pneumonia cases
- Validation Rigor: Zero data leakage between training and validation sets
π Repository Contents
File | Description |
---|---|
best_chest_xray_model.h5 |
Production-ready trained Keras model |
comprehensive_external_validation_results.csv |
Detailed performance metrics and analysis |
classification_report.csv |
Complete sklearn classification report |
*.png |
Professional visualization suite (8 comprehensive charts) |
π Citation & Attribution
If this model contributes to your research or clinical work, please cite:
@misc{rathour2025chestxray,
title={Chest X-Ray Pneumonia Detection: Externally Validated Deep Learning System},
author={Rathour, Ayushi},
year={2025},
note={External validation study on 485 independent samples with clinical performance analysis},
url={https://huggingface.co/ayushirathour/chest-xray-pneumonia-detection}
}
π©βπ» Author & Contact
Ayushi Rathour - Biotechnology Graduate | Exploring AI in Healthcare
- π GitHub: @ayushirathour
- πΌ LinkedIn: Ayushi Rathour
- π§ Email: [email protected]
π₯ Advancing Medical AI Through Rigorous Validation
This model exemplifies the critical importance of external validation in medical artificial intelligence, achieving clinical-grade performance through systematic methodology, comprehensive evaluation, and transparent reporting of both capabilities and limitations.
License: MIT | Tags: medical, chest-xray, pneumonia-detection, healthcare, computer-vision, keras
- Downloads last month
- -
Evaluation results
- External Validation Accuracy on External Validation Datasetself-reported0.860
- Sensitivity on External Validation Datasetself-reported0.964
- Specificity on External Validation Datasetself-reported0.748