Chest X-Ray Pneumonia Detection Model

A robust deep learning system for automated pneumonia detection in chest radiographs, featuring comprehensive external validation and clinical-grade performance metrics.

🎯 Model Overview

This model implements a binary classification system designed to identify pneumonia in chest X-ray images. Built on MobileNetV2 architecture with transfer learning, the system has undergone rigorous external validation on 485 independent samples, demonstrating strong clinical applicability and generalization capabilities.

Key Performance Highlights

  • External Validation Accuracy: 86.0% on 485 independent samples
  • Clinical Sensitivity: 96.4% - optimal for screening applications
  • Robust Generalization: Validated on completely unseen data from independent sources
  • Production Ready: Comprehensive evaluation with detailed performance analysis

πŸ“Š Performance Metrics

Validation Results Comparison

Performance Metric Internal Validation External Validation Clinical Assessment
Accuracy 94.8% 86.0% Excellent generalization (8.8% variance)
Sensitivity (Recall) 89.6% 96.4% Outstanding screening capability
Specificity 100.0% 74.8% Acceptable false positive management
Precision (PPV) 100.0% 80.4% Strong positive predictive value
F1-Score 94.5% 87.7% Well-balanced performance profile

External Validation Dataset

  • Sample Size: 485 radiographs (234 normal, 251 pneumonia cases)
  • Data Source: Independent pneumonia radiography dataset
  • Validation Method: Complete external testing on previously unseen data
  • Statistical Significance: Large sample size ensures reliable performance estimates

πŸ”¬ Clinical Significance

Screening Applications

The model's 96.4% sensitivity makes it particularly suitable for pneumonia screening workflows, where missing positive cases carries high clinical risk. The balanced performance profile supports its use as a clinical decision support tool.

Generalization Capability

With only an 8.8% accuracy decrease from internal to external validation, the model demonstrates robust learning patterns that generalize well across different data sources and imaging protocols.

πŸš€ Implementation Guide

Quick Start Example

import tensorflow as tf
from tensorflow.keras.preprocessing import image
import numpy as np

# Load the pre-trained model from Hugging Face Hub
from huggingface_hub import hf_hub_download

# Download model from Hugging Face Hub
model_path = hf_hub_download(
    repo_id="ayushirathour/chest-xray-pneumonia-detection",
    filename="best_chest_xray_model.h5"
)
model = tf.keras.models.load_model(model_path)

def predict_pneumonia(img_path):
    """
    Predict pneumonia from chest X-ray image
    
    Args:
        img_path (str): Path to chest X-ray image
    
    Returns:
        dict: Prediction results with confidence scores
    """
    # Load and preprocess image
    img = image.load_img(img_path, target_size=(224, 224))
    img_array = image.img_to_array(img) / 255.0
    img_array = np.expand_dims(img_array, axis=0)
    
    # Generate prediction
    prediction = model.predict(img_array)[0][0]
    
    # Interpret results
    if prediction > 0.5:
        result = {
            'diagnosis': 'PNEUMONIA',
            'confidence': f"{prediction:.1%}",
            'recommendation': 'Clinical review recommended'
        }
    else:
        result = {
            'diagnosis': 'NORMAL',
            'confidence': f"{1-prediction:.1%}",
            'recommendation': 'No pneumonia indicators detected'
        }
    
    return result

# Example usage
results = predict_pneumonia("chest_xray_sample.jpg")
print(f"Diagnosis: {results['diagnosis']}")
print(f"Confidence: {results['confidence']}")
print(f"Recommendation: {results['recommendation']}")

Model Architecture Details

  • Base Architecture: MobileNetV2 with transfer learning optimization
  • Input Specifications: 224Γ—224 pixel RGB chest X-ray images
  • Output Format: Binary classification probabilities (Normal/Pneumonia)
  • Framework: TensorFlow 2.x / Keras
  • Model Size: Optimized for clinical deployment scenarios

πŸ“ˆ Performance Visualizations

External Validation Results

Enhanced Confusion Matrix Detailed classification results with percentage breakdown

Performance Comparison Internal vs External validation performance comparison

Precision-Recall Analysis Clinical balance optimization for screening applications

Class Distribution Balanced external validation dataset distribution

πŸ“‹ Clinical Applications

Primary Use Cases

  1. Pneumonia Screening Programs: High-sensitivity detection for population screening
  2. Clinical Decision Support: Augmenting radiologist workflow with AI insights
  3. Triage Optimization: Prioritizing cases requiring urgent clinical attention
  4. Medical Education: Demonstrating AI validation methodologies in healthcare

Implementation Considerations

  • Screening Focus: Optimized for high sensitivity to minimize missed diagnoses
  • Clinical Oversight: Designed to support, not replace, professional medical judgment
  • Quality Assurance: Comprehensive validation ensures reliable performance metrics

⚠️ Usage Guidelines & Limitations

Clinical Limitations

  • Diagnostic Support Only: Not intended as a standalone diagnostic tool
  • Professional Supervision Required: All results require clinical interpretation
  • False Positive Management: 25.2% false positive rate necessitates clinical review
  • Population Considerations: Performance may vary across different demographic groups

Technical Considerations

  • Dataset Scope: Trained on specific chest X-ray imaging protocols
  • Input Requirements: Optimal performance requires standard posteroanterior chest radiographs
  • Quality Dependencies: Image quality significantly impacts prediction accuracy

πŸ“Š Dataset & Training Information

Training Dataset

  • Primary Source: Kaggle Chest X-ray Dataset (carefully balanced subset)
  • Preprocessing Pipeline: Standardized resizing, normalization, and augmentation
  • Quality Control: Systematic filtering for optimal training data quality

External Validation Protocol

  • Independent Dataset: 485 samples from completely separate data source
  • Balanced Composition: 234 normal cases, 251 pneumonia cases
  • Validation Rigor: Zero data leakage between training and validation sets

πŸ“ Repository Contents

File Description
best_chest_xray_model.h5 Production-ready trained Keras model
comprehensive_external_validation_results.csv Detailed performance metrics and analysis
classification_report.csv Complete sklearn classification report
*.png Professional visualization suite (8 comprehensive charts)

πŸ“š Citation & Attribution

If this model contributes to your research or clinical work, please cite:

@misc{rathour2025chestxray,
  title={Chest X-Ray Pneumonia Detection: Externally Validated Deep Learning System},
  author={Rathour, Ayushi},
  year={2025},
  note={External validation study on 485 independent samples with clinical performance analysis},
  url={https://huggingface.co/ayushirathour/chest-xray-pneumonia-detection}
}

πŸ‘©β€πŸ’» Author & Contact

Ayushi Rathour - Biotechnology Graduate | Exploring AI in Healthcare


πŸ₯ Advancing Medical AI Through Rigorous Validation

This model exemplifies the critical importance of external validation in medical artificial intelligence, achieving clinical-grade performance through systematic methodology, comprehensive evaluation, and transparent reporting of both capabilities and limitations.


License: MIT | Tags: medical, chest-xray, pneumonia-detection, healthcare, computer-vision, keras

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Evaluation results