CapCheck AI Image Detection

Vision Transformer (ViT) fine-tuned for detecting AI-generated images.

Model Lineage & Attribution

This model builds on the work of others:

Layer Model Author License
Base Architecture google/vit-base-patch16-224-in21k Google Apache 2.0
AI Detection Fine-tune dima806/ai_vs_real_image_detection dima806 Apache 2.0
This Model capcheck/ai-image-detection CapCheck Apache 2.0

Special thanks to:

  • Google for the Vision Transformer (ViT) architecture
  • dima806 for fine-tuning on the CIFAKE dataset for AI image detection

Model Description

  • Architecture: ViT-Base (86M parameters)
  • Input Size: 224x224 pixels
  • Training Data: CIFAKE dataset (AI-generated vs real images)
  • Task: Binary classification (Real vs Fake/AI-generated)

Usage

from transformers import pipeline

detector = pipeline("image-classification", model="capcheck/ai-image-detection")
result = detector("path/to/image.jpg")

# Output:
# [{"label": "Fake", "score": 0.95}, {"label": "Real", "score": 0.05}]

Labels

Label Description
Real Authentic photograph or real-world image
Fake AI-generated or synthetically created image

Performance

This model was trained on the CIFAKE dataset. Performance on modern AI generators (Flux, Midjourney v6, DALL-E 3, Stable Diffusion 3) may vary.

See dima806's model card for detailed training metrics.

Limitations

  • Trained primarily on older AI generators (pre-2024)
  • May have reduced accuracy on:
    • Very new AI generators not in training data
    • Heavily compressed images (low JPEG quality)
    • Images smaller than 224x224 pixels
  • Works best on images with clear subjects

Intended Use

  • Content moderation and authenticity verification
  • Research into AI-generated content detection
  • Educational purposes

Not intended for:

  • Making consequential decisions without human review
  • Law enforcement evidence without corroborating sources

Ethical Considerations

  • This tool is not 100% accurate - false positives harm legitimate creators
  • False negatives can allow misinformation to spread
  • Use in conjunction with other verification methods
  • Human review is recommended for high-stakes decisions

Roadmap

Current Version (v1.0.0)

Base model from dima806's CIFAKE-trained ViT. Solid foundation for AI detection.

Planned Improvements

Phase 1: Modern Generator Training

  • Fine-tune on images from Flux, Midjourney v6, DALL-E 3, Stable Diffusion 3
  • Target: Reduce false negatives on 2024+ AI generators

Phase 2: False Positive Reduction

  • Curate dataset of real images commonly flagged as AI
  • Photography edge cases: HDR, heavy editing, digital art
  • Target: <5% false positive rate

Phase 3: Continuous Updates

  • Quarterly re-training as new generators emerge
  • Community feedback integration
  • Benchmark against latest AI generators

Contributing

We welcome:

  • Dataset contributions (properly licensed images)
  • Bug reports and false positive/negative examples
  • Benchmark results on new generators

Join the discussion: https://huggingface.co/capcheck/ai-image-detection/discussions

License

Apache 2.0 (inherited from Google ViT and dima806's fine-tuned model)

Citation

If you use this model, please cite:

@misc{capcheck-ai-detection,
  author = {CapCheck},
  title = {AI Image Detection Model},
  year = {2024},
  publisher = {HuggingFace},
  url = {https://huggingface.co/capcheck/ai-image-detection},
  note = {Based on dima806/ai_vs_real_image_detection, fine-tuned from google/vit-base-patch16-224-in21k}
}

Changelog

v1.0.0 (Initial Release)

  • Published base model from dima806/ai_vs_real_image_detection
  • Added proper attribution and documentation
  • Established as CapCheck's source of truth for AI image detection
Downloads last month
27
Safetensors
Model size
85.8M params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for capcheck/ai-image-detection

Finetuned
(1)
this model