CapCheck AI Image Detection

Vision Transformer (ViT) fine-tuned for detecting AI-generated images.

Model Lineage & Attribution

This model builds on the work of others:

Layer	Model	Author	License
Base Architecture	google/vit-base-patch16-224-in21k	Google	Apache 2.0
AI Detection Fine-tune	dima806/ai_vs_real_image_detection	dima806	Apache 2.0
This Model	capcheck/ai-image-detection	CapCheck	Apache 2.0

Special thanks to:

Google for the Vision Transformer (ViT) architecture
dima806 for fine-tuning on the CIFAKE dataset for AI image detection

Model Description

Architecture: ViT-Base (86M parameters)
Input Size: 224x224 pixels
Training Data: CIFAKE dataset (AI-generated vs real images)
Task: Binary classification (Real vs Fake/AI-generated)

Usage

from transformers import pipeline

detector = pipeline("image-classification", model="capcheck/ai-image-detection")
result = detector("path/to/image.jpg")

# Output:
# [{"label": "Fake", "score": 0.95}, {"label": "Real", "score": 0.05}]

Labels

Label	Description
`Real`	Authentic photograph or real-world image
`Fake`	AI-generated or synthetically created image

Performance

This model was trained on the CIFAKE dataset. Performance on modern AI generators (Flux, Midjourney v6, DALL-E 3, Stable Diffusion 3) may vary.

See dima806's model card for detailed training metrics.

Limitations

Trained primarily on older AI generators (pre-2024)
May have reduced accuracy on:
- Very new AI generators not in training data
- Heavily compressed images (low JPEG quality)
- Images smaller than 224x224 pixels
Works best on images with clear subjects

Intended Use

Content moderation and authenticity verification
Research into AI-generated content detection
Educational purposes

Not intended for:

Making consequential decisions without human review
Law enforcement evidence without corroborating sources

Ethical Considerations

This tool is not 100% accurate - false positives harm legitimate creators
False negatives can allow misinformation to spread
Use in conjunction with other verification methods
Human review is recommended for high-stakes decisions

Roadmap

Current Version (v1.0.0)

Base model from dima806's CIFAKE-trained ViT. Solid foundation for AI detection.

Planned Improvements

Phase 1: Modern Generator Training

Fine-tune on images from Flux, Midjourney v6, DALL-E 3, Stable Diffusion 3
Target: Reduce false negatives on 2024+ AI generators

Phase 2: False Positive Reduction

Curate dataset of real images commonly flagged as AI
Photography edge cases: HDR, heavy editing, digital art
Target: <5% false positive rate

Phase 3: Continuous Updates

Quarterly re-training as new generators emerge
Community feedback integration
Benchmark against latest AI generators

Contributing

We welcome:

Dataset contributions (properly licensed images)
Bug reports and false positive/negative examples
Benchmark results on new generators

Join the discussion: https://huggingface.co/capcheck/ai-image-detection/discussions

License

Apache 2.0 (inherited from Google ViT and dima806's fine-tuned model)

Citation

If you use this model, please cite:

@misc{capcheck-ai-detection,
  author = {CapCheck},
  title = {AI Image Detection Model},
  year = {2024},
  publisher = {HuggingFace},
  url = {https://huggingface.co/capcheck/ai-image-detection},
  note = {Based on dima806/ai_vs_real_image_detection, fine-tuned from google/vit-base-patch16-224-in21k}
}

Changelog

v1.0.0 (Initial Release)

Published base model from dima806/ai_vs_real_image_detection
Added proper attribution and documentation
Established as CapCheck's source of truth for AI image detection

Downloads last month: 27

Safetensors

Model size

85.8M params

Tensor type

F32

Model tree for capcheck/ai-image-detection

Base model

google/vit-base-patch16-224-in21k

Finetuned

dima806/ai_vs_real_image_detection

Finetuned

(1)

this model