EfficientNet-B0 Document Image Classifier

This is an image classification model based on Google EfficientNet-B0, fine-tuned to classify input images into one of the following 16 categories:

  1. bar_chart
  2. bar_code
  3. chemistry_markush_structure
  4. chemistry_molecular_structure
  5. flow_chart
  6. icon
  7. line_chart
  8. logo
  9. map
  10. other
  11. pie_chart
  12. qr_code
  13. remote_sensing
  14. screenshot
  15. signature
  16. stamp

Citation

If you use this model in your work, please cite the following papers:

@article{Tan2019EfficientNetRM,
  title={EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks},
  author={Mingxing Tan and Quoc V. Le},
  journal={ArXiv},
  year={2019},
  volume={abs/1905.11946}
}

@techreport{Docling,
  author = {Deep Search Team},
  month = {8},
  title = {{Docling Technical Report}},
  url={https://arxiv.org/abs/2408.09869},
  eprint={2408.09869},
  doi = "10.48550/arXiv.2408.09869",
  version = {1.0.0},
  year = {2024}
}
Downloads last month
70
Safetensors
Model size
4.07M params
Tensor type
F32
·
Inference API
Unable to determine this model's library. Check the docs .

Model tree for ds4sd/DocumentFigureClassifier

Finetuned
(14)
this model

Collection including ds4sd/DocumentFigureClassifier