ShareCaptioner Model Card

Model details

Model type: ShareCaptioner is an open-source captioner fine-tuned on GPT4-Vision-assisted ShareGPT4V detailed caption data with a resolution of 448x448. ShareCaptioner is based on the improved InternLM-Xcomposer-7B base model.

Model date: ShareCaptioner was trained in Nov 2023.

Paper or resources for more information: [Project] [Paper] [Code]

License

Intended use

Primary intended uses: The primary use of ShareCaptioner is about producing high-quality image captions.

Primary intended users: The primary intended users of the model are researchers and hobbyists in computer vision, natural language processing, machine learning, and artificial intelligence.

Finetuning dataset

100K GPT4-Vision-generated image-text pairs

Downloads last month: 75

Spaces using Lin-Chen/ShareCaptioner 2

Collection including Lin-Chen/ShareCaptioner

ShareGPT4V

Collection

7 items • Updated May 26, 2024 • 2

Paper for Lin-Chen/ShareCaptioner

ShareGPT4V: Improving Large Multi-Modal Models with Better Captions

Paper • 2311.12793 • Published Nov 21, 2023 • 18