prithivMLmods commited on
Commit
80e9a19
·
verified ·
1 Parent(s): 8c2ebad

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -1
README.md CHANGED
@@ -28,9 +28,11 @@ tags:
28
 
29
  > The **Inkscope-Captions-2B-0526** model is a fine-tuned version of *Qwen2-VL-2B-Instruct*, optimized for **image captioning**, **vision-language understanding**, and **English-language caption generation**. This model was fine-tuned on the `conceptual-captions-cc12m-llavanext` dataset (first 30k entries) to generate **detailed, high-quality captions** for images, including complex or abstract scenes.
30
 
31
- > [!warning]
32
  Colab Demo : https://huggingface.co/prithivMLmods/Inkscope-Captions-2B-0526/blob/main/Inkscope%20Captions%202B%200526%20Demo/Inkscope-Captions-2B-0526.ipynb
33
 
 
 
34
  ---
35
 
36
  #### Key Enhancements:
@@ -115,6 +117,12 @@ for new_text in streamer:
115
 
116
  ![Screenshot 2025-05-27 at 03-59-36 Gradio.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/ykPB8Yxk0Z_1WDmSoCjKD.png)
117
  ![Screenshot 2025-05-27 at 03-59-53 (anonymous) - output_8dc4ad31-403a-4f59-a483-be2aec11b756.pdf.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/tBPdM1iyRf8Fi12urNUbt.png)
 
 
 
 
 
 
118
 
119
  ---
120
 
 
28
 
29
  > The **Inkscope-Captions-2B-0526** model is a fine-tuned version of *Qwen2-VL-2B-Instruct*, optimized for **image captioning**, **vision-language understanding**, and **English-language caption generation**. This model was fine-tuned on the `conceptual-captions-cc12m-llavanext` dataset (first 30k entries) to generate **detailed, high-quality captions** for images, including complex or abstract scenes.
30
 
31
+ > [!note]
32
  Colab Demo : https://huggingface.co/prithivMLmods/Inkscope-Captions-2B-0526/blob/main/Inkscope%20Captions%202B%200526%20Demo/Inkscope-Captions-2B-0526.ipynb
33
 
34
+ > [!note]
35
+ Video Understanding Demo : https://huggingface.co/prithivMLmods/Inkscope-Captions-2B-0526/blob/main/Inkscope-Captions-2B-0526-Video-Understanding/Inkscope-Captions-2B-0526-Video-Understanding.ipynb
36
  ---
37
 
38
  #### Key Enhancements:
 
117
 
118
  ![Screenshot 2025-05-27 at 03-59-36 Gradio.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/ykPB8Yxk0Z_1WDmSoCjKD.png)
119
  ![Screenshot 2025-05-27 at 03-59-53 (anonymous) - output_8dc4ad31-403a-4f59-a483-be2aec11b756.pdf.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/tBPdM1iyRf8Fi12urNUbt.png)
120
+
121
+ ---
122
+
123
+ ### **Video Inference**
124
+
125
+ ![Screenshot 2025-05-27 at 20-35-30 Video Understanding with Inkscope-Captions-2B-0526.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/LrHNNYV1elysHjAmzOXw3.png)
126
 
127
  ---
128