Update README.md
--- a/README.md
+++ b/README.md
@@ -10,7 +10,7 @@ A novel aspect of Cephalo's development is the innovative dataset generation met
 
 Cephalo can interpret complex visual scenes and generating contextually accurate language descriptions and answer queries.
 
-The
+The models are developed to process diverse inputs, including images and text, facilitating a broad range of applications such as image captioning, visual question answering, and multimodal content generation. The architecture combines a vision encoder model and an autoregressive transformer to process complex natural language understanding.
 
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/623ce1c6b66fedf374859fe7/kl5GWBP9WS0D4uwd1t3S7.png)
 
@@ -26,6 +26,8 @@ Cephalo provides a robust framework for multimodal interaction and understanding
 - Trained on Idefics-2 distilled image-text data from Wikipedia and scientific papers. Gives shorter answers, to the point, and generaly accurate.
 - [Cephalo-Idefics-2-vision-8b-beta](https://huggingface.co/lamm-mit/Cephalo-Idefics-2-vision-8b-beta)
 - Trained on GPT-4o distilled image-text data from Wikipedia and scientific papers. Gives longer answers, with enhanced reasoning. Can struggle with complex concepts.
+- [Cephalo-Llava-v1.6-Mistral-8b-alpha](https://huggingface.co/lamm-mit/Cephalo-Llava-v1.6-Mistral-8b-alpha)
+- Trained on GPT-4o distilled image-text data from Wikipedia and scientific papers.
 
 ## Citation
 