Model Summary

Cephalo is a series of multimodal materials science focused vision large language models (V-LLMs) designed to integrate visual and linguistic data for advanced understanding and interaction in human-AI or multi-agent AI frameworks.

Model Capabilities

This version of Cephalo, lamm-mit/Cephalo-LaTeX-Phi-3-vision-128k-4b-beta, is trained to convert images of equations to LaTeX code. This version is trained on a larger dataset and for more epochs than lamm-mit/Cephalo-LaTeX-Phi-3-vision-128k-4b-alpha.

Downloads last month: 52

Inference Providers NEW

Image-to-Text

This model is not currently available via any of the supported Inference Providers.

The model cannot be deployed to the HF Inference API: The HF Inference API does not support model that require custom code execution.

Datasets used to train lamm-mit/Cephalo-LaTeX-Phi-3-vision-128k-4b-beta

Collection including lamm-mit/Cephalo-LaTeX-Phi-3-vision-128k-4b-beta

Cephalo

Collection

Cephalo is a series of multimodal vision large language models (V-LLMs) designed to integrate visual and linguistic reasoning in materials science. • 15 items • Updated Jan 22 • 4