Collections

Discover the best community collections!

Collections including paper arxiv:2404.03413
Cognition
Perception and abstraction. Each modality is tokenized and embedded into vectors for model to comprehend.
video llm
video llm works
Foundation Models
Collection by 16 days ago
Vision Language Models
Collection by 15 days ago
Image Generation
Collection by 11 days ago
Multimodal Papers
Collection by Apr 22
Vision Language Models Papers 🖼️💬📝
Papers about vision-language models, most important ones are on top of the list.