YannisTevissen (Yannis Tevissen)

upvoted an article 3 months ago

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

+7 • Jun 3, 2025 • 321

upvoted an article 4 months ago

Article

On the Shifting Global Compute Landscape

Oct 29, 2025 • 61

upvoted a paper 11 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7, 2025 • 205

upvoted an article 11 months ago

Article

Are AI Agents Sustainable? It depends

Apr 7, 2025 • 22

upvoted an article 12 months ago

Article

TGI Multi-LoRA: Deploy Once, Serve 30 Models

+1 • Jul 18, 2024 • 62

upvoted a collection about 1 year ago

AI for Disability

Collection

A collection of datasets, models, spaces and papers that use AI to address disability-related topics. • 4 items • Updated Jun 10, 2025 • 3

upvoted a collection over 1 year ago

SmolLM2

Collection

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5, 2025 • 302

upvoted 4 papers over 1 year ago

Multimodal Chaptering for Long-Form TV Newscast Video

Paper • 2406.17590 • Published Mar 20, 2024 • 2

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 133

Goldfish: Vision-Language Understanding of Arbitrarily Long Videos

Paper • 2407.12679 • Published Jul 17, 2024 • 8

Towards Retrieval Augmented Generation over Large Video Libraries

Paper • 2406.14938 • Published Jun 21, 2024 • 22

upvoted a paper almost 2 years ago

Inserting Faces inside Captions: Image Captioning with Attention Guided Merging

Paper • 2405.02305 • Published Mar 20, 2024 • 2

