Mohsen Dowlatshah

upvoted an article 10 days ago

upvoted an article 10 days ago

upvoted an article 10 days ago

mones2222's activity

upvoted 7 articles 10 days ago

and 4 others •

and 5 others •

and 4 others •

and 1 other •

Apr 11, 2024

and 1 other •

upvoted 9 articles 11 days ago

and 3 others •

and 1 other •

and 8 others •

Nov 25, 2024

and 4 others •

Nov 26, 2024

and 2 others •

Dec 31, 2024

and 1 other •

upvoted an article 12 days ago

and 2 others •

upvoted 3 articles 13 days ago

Dec 24, 2024

Nov 25, 2024

and 3 others •

Mohsen Dowlatshah

AI & ML interests

Recent Activity

Organizations

mones2222's activity

KV Cache from scratch in nanoVLM

KV Caching Explained: Optimizing Transformer Inference Efficiency

🐯 Liger GRPO meets TRL

Vision Language Models (Better, Faster, Stronger)

Vision Language Models Explained

My Journey Into Vision Models

17 Reasons Why Gradio Isn't Just Another UI Library

Training and Finetuning Reranker Models with Sentence Transformers v4

Train 400x faster Static Embedding Models with Sentence Transformers

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone!

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

Zero to Hero with the TRL learning link bomb 💣

SmolVLM - small yet mighty Vision Language Model

Introducing smolagents: simple agents that write actions in code.

FastRTC: The Real-Time Communication Library for Python

Open-R1: a fully open reproduction of DeepSeek-R1

Visualize and understand GPU memory in PyTorch

You could have designed state of the art positional encoding

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality