SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7 • 188
Vision Language Models Quantization Collection Vision Language Models (VLMs) quantized by Neural Magic • 20 items • Updated Mar 4 • 6
MambaVision Collection MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models. • 13 items • Updated 1 day ago • 31
MoshiVis v0.1 Collection MoshiVis is a Vision Speech Model built as a perceptually-augmented version of Moshi v0.1 for conversing about image inputs. • 8 items • Updated Mar 21 • 22
Article Welcome Gemma 3: Google's all-new multimodal, multilingual, long-context open LLM By ariG23498 and 3 others • Mar 12 • 427
Article SmolVLM2: Bringing Video Understanding to Every Device By orrzohar and 6 others • Feb 20 • 262