Jaykumaran R

Jaykumaran17

Jaykumaran

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

HuggingFaceTB/SmolLM2-1.7B-Instruct

Jaykumaran17's activity

upvoted a paper 1 day ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published 5 days ago • 75

upvoted a collection 2 days ago

SmolVLA

Small, efficient and light-weight VLAs pretrained on community datasets • 1 item • Updated 7 days ago • 19

upvoted a paper 3 days ago

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

Paper • 2401.02117 • Published Jan 4, 2024 • 33

upvoted an article 4 days ago

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

Article • By … and 8 others • 5 days ago • 96

upvoted an article 9 days ago

Vision Language Models (Better, Faster, Stronger)

Article • By … and 4 others • 27 days ago • 420

upvoted a collection 10 days ago

NVILA

10 items • Updated 19 days ago • 14

upvoted an article 27 days ago

SmolLM - blazingly fast and remarkably powerful

Article • By … and 2 others • Jul 16, 2024 • 375

upvoted an article about 2 months ago

Visually Multilingual: Introducing mcdse-2b

Article • Oct 27, 2024 • 41

upvoted a collection about 2 months ago

Multimodal DSE Retrievers

A collection of DSE models for multimodal retrieval • 5 items • Updated Apr 15 • 14

upvoted 3 articles 2 months ago

NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets

Article • By … and 4 others • Mar 18 • 41

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Article • By … and 3 others • Feb 4 • 158

SmolVLM - small yet mighty Vision Language Model

Article • By … and 4 others • Nov 26, 2024 • 308

upvoted an article 3 months ago

MCP is All You Need: The Future of AI Interoperability

Article • Mar 18 • 8

upvoted a paper 3 months ago

VGGT: Visual Geometry Grounded Transformer

Paper • 2503.11651 • Published Mar 14 • 21

upvoted an article 3 months ago

Open-Source Handwritten Signature Detection Model

Article • Mar 14 • 113

upvoted a paper 3 months ago

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published Mar 12 • 72

upvoted a collection 6 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated Apr 30 • 305

upvoted a collection 9 months ago

ViDoRe Benchmark

Benchmark for document retrieval using visual features, introduced in the ColPali paper. Datasets are using the QA format. • 10 items • Updated Jan 23 • 18

upvoted an article 10 months ago

Vision Language Models Explained

Article • By … and 1 other • Apr 11, 2024 • 373

upvoted an article 11 months ago

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

Article • By … and 3 others • May 1, 2024 • 77