Harpreet Sahota PRO

harpreetsahota

AI & ML interests

Deep learning, language models, prompt engineering, agents, multi-agent systems

Recent Activity

liked a dataset about 3 hours ago

harpreetsahota/STONE

updated a dataset about 4 hours ago

Voxel51/Syn4D_RGBD

updated a dataset about 5 hours ago

harpreetsahota/STONE

upvoted 2 collections 9 days ago

MolmoAct Data Mixture

All datasets for the MolmoAct (Multimodal Open Language Model for Action) release. • 4 items • Updated Dec 23, 2025 • 20

MolmoAct

All models for the MolmoAct (Multimodal Open Language Model for Action) release. • 10 items • Updated 25 days ago • 37

upvoted a collection 17 days ago

Sapiens2

28 items • Updated 14 days ago • 39

upvoted a collection 24 days ago

Gemma 4

12 items • Updated 24 days ago • 857

upvoted a collection 2 months ago

3DV 2026

Collection of all the 3DV models, datasets and demos • 27 items • Updated Mar 25 • 4

upvoted a paper 2 months ago

PALM: A Dataset and Baseline for Learning Multi-subject Hand Prior

Paper • 2511.05403 • Published Nov 7, 2025 • 1

upvoted a collection 4 months ago

Physical AI

Collection of open, commercial-grade datasets for physical AI developers • 50 items • Updated about 2 hours ago • 157

upvoted 2 collections 5 months ago

OpenX-LeRobot

Open X-Embodiment datasets in LeRobot format with standard transformation (https://github.com/Tavish9/any4lerobot) • 32 items • Updated Mar 2 • 37

Molmo2

Artifacts for the Molmo2 release • 5 items • Updated Mar 2 • 36

upvoted 2 papers 8 months ago

Robot Learning: A Tutorial

Paper • 2510.12403 • Published Oct 14, 2025 • 135

CommonForms: A Large, Diverse Dataset for Form Field Detection

Paper • 2509.16506 • Published Sep 20, 2025 • 22

upvoted 2 collections 8 months ago

ModernVBERT

Resources for ModernVBERT • 5 items • Updated Oct 3, 2025 • 10

Qwen3-VL

37 items • Updated Dec 31, 2025 • 730

upvoted an article 8 months ago

Vision Language Model Alignment in TRL ⚡️

Article • sergiopaniego, merve, qgallouedec, kashif, ariG23498 +3 • Aug 7, 2025 • 111

upvoted a collection 8 months ago

Granite Docling

Models for parsing complex PDFs and structured documents, designed to complement Docling. • 4 items • Updated about 1 month ago • 64

upvoted an article 9 months ago

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR

Article • baidu • Sep 10, 2025 • 111

upvoted a collection 9 months ago

PP-OCRv5

PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese • 13 items • Updated 17 days ago • 57

upvoted 3 collections 10 months ago

UI-Venus

12 items • Updated Mar 25 • 40

Releases July 25

28 items • Updated Jul 30, 2025 • 3

Releases July 18

34 items • Updated Jul 23, 2025 • 4