Jie Wang

Everloom

AI & ML interests

Foundation Models for Robot Learning

Recent Activity

liked a dataset 27 days ago

IDEA-Research/HumanRef-CoT-45k

new activity 27 days ago

IDEA-Research/HumanRef-CoT-45k:Add task_categories metadata

upvoted an article 28 days ago

Vision Language Models Explained

Organizations

None yet

liked a dataset 27 days ago

IDEA-Research/HumanRef-CoT-45k

Viewer • Updated 21 days ago • 2 • 269 • 3

New activity in IDEA-Research/HumanRef-CoT-45k 27 days ago

Add task_categories metadata

#2 opened 2 months ago by nielsr

upvoted an article 28 days ago

Article

Vision Language Models Explained

and 1 other • Apr 11, 2024 • 432

upvoted a paper about 1 month ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2 • 127

New activity in big-vision/paligemma about 1 month ago

Gemma

#6 opened 3 months ago by shivamRaiADS

upvoted an article about 1 month ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

and 2 others • May 14, 2024 • 265

upvoted a collection about 1 month ago

PaliGemma FT Models

Collection

108 items • Updated Jul 10 • 33

upvoted a paper about 1 month ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 72

upvoted an article 5 months ago

Article

Mixture of Experts Explained

and 5 others • Dec 11, 2023 • 816

liked a model 5 months ago

lerobot/pi0

Robotics • 4B • Updated Mar 6 • 12.1k • 285

liked 3 models 7 months ago

liked a model 10 months ago

facebook/sam2.1-hiera-large

Mask Generation • Updated Sep 24, 2024 • 155k • 91

liked a Space 10 months ago

Grounded SAM

💩

upvoted a collection about 1 year ago

PEFT papers

Collection

A collection of methods that have been implemented in the 🤗 PEFT library • 12 items • Updated Jan 30, 2024 • 28

commented a paper about 1 year ago

BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation

Paper • 2407.17952 • Published Jul 25, 2024 • 33

upvoted a paper about 1 year ago

BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation

Paper • 2407.17952 • Published Jul 25, 2024 • 33

upvoted an article about 1 year ago

Article

💃Introducing the first LLM-based Motion understanding model: MotionLLM

Jun 26, 2024 • 3

liked a Space about 1 year ago

Movid Vis

💻

Browse and view videos with questions and answers from MoVid dataset
