Nathan Lambert's picture

Nathan Lambert

natolambert

·

https://www.natolambert.com/

AI & ML interests

Reinforcement learning, Ethics, Robotics, Dynamics Models

Recent Activity

upvoted a collection 7 days ago

NVIDIA Nemotron v3

upvoted a collection 7 days ago

Nemotron-Post-Training-v3

liked a model 7 days ago

nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8

View all activity

Organizations

upvoted 2 collections 7 days ago

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models • 15 items • Updated about 8 hours ago • 236

Nemotron-Post-Training-v3

Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 28 items • Updated about 8 hours ago • 98

upvoted an article 13 days ago

Article

How NVIDIA Builds Open Data for AI

13 days ago

•

14

upvoted a collection 21 days ago

Olmo Hybrid

6 items • Updated 19 days ago • 20

upvoted a paper about 1 month ago

Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

Paper • 2510.24702 • Published Oct 28, 2025 • 30

upvoted a collection 3 months ago

Olmo 3.1

The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets... • 9 items • Updated Dec 23, 2025 • 48

upvoted a paper 4 months ago

olmOCR 2: Unit Test Rewards for Document OCR

Paper • 2510.19817 • Published Oct 22, 2025 • 16

upvoted a collection 4 months ago

Olmo 3 Post-training

All artifacts for post-training Olmo 3. Datasets follow the model that resulted from training on them. • 32 items • Updated Dec 23, 2025 • 51

upvoted a collection 5 months ago

Olmo 3

Artifacts for the Olmo 3 release. • 7 items • Updated 22 days ago • 167

upvoted an article 8 months ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

+3

Jul 29, 2025

•

219

upvoted a paper 9 months ago

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2, 2025 • 58

upvoted a collection 9 months ago

Reward Models 06-2025

Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge • 8 items • Updated about 8 hours ago • 23

upvoted 2 collections 10 months ago

Reward Bench 2

Datasets, spaces, and models for Reward Bench 2 benchmark and paper! • 11 items • Updated Dec 23, 2025 • 16

Common Pile v0.1

All resources related to Common Pile v0.1, an 8TB dataset of public domain and openly licensed text • 4 items • Updated Jun 6, 2025 • 40

upvoted 2 collections 11 months ago

OpenVision

27 items • Updated Aug 15, 2025 • 33

Qwen3

84 items • Updated Dec 31, 2025 • 1.73k

upvoted a paper 11 months ago

Reinforcement Learning from Human Feedback

Paper • 2504.12501 • Published Apr 16, 2025 • 4

upvoted a collection about 1 year ago

OLMoE (January 2025)

Improved OLMoE for iOS app. Read more: https://allenai.org/blog/olmoe-app • 10 items • Updated Dec 23, 2025 • 16

upvoted an article about 1 year ago

Article

Putting RL back in RLHF

Jun 12, 2024

•

111

upvoted a collection about 1 year ago

2024 Interconnects Artifacts

Models & datasets mentioned in the bottom section of posts! • 280 items • Updated Jan 2, 2025 • 6