Confidence-Building Measures for Artificial Intelligence: Workshop Proceedings Paper • 2308.00862 • Published Aug 1, 2023
D2PO: Discriminator-Guided DPO with Response Evaluation Models Paper • 2405.01511 • Published May 2, 2024
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback Paper • 2406.09279 • Published Jun 13, 2024 • 3
WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs Paper • 2406.18495 • Published Jun 26, 2024 • 13
Towards a Framework for Openness in Foundation Models: Proceedings from the Columbia Convening on Openness in Artificial Intelligence Paper • 2405.15802 • Published May 17, 2024
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models Paper • 2409.17146 • Published Sep 25, 2024 • 114
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published Nov 22, 2024 • 63
M-RewardBench: Evaluating Reward Models in Multilingual Settings Paper • 2410.15522 • Published Oct 20, 2024 • 12
Self-Directed Synthetic Dialogues and Revisions Technical Report Paper • 2407.18421 • Published Jul 25, 2024
SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts Paper • 2405.07518 • Published May 13, 2024 • 28
RewardBench: Evaluating Reward Models for Language Modeling Paper • 2403.13787 • Published Mar 20, 2024 • 23
The Alignment Ceiling: Objective Mismatch in Reinforcement Learning from Human Feedback Paper • 2311.00168 • Published Oct 31, 2023
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research Paper • 2402.00159 • Published Jan 31, 2024 • 64