Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Matthew Hollings's picture

2 2 57

Matthew Hollings

matthh

·

https://applyingai.dev/

mattholl

AI & ML interests

Generative AI, computational creativity, reinforcement learning

Organizations

None yet

matthh 's collections 2

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 62
PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Paper • 2403.10704 • Published Mar 15, 2024 • 60

Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers Faster

Paper • 2311.08263 • Published Nov 14, 2023 • 16

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 62
PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Paper • 2403.10704 • Published Mar 15, 2024 • 60

Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers Faster

Paper • 2311.08263 • Published Nov 14, 2023 • 16

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs