Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

ernie-research

community
ernie-research
Activity Feed

AI & ML interests

Large Language Models

lilei's profile picture CYK's profile picture Shuohuan Wang's profile picture Haoran Sun's profile picture dingyuchen's profile picture

Moyu-hrsun 
authored a paper 3 months ago

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Paper • 2410.02743 • Published Oct 3, 2024 • 8
cyk1337 
updated a model 3 months ago

ernie-research/Themis-7b

Updated Jun 6 • 2 • 4
cyk1337 
updated a dataset 3 months ago

ernie-research/TARA

Preview • Updated Jun 6 • 25 • 1
cyk1337 
updated a collection 3 months ago

Macro-Action RLHF

Collection
[ICLR'25] [MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions](https://openreview.net/forum?id=WWXjMYZxfH) • 7 items • Updated Jun 5
msdyc 
updated 4 collections 3 months ago

Tool-Augmented Reward Models

Collection
[ICLR'24 Spotlight] Tool-Augmented Reward Modeling • 3 items • Updated May 21

Multilingual Code Pre-training (ERNIE-Code)

Collection
[ACL'23 Findings] ERNIE-Code, the First multilingual text and multlingual code pre-training. • 2 items • Updated May 21

Pixel-based Pre-training (PixelGPT)

Collection
[EMNLP'24] [Autoregressive Pre-Training on Pixels and Texts](https://arxiv.org/pdf/2404.10710). • 6 items • Updated May 21

Macro-Action RLHF

Collection
[ICLR'25] [MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions](https://openreview.net/forum?id=WWXjMYZxfH) • 7 items • Updated Jun 5
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs