1 6 9

CYK

cyk1337

AI & ML interests

Large language models

Recent Activity

updated a model 1 day ago

ernie-research/Themis-7b

updated a dataset 1 day ago

ernie-research/TARA

updated a collection 3 days ago

Macro-Action RLHF

View all activity

Organizations

cyk1337's activity

updated a model 1 day ago

ernie-research/Themis-7b

Updated 1 day ago • 27 • 4

updated a dataset 1 day ago

ernie-research/TARA

Preview • Updated 1 day ago • 72 • 1

updated a collection 3 days ago

Macro-Action RLHF

Collection

[ICLR'25] [MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions](https://openreview.net/forum?id=WWXjMYZxfH) • 7 items • Updated 3 days ago

updated a dataset 3 months ago

floatai/HumanEval-XL

Updated Mar 7 • 95 • 12

authored 2 papers 8 months ago

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Paper • 2410.02743 • Published Oct 3, 2024 • 8

On Training Data Influence of GPT Models

Paper • 2404.07840 • Published Apr 11, 2024

upvoted a paper 8 months ago

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Paper • 2410.02743 • Published Oct 3, 2024 • 8

commented a paper 8 months ago

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Paper • 2410.02743 • Published Oct 3, 2024 • 8 •

updated a dataset 8 months ago

ernie-research/GPTDynamics

Preview • Updated Oct 4, 2024 • 57 • 1

updated a model 8 months ago

ernie-research/ernie-code-560m

Text2Text Generation • Updated Oct 4, 2024 • 55 • 10

updated 2 datasets 8 months ago

ernie-research/rendered_xnli

Updated Oct 4, 2024 • 45 • 1

ernie-research/rendered_GLUE

Updated Oct 4, 2024 • 25 • 1

updated 3 models 8 months ago

liked a model 10 months ago

ernie-research/MonoGPT

Text Generation • Updated Oct 4, 2024 • 11 • 2

liked 3 models 11 months ago

meta-llama/Llama-3.1-405B

Text Generation • Updated Sep 25, 2024 • 9.98k • 934

ernie-research/DualGPT

Text Generation • Updated Oct 4, 2024 • 9 • 2

ernie-research/PixelGPT

Text Generation • Updated Oct 4, 2024 • 15 • 4

upvoted a paper 12 months ago

Tokenization Falling Short: The Curse of Tokenization

Paper • 2406.11687 • Published Jun 17, 2024 • 16