1 5

Changhao

lichangh20

https://lichangh20.github.io/

lichangh20

AI & ML interests

RL, Agent, Efficient ML

Recent Activity

upvoted an article about 1 month ago

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

upvoted a paper about 2 months ago

Matryoshka: Learning to Drive Black-Box LLMs with LLMs

upvoted a paper 3 months ago

MLE-Smith: Scaling MLE Tasks with Automated Multi-Agent Pipeline

View all activity

Organizations

upvoted an article about 1 month ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11

•

upvoted a paper about 2 months ago

Matryoshka: Learning to Drive Black-Box LLMs with LLMs

Paper • 2410.20749 • Published Oct 28, 2024 • 1

upvoted a paper 3 months ago

MLE-Smith: Scaling MLE Tasks with Automated Multi-Agent Pipeline

Paper • 2510.07307 • Published Oct 8 • 5

upvoted a paper 5 months ago

Task-Specific Zero-shot Quantization-Aware Training for Object Detection

Paper • 2507.16782 • Published Jul 22 • 9

commented a paper 5 months ago

Task-Specific Zero-shot Quantization-Aware Training for Object Detection

Paper • 2507.16782 • Published Jul 22 • 9 •

authored 4 papers 5 months ago

Training Transformers with 4-bit Integers

Paper • 2306.11987 • Published Jun 21, 2023 • 22

Matryoshka: Learning to Drive Black-Box LLMs with LLMs

Paper • 2410.20749 • Published Oct 28, 2024 • 1

MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering

Paper • 2505.07782 • Published May 12 • 19

Task-Specific Zero-shot Quantization-Aware Training for Object Detection

Paper • 2507.16782 • Published Jul 22 • 9

upvoted a paper 7 months ago

MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering

Paper • 2505.07782 • Published May 12 • 19

updated a dataset 8 months ago

lichangh20/s1K_initial_filtered_for_llama8b

Viewer • Updated May 2 • 1k • 6

published a dataset 8 months ago

lichangh20/s1K_initial_filtered_for_llama8b

Viewer • Updated May 2 • 1k • 6

updated a dataset 8 months ago

lichangh20/olympiadbench

Viewer • Updated Apr 22 • 674 • 14

published a dataset 8 months ago

lichangh20/olympiadbench

Viewer • Updated Apr 22 • 674 • 14

updated a dataset 8 months ago

lichangh20/minervamath

Viewer • Updated Apr 22 • 272 • 8

published a dataset 8 months ago

lichangh20/minervamath

Viewer • Updated Apr 22 • 272 • 8

updated a model 9 months ago

lichangh20/s1k_format_filtered_bf16

Feature Extraction • 7B • Updated Mar 25 • 6

published a model 9 months ago

lichangh20/s1k_format_filtered_bf16

Feature Extraction • 7B • Updated Mar 25 • 6

updated a model 9 months ago

lichangh20/s1k_format_filtered

Feature Extraction • 7B • Updated Mar 25 • 8

published a model 9 months ago

lichangh20/s1k_format_filtered

Feature Extraction • 7B • Updated Mar 25 • 8

Changhao

AI & ML interests

Recent Activity

Organizations

lichangh20's activity

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment