sfairXC (FsfairX)

hendrydong

authored 4 papers 9 months ago

Fractured Chain-of-Thought Reasoning

Paper • 2505.12992 • Published May 19, 2025 • 23

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15, 2025 • 120

Scalable Chain of Thoughts via Elastic Reasoning

Paper • 2505.05315 • Published May 8, 2025 • 26

Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL

Paper • 2505.02391 • Published May 5, 2025 • 25

bpucla

authored a paper about 1 year ago

BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation

Paper • 2502.03860 • Published Feb 6, 2025 • 25

hendrydong

authored 3 papers about 1 year ago

hendrydong

updated a model over 1 year ago

sfairXC/FsfairX-LLaMA3-RM-v0.1

Text Classification • 8B • Updated Oct 14, 2024 • 6.39k • 60

hendrydong

authored a paper over 1 year ago

MathHay: An Automated Benchmark for Long-Context Mathematical Reasoning in LLMs

Paper • 2410.04698 • Published Oct 7, 2024 • 13

hendrydong

updated 4 models over 1 year ago

sfairXC/llama-3.1-sft-2ep

Text Generation • 8B • Updated Sep 18, 2024 • 2

sfairXC/llama-3.1-sft-1ep

Text Generation • 8B • Updated Sep 18, 2024 • 2

sfairXC/gemma-sft-2ep

Text Generation • 3B • Updated Aug 30, 2024 • 1

sfairXC/gemma-sft-1ep

Text Generation • 3B • Updated Aug 30, 2024

hendrydong

authored a paper over 1 year ago

ThinK: Thinner Key Cache by Query-Driven Pruning

Paper • 2407.21018 • Published Jul 30, 2024 • 32

hendrydong

updated a model over 1 year ago

sfairXC/FsfairX-Gemma2-RM-v0.1

Text Classification • 9B • Updated Jul 9, 2024 • 47 • 7

hendrydong

authored 4 papers over 1 year ago

Reverse Diffusion Monte Carlo

Paper • 2307.02037 • Published Jul 5, 2023 • 1

Spurious Feature Diversification Improves Out-of-distribution Generalization

Paper • 2309.17230 • Published Sep 29, 2023

Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint

Paper • 2312.11456 • Published Dec 18, 2023 • 1

Local Augmentation for Graph Neural Networks

Paper • 2109.03856 • Published Sep 8, 2021 • 1

AI & ML interests

Team members 3

sfairXC's activity