- Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement
  Paper • 2501.12273 • Published • 14
- CritiQ: Mining Data Quality Criteria from Human Preferences
  Paper • 2502.19279 • Published • 9
- Instruction Pre-Training: Language Models are Supervised Multitask Learners
  Paper • 2406.14491 • Published • 94
Eric NG
Eric108
AI & ML interests
NLP
Recent Activity
upvoted a paper 3 days ago
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
upvoted a paper 3 days ago
Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning
Organizations
None yet
Collections
2
- Large Language Models Can Self-Improve in Long-context Reasoning
  Paper • 2411.08147 • Published • 67
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering
  Paper • 2411.11504 • Published • 23
- Auto-Evolve: Enhancing Large Language Model's Performance via Self-Reasoning Framework
  Paper • 2410.06328 • Published • 2
- Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability
  Paper • 2411.19943 • Published • 64
models
0
None public yet
datasets
0
None public yet