7 37 55

Eric NG

Eric108

AI & ML interests

NLP

Recent Activity

upvoted a paper 5 days ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

upvoted a paper 5 days ago

Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning

upvoted a paper 5 days ago

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

View all activity

Organizations

None yet

Eric108's activity

upvoted 4 papers 5 days ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published 6 days ago • 107

Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning

Paper • 2505.01441 • Published 14 days ago • 34

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

Paper • 2505.02625 • Published 7 days ago • 20

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published 7 days ago • 65

upvoted 7 papers 6 days ago

Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks

Paper • 2505.00234 • Published 11 days ago • 22

upvoted 2 papers 13 days ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 61

UltraIF: Advancing Instruction Following from the Wild

Paper • 2502.04153 • Published Feb 6 • 23

upvoted a collection 13 days ago

Qwen3

Collection

37 items • Updated 3 days ago • 559

upvoted 2 papers 14 days ago

Tina: Tiny Reasoning Models via LoRA

Paper • 2504.15777 • Published 20 days ago • 54

Process Reward Models That Think

Paper • 2504.16828 • Published 19 days ago • 16

upvoted a paper 20 days ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published 23 days ago • 121

upvoted 3 papers 21 days ago

ReZero: Enhancing LLM search ability by trying one-more-time

Paper • 2504.11001 • Published 27 days ago • 14

AlayaDB: The Data Foundation for Efficient and Effective Long-context LLM Inference

Paper • 2504.10326 • Published 28 days ago • 25

SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

Paper • 2504.11468 • Published Apr 10 • 28