Zhuoran Yang's picture

4 1

Zhuoran Yang PRO

zhuoranyang

·

AI & ML interests

reinforcement learning, game theory, AGI

Recent Activity

upvoted a paper 3 days ago

Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report

upvoted a paper about 2 months ago

Taming Polysemanticity in LLMs: Provable Feature Recovery via Sparse Autoencoders

authored a paper 5 months ago

Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation

View all activity

Organizations

upvoted a paper 3 days ago

Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report

Paper • 2508.01059 • Published 7 days ago • 26

upvoted a paper about 2 months ago

Taming Polysemanticity in LLMs: Provable Feature Recovery via Sparse Autoencoders

Paper • 2506.14002 • Published Jun 16 • 6

authored a paper 5 months ago

Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation

Paper • 2502.16707 • Published Feb 23 • 13

updated 2 collections 6 months ago

LLM Agents (Prompting)

2 items • Updated Feb 16

LLM-Reasoning (training)

LLM reasoning papers, with RL and long COT. (Post)Training of LLM is involved. • 6 items • Updated Feb 16

liked a dataset 6 months ago

open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18 • 450k • 16.8k • 629

updated a collection 6 months ago

LLM-Reasoning (training)

LLM reasoning papers, with RL and long COT. (Post)Training of LLM is involved. • 6 items • Updated Feb 16

upvoted a paper 6 months ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 24

authored a paper 8 months ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 24

updated a collection over 1 year ago

in-context learning & chain of thought

1 item • Updated Jan 21, 2024

updated a collection almost 2 years ago

Control

1 item • Updated Oct 26, 2023

authored a paper almost 2 years ago

Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning

Paper • 2207.14800 • Published Jul 29, 2022