Zihan Liu

zihanliu

https://zliucr.github.io/

zliucr

AI & ML interests

None yet

Recent Activity

updated a model 29 days ago

nvidia/AceReason-Nemotron-1.1-7B

new activity about 2 months ago

nvidia/AceReason-1.1-SFT:Add task_categories and library_name to metadata

updated a collection about 2 months ago

upvoted a paper about 2 months ago

AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy

Paper • 2506.13284 • Published Jun 16 • 24

upvoted a collection about 2 months ago

AceReason

Math and Code reasoning model trained through reinforcement learning (RL) • 7 items • Updated 19 days ago • 14

upvoted a paper about 2 months ago

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Paper • 2505.16400 • Published May 22 • 33

upvoted a collection 4 months ago

AceMath-RL

Math reasoning models trained through reinforcement learning (RL) • 1 item • Updated 19 days ago • 4

upvoted a collection 7 months ago

AceMath

We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark. • 11 items • Updated 19 days ago • 14

upvoted a paper 11 months ago

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17, 2024 • 75

upvoted a collection over 1 year ago

Llama3-ChatQA-1.5

Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated 19 days ago • 44