12 29 22

Justin Zhao PRO

justinxzhao

AI & ML interests

None yet

Recent Activity

updated a dataset 6 days ago

justinxzhao/hf_daily_papers

liked a Space 15 days ago

togethercomputer/FutureBench

updated a dataset 2 months ago

justinxzhao/hf_daily_papers

View all activity

Organizations

upvoted 6 papers 4 months ago

upvoted a paper 7 months ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 298

upvoted a paper 9 months ago

GPT-4o System Card

Paper • 2410.21276 • Published Oct 25, 2024 • 85

upvoted a paper 10 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 180

upvoted 3 papers 11 months ago

To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Paper • 2409.12183 • Published Sep 18, 2024 • 40

Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models

Paper • 2408.02442 • Published Aug 5, 2024 • 21

Foundation Models for Music: A Survey

Paper • 2408.14340 • Published Aug 26, 2024 • 45

upvoted an article 12 months ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

•

Aug 19, 2024

• 77

upvoted a paper 12 months ago

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12, 2024 • 74

upvoted an article about 1 year ago

Article

The Rise of Agentic Data Generation

•

Jul 15, 2024

• 83

upvoted a paper about 1 year ago

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2, 2024 • 124

upvoted 2 articles about 1 year ago

Article

MMLU-Pro-NoMath

•

Jul 11, 2024

• 4

Article

Our Transformers Code Agent beats the GAIA benchmark!

and 1 other •

Jul 1, 2024

• 94

upvoted a paper about 1 year ago

Language Model Council: Benchmarking Foundation Models on Highly Subjective Tasks by Consensus

Paper • 2406.08598 • Published Jun 12, 2024 • 6

upvoted a collection about 1 year ago

Qwen1.5

Collection

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated 12 days ago • 209

Justin Zhao PRO

AI & ML interests

Recent Activity

Organizations

justinxzhao's activity

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

The Rise of Agentic Data Generation

MMLU-Pro-NoMath

Our Transformers Code Agent beats the GAIA benchmark!