daje kang's picture

4 8 37

daje kang

daje

·

daje0601

AI & ML interests

None yet

Organizations

upvoted an article 2 months ago

Article

KV Cache from scratch in nanoVLM

By

and 4 others •

Jun 4

• 89

upvoted a collection 4 months ago

Qwen3

84 items • Updated 7 days ago • 1.08k

upvoted 2 papers 7 months ago

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published Jan 8 • 92

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 127

upvoted 3 papers about 1 year ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 98

MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning

Paper • 2406.17770 • Published Jun 25, 2024 • 19

LongIns: A Challenging Long-context Instruction-based Exam for LLMs

Paper • 2406.17588 • Published Jun 25, 2024 • 23

upvoted a paper over 1 year ago

Cached Transformers: Improving Transformers with Differentiable Memory Cache

Paper • 2312.12742 • Published Dec 20, 2023 • 14