3 11 22

j2

ej2

AI & ML interests

None yet

Recent Activity

updated a model 3 days ago

ej2/Holmes_moe_history

upvoted an article about 1 month ago

Aligning to What? Rethinking Agent Generalization in MiniMax M2

upvoted an article about 1 month ago

What makes good reasoning data

View all activity

Organizations

None yet

upvoted 3 articles about 1 month ago

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Oct 30, 2025

•

Article

What makes good reasoning data

Oct 30, 2025

•

Article

Why Did MiniMax M2 End Up as a Full Attention Model?

Oct 30, 2025

•

upvoted a collection about 2 months ago

Qwen3

Collection

84 items • Updated Dec 31, 2025 • 1.62k

upvoted a collection 4 months ago

RWKV World v3 Corpus

Collection

RWKV World v3.0 Dataset for training RWKV-7 Goose World v3 models • 64 items • Updated Mar 9, 2025 • 3

upvoted a paper 8 months ago

3D Gaussian Splatting for Real-Time Radiance Field Rendering

Paper • 2308.04079 • Published Aug 8, 2023 • 194

upvoted an article 8 months ago

Article

Introduction to 3D Gaussian Splatting

Sep 18, 2023

•

127

upvoted a paper 10 months ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30, 2025 • 61

upvoted a paper 11 months ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 160

upvoted an article 11 months ago

Article

Fine-tuning Llama 2 70B using PyTorch FSDP

Sep 13, 2023

•

upvoted a collection about 1 year ago

"Physics of Language Models" series

Collection

7 items • Updated Dec 22, 2025 • 53

j2

AI & ML interests

Recent Activity

Organizations

ej2's activity

Aligning to What? Rethinking Agent Generalization in MiniMax M2

What makes good reasoning data

Why Did MiniMax M2 End Up as a Full Attention Model?

Introduction to 3D Gaussian Splatting

Fine-tuning Llama 2 70B using PyTorch FSDP