3 40 67

Chao Zhou

ASHIDAKA

AI & ML interests

Object Detection, Transformer

Recent Activity

liked a model 5 days ago

deepseek-ai/DeepSeek-V3.1

upvoted a paper 19 days ago

SRDiffusion: Accelerate Video Diffusion Inference via Sketching-Rendering Cooperation

liked a model 20 days ago

openai/gpt-oss-120b

View all activity

Organizations

None yet

liked a model 5 days ago

deepseek-ai/DeepSeek-V3.1

Text Generation • 685B • Updated 4 days ago • 21.4k • • 559

upvoted a paper 19 days ago

SRDiffusion: Accelerate Video Diffusion Inference via Sketching-Rendering Cooperation

Paper • 2505.19151 • Published May 25 • 2

liked a model 20 days ago

openai/gpt-oss-120b

Text Generation • 120B • Updated 12 days ago • 1.65M • • 3.59k

upvoted a paper 23 days ago

RecGPT Technical Report

Paper • 2507.22879 • Published 26 days ago • 36

New activity in Kratos-AI/KAI_handwriting-ocr 23 days ago

Botted likes

👍 2

#1 opened 23 days ago by

Delta-Vector

liked a model 29 days ago

Wan-AI/Wan2.2-I2V-A14B

Image-to-Video • Updated 19 days ago • 11.3k • • 252

liked a model 2 months ago

ibm-ai-platform/llama3-8b-accelerator

3B • Updated May 15, 2024 • 415 • 18

liked a Space 4 months ago

3.12k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted a collection 5 months ago

Kimi-VL-A3B

Collection

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 7 items • Updated Jul 1 • 75

liked a dataset 5 months ago

nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated May 8 • 3.91M • 7.88k • 562

upvoted an article 6 months ago

Article

Open R1: Update #3

and 9 others •

Mar 11

• 295

upvoted 3 papers 6 months ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 146

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 106

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 200

liked a dataset 7 months ago

open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18 • 450k • 13k • 640

upvoted 3 articles 7 months ago

Article

Open R1: Update #2

and 6 others •

Feb 10

• 218

Article

Open-R1: Update #1

and 7 others •

Feb 2

• 305

Article

How to train a Language Model with Megatron-LM

•

Sep 7, 2022

• 18

liked a model 8 months ago

facebook/multi-token-prediction

Updated Jun 18, 2024 • 371

liked a dataset 8 months ago

allenai/dolma

Updated Apr 17, 2024 • 744 • 933

Chao Zhou

AI & ML interests

Recent Activity

Organizations

ASHIDAKA's activity

Botted likes

The Ultra-Scale Playbook

Open R1: Update #3

Open R1: Update #2

Open-R1: Update #1

How to train a Language Model with Megatron-LM