Stephen Oates PRO

soates

AI & ML interests

None yet

Recent Activity

upvoted an article 2 days ago

nanoVLM: The simplest repository to train your VLM in pure PyTorch

upvoted an article 28 days ago

Tiny Agents: a MCP-powered agent in 50 lines of code

upvoted a paper about 1 month ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

View all activity

Organizations

None yet

soates's activity

upvoted an article 2 days ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

and 6 others •

4 days ago

• 76

upvoted an article 28 days ago

Article

Tiny Agents: a MCP-powered agent in 50 lines of code

•

30 days ago

• 254

upvoted a paper about 1 month ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 125

upvoted an article about 1 month ago

Article

Gotchas in Tokenizer Behavior Every Developer Should Know

•

Apr 18

• 36

upvoted a collection 2 months ago

Gemma 3

Collection

All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 50 items • Updated 22 days ago • 63

updated a dataset 3 months ago

soates/australian-insurance-pii-dataset-corrected

Viewer • Updated Feb 25 • 1.55k • 14

published a dataset 3 months ago

soates/australian-insurance-pii-dataset-corrected

Viewer • Updated Feb 25 • 1.55k • 14

updated a dataset 3 months ago

soates/australian-insurance-pii-dataset

Viewer • Updated Feb 25 • 1.55k • 10

published a dataset 3 months ago

soates/australian-insurance-pii-dataset

Viewer • Updated Feb 25 • 1.55k • 10

liked a Space 3 months ago

2.62k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted 2 articles 4 months ago

Article

Open-R1: Update #1

and 7 others •

Feb 2

• 305

Article

Open-R1: a fully open reproduction of DeepSeek-R1

and 2 others •

Jan 28

• 860

upvoted a collection 4 months ago

EvaByte

Collection

3 items • Updated Jan 21 • 3

upvoted an article 4 months ago

Article

Mastering Tensor Dimensions in Transformers

•

Jan 12

• 61

upvoted a paper 5 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 368

liked a model 5 months ago

Datou1111/shou_xin

Text-to-Image • Updated Mar 16 • 108 • • 874

upvoted a paper 8 months ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 89

liked a model 8 months ago

lamm-mit/LifeGPT

Updated Sep 19, 2024 • 8

upvoted an article 8 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

and 5 others •

Sep 18, 2024

• 244

liked a Space 9 months ago

119

Open-LLM performances are plateauing, let’s make the leaderboard steep again

🏔

Update leaderboard for fair model evaluation