8 211 9

Robin Williams PRO

bfuzzy1

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

upvoted an article about 2 months ago

SmolLM3: smol, multilingual, long-context reasoner

upvoted a collection about 2 months ago

Encoders vs Decoders: the Ettin Suite

View all activity

Organizations

None yet

Collections 12

View 12 collections

models 10

datasets 2

bfuzzy1/gunny_v2_solo_dolo

Viewer • Updated Oct 10, 2024 • 2.9k • 20 • 1

bfuzzy1/gunny_x

Viewer • Updated Oct 1, 2024 • 10k • 12 • 3

Robin Williams PRO

AI & ML interests

Recent Activity

Organizations

Collections 12

bfuzzy1/acheron-m

bfuzzy1/acheron-m1a-llama

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Deliberation in Latent Space via Differentiable Cache Augmentation

Outcome-Refining Process Supervision for Code Generation

bfuzzy1/acheron-m

bfuzzy1/acheron-m1a-llama

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Deliberation in Latent Space via Differentiable Cache Augmentation

Outcome-Refining Process Supervision for Code Generation

models 10

bfuzzy1/acheron-m1a-llama

bfuzzy1/acheron-m

bfuzzy1/acheron-d

bfuzzy1/llambses-1

bfuzzy1/acheron-o9

bfuzzy1/acheron

bfuzzy1/acheron-c

bfuzzy1/Gunny

bfuzzy1/llambses-1_4bit

bfuzzy1/acheron-x

datasets 2

bfuzzy1/gunny_v2_solo_dolo

bfuzzy1/gunny_x

Robin Williams PRO

AI & ML interests

Recent Activity

Organizations

Collections 12

models 10 Sort: Recently updated

datasets 2 Sort: Recently updated

models 10

datasets 2