
Trolle Karlsson PRO

trollek

hosteren

AI & ML interests

Stable Diffusion and small language models, but I'm a curious fella.

Recent Activity

new activity 7 days ago

trollek/NinjaMouse-3B-40L-danube:training time

upvoted a paper 11 days ago

Training Large Language Models to Reason in a Continuous Latent Space

upvoted a paper 11 days ago

Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation


Organizations

None yet

upvoted 2 papers 11 days ago

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 89

Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation

Paper • 2507.10524 • Published 16 days ago • 63

upvoted 2 papers 3 months ago

BookWorld: From Novels to Interactive Agent Societies for Creative Story Generation

Paper • 2504.14538 • Published Apr 20 • 29

APOLLO: SGD-like Memory, AdamW-level Performance

Paper • 2412.05270 • Published Dec 6, 2024 • 39

upvoted a paper 6 months ago

SnakModel: Lessons Learned from Training an Open Danish Large Language Model

Paper • 2412.12956 • Published Dec 17, 2024 • 2

upvoted a paper 9 months ago

Cut Your Losses in Large-Vocabulary Language Models

Paper • 2411.09009 • Published Nov 13, 2024 • 50

upvoted a paper 12 months ago

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12, 2024 • 70

upvoted 2 articles about 1 year ago

Uncensor any LLM with abliteration

Article • Jun 13, 2024 • 633

⚗️ 🔥 Building High-Quality Datasets with distilabel and Prometheus 2

Article • Jun 3, 2024 • 27

upvoted a paper over 1 year ago

PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training

Paper • 2309.10400 • Published Sep 19, 2023 • 26

upvoted an article over 1 year ago

GaLore: Advancing Large Model Training on Consumer-grade Hardware

Article • and 8 others • Mar 20, 2024 • 29

upvoted 3 papers over 1 year ago
