4 18 8

Roxanna

borntobeignored

AI & ML interests

None yet

Recent Activity

upvoted an article 29 days ago

20x Faster TRL Fine-tuning with RapidFire AI

upvoted an article 29 days ago

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

upvoted an article 29 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

View all activity

Organizations

upvoted 4 articles 29 days ago

Article

20x Faster TRL Fine-tuning with RapidFire AI

Nov 21, 2025

•

Article

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

Apr 29, 2025

•

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28, 2025

•

887

Article

🚀 Build a Qwen 2.5 VL API endpoint with Hugging Face spaces and Docker!

Jan 29, 2025

•

upvoted a paper about 1 month ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 108

published a Space about 1 month ago

RLagents

🚀

yoyoyo

upvoted 3 articles 2 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Sep 11, 2025

•

176

Article

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Aug 18, 2025

•

Article

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

Aug 8, 2025

•

liked a Space 4 months ago

AgentSeer

🔍

upvoted a paper 4 months ago

Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation?

Paper • 2508.19827 • Published Aug 27, 2025 • 33

liked a dataset 4 months ago

ai-safety-institute/AgentHarm

Viewer • Updated Dec 19, 2024 • 468 • 6.17k • 45

commented a paper 5 months ago

Memp: Exploring Agent Procedural Memory

Paper • 2508.06433 • Published Aug 8, 2025 • 35 •

upvoted a paper 5 months ago

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5, 2025 • 121

upvoted an article 5 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8, 2025

•

741

commented a paper 5 months ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3, 2025 • 113 •

liked a dataset 5 months ago

samuelyeh/HalluEntity

Viewer • Updated Apr 24, 2025 • 157 • 123 • 2

liked a model 5 months ago

Wan-AI/Wan2.2-T2V-A14B

Text-to-Video • Updated Aug 7, 2025 • 4.22k • • 385

commented a paper 5 months ago

Taming Polysemanticity in LLMs: Provable Feature Recovery via Sparse Autoencoders

Paper • 2506.14002 • Published Jun 16, 2025 • 5 •

liked a Space 5 months ago

Monet 1.4B Experts Viewer

🔍

Show expert routing examples

Roxanna

AI & ML interests

Recent Activity

Organizations

borntobeignored's activity

20x Faster TRL Fine-tuning with RapidFire AI

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

Open-R1: a fully open reproduction of DeepSeek-R1

🚀 Build a Qwen 2.5 VL API endpoint with Hugging Face spaces and Docker!

RLagents

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

AgentSeer

SmolLM3: smol, multilingual, long-context reasoner

Monet 1.4B Experts Viewer