Lize Pirenne

Inversta

Pangasius

AI & ML interests

LLMs, RL

Recent Activity

liked a model 10 days ago

black-forest-labs/FLUX.2-klein-4B

upvoted a paper 10 days ago

mHC: Manifold-Constrained Hyper-Connections

upvoted a paper 10 days ago

Scaling Laws for Code: Every Programming Language Matters

View all activity

Organizations

None yet

upvoted 3 papers 10 days ago

upvoted a paper about 2 months ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 254

upvoted 8 papers 2 months ago

ROOT: Robust Orthogonalized Optimizer for Neural Network Training

Paper • 2511.20626 • Published Nov 25, 2025 • 43

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published Nov 11, 2025 • 107

Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising

Paper • 2511.08633 • Published Nov 9, 2025 • 55

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12, 2025 • 126

One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models

Paper • 2511.10629 • Published Nov 13, 2025 • 127

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 133

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published Nov 12, 2025 • 208

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 223

upvoted 8 papers 4 months ago

Reinforcement Learning on Pre-Training Data

Paper • 2509.19249 • Published Sep 23, 2025 • 67

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 190

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10, 2025 • 662

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21, 2025 • 90

DINOv3

Paper • 2508.10104 • Published Aug 13, 2025 • 293

Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation

Paper • 2507.10524 • Published Jul 14, 2025 • 71

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 316

Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency

Paper • 2506.08343 • Published Jun 10, 2025 • 54

Lize Pirenne

AI & ML interests

Recent Activity

Organizations

Inversta's activity