-
Taming LLMs by Scaling Learning Rates with Gradient Grouping
Paper β’ 2506.01049 β’ Published β’ 38 -
Switch EMA: A Free Lunch for Better Flatness and Sharpness
Paper β’ 2402.09240 β’ Published β’ 5 -
Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning
Paper β’ 2410.06373 β’ Published β’ 36 -
OpenMixup: Open Mixup Toolbox and Benchmark for Visual Representation Learning
Paper β’ 2209.04851 β’ Published β’ 3
Juanxi Tian
Juanxi
AI & ML interests
Efficient AI & Gen AI
Recent Activity
upvoted
a
paper
1 day ago
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
upvoted
a
paper
2 days ago
Latent Implicit Visual Reasoning