LeGrad: An Explainability Method for Vision Transformers via Feature Formation Sensitivity Paper • 2404.03214 • Published Apr 4, 2024 • 3
VisRes Bench: On Evaluating the Visual Reasoning Capabilities of VLMs Paper • 2512.21194 • Published Dec 24, 2025
Post 832 I like these models: nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16 and nvidia/NVIDIA-Nemotron-3-Nano-4B-FP8, and also TradingAgents: Multi-Agents LLM Financial Trading Framework (2412.20138, https://arxiv.org/abs/2412.20138) and mlabonne/FineTome-100k
SwiftTailor: Efficient 3D Garment Generation with Geometry Image Representation Paper • 2603.19053 • Published Mar 19 • 2
BBQ-to-Image: Numeric Bounding Box and Qolor Control in Large-Scale Text-to-Image Models Paper • 2602.20672 • Published Feb 24 • 10
BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data Paper • 2510.10159 • Published Oct 11, 2025 • 3
Measuring what Matters: Construct Validity in Large Language Model Benchmarks Paper • 2511.04703 • Published Nov 3, 2025 • 8
AMoE: Agglomerative Mixture-of-Experts Vision Foundation Model Paper • 2512.20157 • Published Dec 23, 2025 • 5
YaPO: Learnable Sparse Activation Steering Vectors for Domain Adaptation Paper • 2601.08441 • Published Jan 13 • 8
SonicMoE: Accelerating MoE with IO and Tile-aware Optimizations Paper • 2512.14080 • Published Dec 16, 2025 • 9
HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing Paper • 2111.15666 • Published Nov 30, 2021
Self-Distilled StyleGAN: Towards Generation from Internet Photos Paper • 2202.12211 • Published Feb 24, 2022
Prompt-to-Prompt Image Editing with Cross Attention Control Paper • 2208.01626 • Published Aug 2, 2022 • 3
Text-Only Training for Image Captioning using Noise-Injected CLIP Paper • 2211.00575 • Published Nov 1, 2022
Null-text Inversion for Editing Real Images using Guided Diffusion Models Paper • 2211.09794 • Published Nov 17, 2022
Image Generation from Contextually-Contradictory Prompts Paper • 2506.01929 • Published Jun 2, 2025 • 2