fdsqefsgergd

T-representer

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

upvoted a paper 1 day ago

Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth

upvoted a paper 2 days ago

Few-step Flow for 3D Generation via Marginal-Data Transport Distillation

Organizations

None yet

upvoted 2 papers 1 day ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published 6 days ago • 60

Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth

Paper • 2509.03867 • Published 3 days ago • 170

upvoted 6 papers 2 days ago

Few-step Flow for 3D Generation via Marginal-Data Transport Distillation

Paper • 2509.04406 • Published 3 days ago • 8

Transition Models: Rethinking the Generative Learning Objective

Paper • 2509.04394 • Published 3 days ago • 21

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Paper • 2509.04292 • Published 3 days ago • 45

Towards a Unified View of Large Language Model Post-Training

Paper • 2509.04419 • Published 3 days ago • 54

From Editor to Dense Geometry Estimator

Paper • 2509.04338 • Published 3 days ago • 73

DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks

Paper • 2509.01396 • Published 6 days ago • 47

upvoted 5 papers 3 days ago

MOSAIC: Multi-Subject Personalized Generation via Correspondence-Aware Alignment and Disentanglement

Paper • 2509.01977 • Published 5 days ago • 9

Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation

Paper • 2509.00428 • Published 8 days ago • 11

LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations

Paper • 2509.03405 • Published 4 days ago • 17

Open Data Synthesis For Deep Research

Paper • 2509.00375 • Published 8 days ago • 52

Robix: A Unified Model for Robot Interaction, Reasoning and Planning

Paper • 2509.01106 • Published 6 days ago • 40

upvoted 7 papers 4 days ago

Kwai Keye-VL 1.5 Technical Report

Paper • 2509.01563 • Published 6 days ago • 30

POINTS-Reader: Distillation-Free Adaptation of Vision-Language Models for Document Conversion

Paper • 2509.01215 • Published 6 days ago • 42

ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding

Paper • 2508.21496 • Published 9 days ago • 53

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published 7 days ago • 74

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published 5 days ago • 104

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published 5 days ago • 76

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published 5 days ago • 155
