Tianduo Wang

Tianduo

TianduoWang

AI & ML interests

NLP, representation learning

Recent Activity

upvoted a paper 1 day ago

Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs

Paper • 2506.19290 • Published 2 days ago • 42

upvoted a paper 2 days ago

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published 3 days ago • 45

upvoted a paper 9 days ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published 10 days ago • 239

upvoted 2 papers 27 days ago

On-Policy RL with Optimal Reward Baseline

Paper • 2505.23585 • Published 28 days ago • 14

Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering

Paper • 2505.23604 • Published 28 days ago • 24

upvoted a paper about 1 month ago

From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech Recognition

Paper • 2505.16972 • Published May 22 • 9

commented on a paper about 1 month ago

From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech Recognition

Paper • 2505.16972 • Published May 22 • 9

liked a dataset 3 months ago

agentica-org/DeepScaleR-Preview-Dataset

Viewer • Updated Feb 10 • 40.3k • 5.4k • 131

upvoted a paper 3 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 132

upvoted a paper 5 months ago

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published Feb 6 • 52

liked a dataset 8 months ago

neuralwork/arxiver

Viewer • Updated Nov 1, 2024 • 63.4k • 270 • 362

upvoted a paper 8 months ago

The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

Paper • 2410.12787 • Published Oct 16, 2024 • 32

upvoted a paper 10 months ago

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4, 2024 • 74

upvoted a paper 11 months ago

Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent

Paper • 2407.21646 • Published Jul 31, 2024 • 18

authored a paper 11 months ago

Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

Paper • 2407.18248 • Published Jul 25, 2024 • 34

upvoted a paper 11 months ago

Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

Paper • 2407.18248 • Published Jul 25, 2024 • 34

commented on a paper 11 months ago

Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

Paper • 2407.18248 • Published Jul 25, 2024 • 34

authored 2 papers 11 months ago

Learning Multi-Step Reasoning by Solving Arithmetic Tasks

Paper • 2306.01707 • Published Jun 2, 2023 • 1

TinyLlama: An Open-Source Small Language Model

Paper • 2401.02385 • Published Jan 4, 2024 • 95

upvoted a paper 11 months ago

NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?

Paper • 2407.11963 • Published Jul 16, 2024 • 45
