Shijie Geng's picture

24 6

Shijie Geng

makitanikaze

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Visual-RFT: Visual Reinforcement Fine-Tuning

upvoted a paper 14 days ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

upvoted a paper 20 days ago

An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging

View all activity

Organizations

None yet

makitanikaze's activity

upvoted a paper 4 days ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published 6 days ago • 59

upvoted a paper 14 days ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 21 days ago • 141

upvoted 8 papers 20 days ago

An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging

Paper • 2502.09056 • Published 24 days ago • 30

Distillation Scaling Laws

Paper • 2502.08606 • Published 25 days ago • 46

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents

Paper • 2502.09560 • Published 24 days ago • 33

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published 24 days ago • 32

Diverse Inference and Verification for Advanced Reasoning

Paper • 2502.09955 • Published 23 days ago • 16

MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

Paper • 2502.10391 • Published 23 days ago • 31

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Paper • 2502.10248 • Published 23 days ago • 51

Large Language Diffusion Models

Paper • 2502.09992 • Published 23 days ago • 98

upvoted 9 papers 23 days ago

Improving Video Generation with Human Feedback

Paper • 2501.13918 • Published Jan 23 • 49

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24 • 65

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 108

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published Feb 5 • 57

Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2

Paper • 2502.03544 • Published Feb 5 • 43

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published 30 days ago • 121

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 27 days ago • 142

Retrieval-augmented Large Language Models for Financial Time Series Forecasting

Paper • 2502.05878 • Published 28 days ago • 39

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 24 days ago • 143

upvoted a paper 29 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 109