shijie xia
seven-cat
AI & ML interests
LLMs
Recent Activity
upvoted a paper about 1 month ago
OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling
upvoted a paper about 2 months ago
ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm Engineering
upvoted a paper 2 months ago
Thinking with Generated Images