Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report Paper • 2508.01059 • Published 7 days ago • 26
Taming Polysemanticity in LLMs: Provable Feature Recovery via Sparse Autoencoders Paper • 2506.14002 • Published Jun 16 • 6
Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation Paper • 2502.16707 • Published Feb 23 • 13
LLM-Reasoning (training) Collection LLM reasoning papers, with RL and long COT. (Post)Training of LLM is involved. • 6 items • Updated Feb 16
LLM-Reasoning (training) Collection LLM reasoning papers, with RL and long COT. (Post)Training of LLM is involved. • 6 items • Updated Feb 16
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information? Paper • 2412.02611 • Published Dec 3, 2024 • 24
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information? Paper • 2412.02611 • Published Dec 3, 2024 • 24
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning Paper • 2207.14800 • Published Jul 29, 2022