From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models Paper • 2504.06214 • Published Apr 8
AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning Paper • 2505.16400 • Published May 22 • 33
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation Paper • 2502.13128 • Published Feb 18 • 42
Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study Paper • 2304.06762 • Published Apr 13, 2023 • 1
CrossNER: Evaluating Cross-Domain Named Entity Recognition Paper • 2012.04373 • Published Dec 8, 2020
ChatQA: Building GPT-4 Level Conversational QA Models Paper • 2401.10225 • Published Jan 18, 2024 • 37
Multi-Stage Prompting for Knowledgeable Dialogue Generation Paper • 2203.08745 • Published Mar 16, 2022
RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs Paper • 2407.02485 • Published Jul 2, 2024 • 5
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities Paper • 2407.14482 • Published Jul 19, 2024 • 27