CFBench: A Comprehensive Constraints-Following Benchmark for LLMs Paper ā¢ 2408.01122 ā¢ Published Aug 2, 2024
MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark Paper ā¢ 2408.07543 ā¢ Published Aug 14, 2024
BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline Paper ā¢ 2408.15079 ā¢ Published Aug 27, 2024 ā¢ 52
PAS: Data-Efficient Plug-and-Play Prompt Augmentation System Paper ā¢ 2407.06027 ā¢ Published Jul 8, 2024 ā¢ 8
PAS: Data-Efficient Plug-and-Play Prompt Augmentation System Paper ā¢ 2407.06027 ā¢ Published Jul 8, 2024 ā¢ 8
Clover: Regressive Lightweight Speculative Decoding with Sequential Knowledge Paper ā¢ 2405.00263 ā¢ Published May 1, 2024 ā¢ 14