Spam-T5: Benchmarking Large Language Models for Few-Shot Email Spam Detection Paper • 2304.01238 • Published Apr 3, 2023 • 2
The FinBen: An Holistic Financial Benchmark for Large Language Models Paper • 2402.12659 • Published Feb 20, 2024 • 16
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization Paper • 2402.13249 • Published Feb 20, 2024 • 10