BenchHub: A Unified Benchmark Suite for Holistic and Customizable LLM Evaluation Paper • 2506.00482 • Published 8 days ago • 8
Hard Negative Mining for Domain-Specific Retrieval in Enterprise Systems Paper • 2505.18366 • Published 15 days ago • 25
FS-DAG: Few Shot Domain Adapting Graph Networks for Visually Rich Document Understanding Paper • 2505.17330 • Published 16 days ago • 22
Can LLMs faithfully generate their layperson-understandable 'self'?: A Case Study in High-Stakes Domains Paper • 2412.07781 • Published Nov 25, 2024 • 2
SweEval: Do LLMs Really Swear? A Safety Benchmark for Testing Limits for Enterprise Use Paper • 2505.17332 • Published 16 days ago • 31
view article Article Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B By asoria and 3 others • Apr 4, 2024 • 28
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) By natolambert and 3 others • Dec 9, 2022 • 267
InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes Paper • 2401.05335 • Published Jan 10, 2024 • 30
Make-A-Character: High Quality Text-to-3D Character Generation within Minutes Paper • 2312.15430 • Published Dec 24, 2023 • 29
Gemini: A Family of Highly Capable Multimodal Models Paper • 2312.11805 • Published Dec 19, 2023 • 45
HAAR: Text-Conditioned Generative Model of 3D Strand-based Human Hairstyles Paper • 2312.11666 • Published Dec 18, 2023 • 13