view article Article Saying Thank You to a LLM Isn't Free — Measuring the Energy Cost of Politeness By jdelavande and 2 others • 13 days ago • 18
view article Article Accelerating LLM Inference with TGI on Intel Gaudi By baptistecolle and 4 others • Mar 28 • 13
view article Article Organizing a Privacy-preserving Hackathon By binoua and 1 other • Oct 17, 2024 • 9
view article Article Accelerating Vision-Language Models: BridgeTower on Habana Gaudi2 By regisss and 1 other • Jun 29, 2023 • 2
view article Article Fast Inference on Large Language Models: BLOOMZ on Habana Gaudi2 Accelerator By regisss • Mar 28, 2023 • 1
view article Article Faster Training and Inference: Habana Gaudi®2 vs Nvidia A100 80GB By regisss • Dec 14, 2022 • 2