Comment on "Accelerating LLM Inference with TGI on Intel Gaudi": "Great work!"
Joonhyung Lee (joonhyung-lee-naver)
AI & ML interests: None yet
Recent Activity
- commented on an article, "Accelerating LLM Inference with TGI on Intel Gaudi" (26 days ago)
- upvoted an article, "Accelerating LLM Inference with TGI on Intel Gaudi" (26 days ago)
- liked a Space, nanotron/ultrascale-playbook (2 months ago)
joonhyung-lee-naver's activity
commented on "Accelerating LLM Inference with TGI on Intel Gaudi" (26 days ago)
upvoted an article (26 days ago)
Article: Accelerating LLM Inference with TGI on Intel Gaudi, by … and 4 others
reacted to regisss's post with 🔥 (2 months ago)
Post:
Nice paper comparing the fp8 inference efficiency of Nvidia H100 and Intel Gaudi2:
An Investigation of FP8 Across Accelerators for LLM Inference (2502.01070)
The conclusion is interesting: "Our findings highlight that the Gaudi 2, by leveraging FP8, achieves higher throughput-to-power efficiency during LLM inference"
One aspect of AI hardware accelerators that is often overlooked is that they can consume less energy than GPUs. It's nice to see researchers starting to carry out experiments to measure this!
Gaudi3 results soon...
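The "throughput-to-power efficiency" figure quoted from the paper is, at its core, tokens generated per second divided by average power draw. A minimal sketch of that metric (the numbers below are made-up illustrative values, not measurements from the paper):

```python
def throughput_per_watt(tokens_generated: int, elapsed_s: float, avg_power_w: float) -> float:
    """Throughput-to-power efficiency: tokens/sec per watt of average power draw."""
    return (tokens_generated / elapsed_s) / avg_power_w

# Hypothetical run: 12,000 tokens in 10 s at an average draw of 600 W.
eff = throughput_per_watt(tokens_generated=12_000, elapsed_s=10.0, avg_power_w=600.0)
print(eff)  # → 2.0 tokens per second per watt
```

In practice the power term would come from the accelerator's telemetry (e.g. averaging periodic power readings over the benchmark window), which is where the H100-vs-Gaudi 2 comparison in the paper differs even at similar raw throughput.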