Cautious Optimizers: Improving Training with One Line of Code Paper • 2411.16085 • Published about 1 month ago • 15
Post 1284 Just for the meme. But the clear lesson I learnt from building these demos is that the more powerful the underlying base model is, the closer you will get to GPT4o1. CoT is nothing more than inducing the latent reasoning capability of the model. kz919/GPT4-O1-Proximas 🚀 6 🔥 2 😎 1
Post 2447 "It's Sunday night, fancy a game?" https://kz919-can-you-beat-405b-in-chess.hf.space/ built with the one and only SN fast API: https://sambanova.ai/fast-api?api_ref=907266 7 replies · 🧠 8 🔥 2
Post 636 Good lord... Spent almost a day debugging this, and it turns out the issue was a gradio update that is incompatible with the new fastapi. https://discuss.huggingface.co/t/huggingface-space-failed-after-working-initially/105514/8 Finally got it back online! Come chat with your favorite anime characters here: kz919/Persona-AI 👀 3
Post 1586 Spent a few minutes building an alternative to Character AI on top of llama3.1 405B through SambaNova's super fast inference API. Space: kz919/Persona-AI API referral link: https://sambanova.ai/fast-api?api_ref=907266 3 replies · 🔥 3 😎 3 🚀 2 🤗 2 🤯 2 🧠 2
Post 1688 The only 405B spaces still freely accessible are powered by SN fast api. xianbao/SambaNova-fast https://sambanova.ai/fast-api?api_ref=907266 👀 6 🔥 4 🤗 2 😎 1
DataComp-LM: In search of the next generation of training sets for language models Paper • 2406.11794 • Published Jun 17 • 50
SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts Paper • 2405.07518 • Published May 13 • 24
Communication Efficient Distributed Training with Distributed Lion Paper • 2404.00438 • Published Mar 30 • 2
Lion Secretly Solves Constrained Optimization: As Lyapunov Predicts Paper • 2310.05898 • Published Oct 9, 2023 • 2
PIE: Simulating Disease Progression via Progressive Image Editing Paper • 2309.11745 • Published Sep 21, 2023 • 3