Recent Activity

optimum-nvidia's activity

jeffboudier
posted an update about 1 month ago
Llama4 is out and Scout is already on the Dell Enterprise Hub to deploy on Dell systems 👉 dell.huggingface.co
jeffboudier
posted an update about 1 month ago
Enterprise orgs can now enable serverless Inference Providers for all members
- includes $2 of free usage per org member (e.g. an Enterprise org with 1,000 members shares $2,000 of free credit each month)
- admins can set a monthly spend limit for the entire org
- works today with Together, fal, Novita, Cerebras and HF Inference.

Here's the doc to bill Inference Providers usage to your org: https://huggingface.co/docs/inference-providers/pricing#organization-billing
hlarcher
posted an update 4 months ago
We are introducing multi-backend support in Hugging Face Text Generation Inference!
With the new TGI architecture, we can now plug in new modeling backends to get the best performance for the selected model and the available hardware. This first step will very soon be followed by the integration of new backends (TRT-LLM, llama.cpp, vLLM, Neuron and TPU).

We are polishing the TensorRT-LLM backend, which achieves impressive performance on NVIDIA GPUs. Stay tuned 🤗!

Check out the details: https://huggingface.co/blog/tgi-multi-backend
jeffboudier
posted an update 4 months ago
NVIDIA just announced the Cosmos World Foundation Models, available on the Hub: nvidia/cosmos-6751e884dc10e013a0a0d8e6

Cosmos is a family of pre-trained models purpose-built for generating physics-aware videos and world states to advance physical AI development.
The release includes tokenizers: nvidia/cosmos-tokenizer-672b93023add81b66a8ff8e6

Learn more in this great community article by @mingyuliutw and @PranjaliJoshi https://huggingface.co/blog/mingyuliutw/nvidia-cosmos
jeffboudier
posted an update 5 months ago
jeffboudier
posted an update 7 months ago
jeffboudier
posted an update 8 months ago
Inference Endpoints got a bunch of cool updates yesterday. This is my top 3
jeffboudier
posted an update 8 months ago
Pro Tip - if you're a Firefox user, you can set up HuggingChat as an integrated AI assistant, with contextual links to summarize or simplify any text - handy!

In this short video I show how to set it up
jeffboudier
posted an update about 1 year ago
jeffboudier
posted an update about 1 year ago