Joao Gante's picture

Joao Gante

joaogante

AI & ML interests

None yet

Recent Activity

Organizations

Hugging Face's profile picture Hugging Face Internal Testing Organization's profile picture pytorch's profile picture Hugging Face OSS Metrics's profile picture gg-hf's profile picture Social Post Explorers's profile picture Hugging Face Discord Community's profile picture LLHF's profile picture SLLHF's profile picture Hugging Quants's profile picture nltpt's profile picture mv's profile picture Transformers Community's profile picture Inference Endpoints Images's profile picture

joaogante's activity

upvoted an article 1 day ago
view article
Article

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

By celinah and 3 others β€’
β€’ 42
upvoted 2 changelogs 1 day ago
view changelog
Changelog

Filter by MCP compatibility available in HF Spaces

β€’ 57
view changelog
Changelog

AI-generated Abstract summaries on Hugging Face Papers

β€’ 49
posted an update 3 days ago
view post
Post
354
Let's go! Custom generation code has landed in transformers πŸš€

Have you designed a new cool KV cache? Maybe you're comparing new test-time compute ideas you've been researching? Have you found a way to do diffusion with existing models? You can now easily share your findings with the community with custom generation code, sharing the well-known generate interface πŸ€“

In a nutshell, we have expanded the support of custom modeling code on the Hub with *model-agnostic* custom generation code. Write for one model, reuse with any model -- hopefully, this will democratize access to new generation ideas 🫑

As a creator, you gain the ability to get your ideas in transformers with minimal effort. You'll also have access to all Hub features: a landing page for your creation, discussions, usage metrics, ... πŸ€“

πŸ’Ž Resources πŸ’Ž
- docs: https://huggingface.co/docs/transformers/generation_strategies#custom-decoding-methods
- minimal example: transformers-community/custom_generate_example
- discussion: transformers-community/support#10
upvoted an article 5 days ago
view article
Article

Vision Language Models (Better, Faster, Stronger)

By merve and 4 others β€’
β€’ 366
New activity in nanotron/ultrascale-playbook about 1 month ago

Incorrect link in Data Parallelism?

#108 opened about 1 month ago by
joaogante
upvoted an article about 1 month ago
view article
Article

Introducing Pull Requests and Discussions πŸ₯³

By victor β€’
β€’ 13