Joao Gante's picture

Joao Gante

joaogante

AI & ML interests

None yet

Recent Activity

Organizations

Hugging Face's profile picture Hugging Face Internal Testing Organization's profile picture pytorch's profile picture Hugging Face OSS Metrics's profile picture gg-hf's profile picture Social Post Explorers's profile picture Hugging Face Discord Community's profile picture LLHF's profile picture SLLHF's profile picture Hugging Quants's profile picture nltpt's profile picture mv's profile picture Transformers Community's profile picture Inference Endpoints Images's profile picture

Posts 4

view post
Post
352
Let's go! Custom generation code has landed in transformers πŸš€

Have you designed a new cool KV cache? Maybe you're comparing new test-time compute ideas you've been researching? Have you found a way to do diffusion with existing models? You can now easily share your findings with the community with custom generation code, sharing the well-known generate interface πŸ€“

In a nutshell, we have expanded the support of custom modeling code on the Hub with *model-agnostic* custom generation code. Write for one model, reuse with any model -- hopefully, this will democratize access to new generation ideas 🫑

As a creator, you gain the ability to get your ideas in transformers with minimal effort. You'll also have access to all Hub features: a landing page for your creation, discussions, usage metrics, ... πŸ€“

πŸ’Ž Resources πŸ’Ž
- docs: https://huggingface.co/docs/transformers/generation_strategies#custom-decoding-methods
- minimal example: transformers-community/custom_generate_example
- discussion: transformers-community/support#10

Articles 8

Article
18

Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques