AI & ML interests

An open source machine learning framework that accelerates the path from research prototyping to production deployment.

Recent Activity

jerryzh168  updated a collection 3 days ago
TorchAO Quantized Qwen3
andrewor14  updated a model 7 days ago
pytorch/Qwen3-8B-QAT-INT4
andrewor14  updated a model 7 days ago
pytorch/gemma-3-12b-it-QAT-INT4
View all activity

abidlabs 
posted an update about 1 month ago
abidlabs 
posted an update 5 months ago
joaogante 
posted an update 5 months ago
view post
Post
685
Let's go! Custom generation code has landed in transformers 🚀

Have you designed a new cool KV cache? Maybe you're comparing new test-time compute ideas you've been researching? Have you found a way to do diffusion with existing models? You can now easily share your findings with the community with custom generation code, sharing the well-known generate interface 🤓

In a nutshell, we have expanded the support of custom modeling code on the Hub with *model-agnostic* custom generation code. Write for one model, reuse with any model -- hopefully, this will democratize access to new generation ideas 🫡

As a creator, you gain the ability to get your ideas in transformers with minimal effort. You'll also have access to all Hub features: a landing page for your creation, discussions, usage metrics, ... 🤓

💎 Resources 💎
- docs: https://huggingface.co/docs/transformers/generation_strategies#custom-decoding-methods
- minimal example: transformers-community/custom_generate_example
- discussion: transformers-community/support#10