Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published 2 days ago • 55
Sparse Query Attention (SQA): A Computationally Efficient Attention Mechanism with Query Heads Reduction Paper • 2510.01817 • Published about 1 month ago • 15
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published 26 days ago • 461
Cool stuff these past weeks on huggingface! 🤗 🚀
• 📈 Trackio, local-first W&B alternative: https://github.com/gradio-app/trackio/issues
• 🌍 EmbeddingGemma, 300M-param, multilingual embeddings, on-device: https://huggingface.co/blog/embeddinggemma
• 💻 Open LLMs in VS Code (Inference Providers): https://x.com/reach_vb/status/1966185427582497171
• 🤖 Smol2Operator GUI agents: https://huggingface.co/blog/smol2operator
• 🖼️ Gradio visible watermarking: https://huggingface.co/blog/watermarking-with-gradio