Sebastian Gabarain

Locutusque

SebastianG74019

AI & ML interests

Pushing performance in small language models

Recent Activity


upvoted a paper about 4 hours ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published 2 days ago • 55

updated 2 datasets 8 days ago

Locutusque/liberalis-cogitator

Viewer • Updated 8 days ago • 1.1M • 318 • 1

Locutusque/Medical-R1-Distill-Data-ShareGPT

Viewer • Updated 8 days ago • 22k • 14

published a dataset 8 days ago

Locutusque/Medical-R1-Distill-Data-ShareGPT

Viewer • Updated 8 days ago • 22k • 14

upvoted a paper 11 days ago

Sparse Query Attention (SQA): A Computationally Efficient Attention Mechanism with Query Heads Reduction

Paper • 2510.01817 • Published about 1 month ago • 15

liked a dataset 13 days ago

karpathy/fineweb-edu-100b-shuffle

Viewer • Updated Sep 25 • 97.2M • 50.6k • 121

upvoted a paper 17 days ago

Robot Learning: A Tutorial

Paper • 2510.12403 • Published 18 days ago • 96

upvoted a paper 21 days ago

GCPO: When Contrast Fails, Go Gold

Paper • 2510.07790 • Published 24 days ago • 5

updated a Space 22 days ago

Locutusque Models

🏃

Generate text responses using various language models

upvoted a paper 24 days ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published 26 days ago • 461

reacted to lunarflu's post with 🔥 26 days ago

Cool stuff these past weeks on huggingface! 🤗 🚀 !
• 📈Trackio, local-first W&B alternative
https://github.com/gradio-app/trackio/issues
• 🌍EmbeddingGemma, 300M-param, multilingual embeddings, on-device
https://huggingface.co/blog/embeddinggemma
• 💻Open LLMs in VS Code (Inference Providers)
https://x.com/reach_vb/status/1966185427582497171
• 🤖Smol2Operator GUI agents
https://huggingface.co/blog/smol2operator
• 🖼️Gradio visible watermarking
https://huggingface.co/blog/watermarking-with-gradio

liked a model 29 days ago

inclusionAI/Ring-1T-preview

Text Generation • 1000B • Updated 10 days ago • 1.64k • 268

updated a dataset 29 days ago

Locutusque/Hermes-3-shuffled

Viewer • Updated 29 days ago • 959k • 27

published a dataset 29 days ago

Locutusque/Hermes-3-shuffled

Viewer • Updated 29 days ago • 959k • 27

updated a dataset 29 days ago

Locutusque/lmsys-best-2

Viewer • Updated 29 days ago • 26.9k • 52

published a dataset 29 days ago

Locutusque/lmsys-best-2

Viewer • Updated 29 days ago • 26.9k • 52

published a model 29 days ago

Locutusque/CollectiveLM-Falcon-3-7B

Text Generation • 7B • Updated Jan 8 • 6 • 1

liked 2 datasets about 1 month ago

openai/gdpval

Viewer • Updated Sep 25 • 220 • 27.6k • 250

DSULT-Core/ShareGPT-X

Viewer • Updated Sep 27 • 109k • 139 • 16

liked a model about 1 month ago

Qwen/Qwen3-Omni-30B-A3B-Instruct

Any-to-Any • 35B • Updated Sep 22 • 325k • 690
