Matricardi Fabio

FM-1976

16 42 501

https://medium.com/@fabio.matricardi

AI & ML interests

Control system engineering, AI, and LLMs with Python. ThePoorGPUguy on Substack.

Recent Activity

liked a model about 18 hours ago

King3Djbl/mythos-9b-merged

reacted to constannnt's post with ❤️ 14 days ago

We are excited to announce Sipp.sh: a high-performance library for running AI inference locally and in the cloud through a unified API.

We began to realize that an LLM isn't just a chat interface for information retrieval. It can be integrated directly into web, games, or productivity apps to handle continuous monitoring and decision-making. It can act as a sort of "second brain," the silent hand that guides and helps a user without them even realizing it. We see this as the next frontier of UX design, but this is only possible if developers have access to low-cost, zero-latency compute and absolute data privacy.

That's why we created Sipp. It's an opinionated library that lets developers integrate local AI into any application, giving them the superpowers to completely rethink user experiences across the web, games, and desktop.

To achieve this, we built an entirely new stack in Rust and C++, working alongside the llama.cpp project. Through our work, we were able to contribute back to that community to help upgrade the GGML WebGPU backend. This deep optimization is what enables our fast, responsive decode speeds directly in the browser. Sipp ships as a zero-dependency library for desktop and web, achieving 3x to 5x speedup in token decode compared to popular alternatives.

We are already seeing some incredible use cases emerge from this, from continuous monitoring using local vision to the dynamic generation of game elements in a real-time wizard vs. wizard game.

The best part? It's fully open-source! We see this as the start of a dialogue about what the future of user interaction is going to look like, and we built Sipp to lay the foundation for that exciting future. Check out the live demos on our site, run your own benchmarks, or come hang out with us in our Discord.

Website: https://www.sipp.sh/
GitHub: https://github.com/noumena-labs/Sipp

liked a model 24 days ago

mradermacher/Tool-Star-Qwen-3B-GGUF


Organizations

None yet

liked a model about 18 hours ago

King3Djbl/mythos-9b-merged

Text Generation • 8B • Updated 4 days ago • 1.52k • 3

liked 4 models 24 days ago

liked a model 27 days ago

bartowski/nex-agi_Nex-N2-mini-GGUF

Image-Text-to-Text • 35B • Updated 29 days ago • 50.4k • 30

liked a model 29 days ago

mradermacher/Salience-1-9B-GGUF

8B • Updated 2 days ago • 1.19k • 5

liked 10 models about 1 month ago

CohereLabs/North-Mini-Code-1.0

Text Generation • 30B • Updated 25 days ago • 36.2k • 517

Arki05/North-Mini-Code-1.0-GGUF

Text Generation • 30B • Updated 21 days ago • 4.44k • 8

ai-sage/GigaChat3.1-10B-A1.8B-GGUF

Text Generation • 11B • Updated Mar 25 • 3.84k • 77

silx-ai/Quasar-Preview

Text Generation • 17B • Updated 21 days ago • 5.72k • 93

ideogram-ai/ideogram-4-nf4

Text-to-Image • Updated Jun 4 • 5.89k • 424

Green-Sky/bonsai-image-binary-4B-GGUF

Text-to-Image • 4B • Updated about 5 hours ago • 1.75k • 13

mradermacher/LMT-60-0.6B-GGUF

0.8B • Updated Jun 2 • 624 • 6

JetBrains/Mellum2-12B-A2.5B-Thinking

Text Generation • 12B • Updated 28 days ago • 9.78k • 318

huihui-ai/Huihui4-8B-A4B

Image-Text-to-Text • 9B • Updated Apr 27 • 150 • 19

Jackrong/Qwopus3.5-4B-Coder

Text Generation • 5B • Updated May 28 • 9.33k • 18

liked 3 models about 2 months ago

openbmb/MiniCPM5-1B

Text Generation • 1B • Updated May 26 • 363k • 890

nvidia/Nemotron-Cascade-8B-Thinking

Text Generation • 8B • Updated Jan 1 • 325 • 41

nvidia/NVIDIA-Nemotron-3-Nano-4B-GGUF

Text Generation • 4B • Updated Mar 16 • 14.9k • 172
