Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

KVCache.ai

community
Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

zhang-mingxing  authored a paper about 1 month ago
Efficient and Economic Large Language Model Inference with Attention Offloading
zhang-mingxing  authored a paper about 1 month ago
Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving
zhang-mingxing  authored a paper about 1 month ago
MoBA: Mixture of Block Attention for Long-Context LLMs
View all activity

A's profile picture ZHANG Mingxing's profile picture Bin CHEN's profile picture boxin's profile picture qiu chengyu's profile picture unicorn chan's profile picture

models 5

KVCache-ai/Kimi-K2-Instruct-0905-GGUF

1T • Updated Sep 5 • 37 • 1

KVCache-ai/Kimi-K2-Instruct-GGUF

1T • Updated Jul 12 • 64 • 18

KVCache-ai/Qwen3-30BA3B-GGUF

31B • Updated Apr 29 • 27 • 1

KVCache-ai/DeepSeek-R1-GGML-FP8-Hybrid

Updated Mar 4 • 13

KVCache-ai/DeepSeek-V3-GGML-FP8-Hybrid

Updated Feb 24 • 1

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs