KVCache.ai
community
Followers: 38
AI & ML interests: None defined yet.
Recent Activity
zhang-mingxing authored a paper about 1 month ago: Efficient and Economic Large Language Model Inference with Attention Offloading
zhang-mingxing authored a paper about 1 month ago: Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving
zhang-mingxing authored a paper about 1 month ago: MoBA: Mixture of Block Attention for Long-Context LLMs
Team members: 6
models: 5
KVCache-ai/Kimi-K2-Instruct-0905-GGUF • 1T • Updated Sep 5 • 37 • 1
KVCache-ai/Kimi-K2-Instruct-GGUF • 1T • Updated Jul 12 • 64 • 18
KVCache-ai/Qwen3-30BA3B-GGUF • 31B • Updated Apr 29 • 27 • 1
KVCache-ai/DeepSeek-R1-GGML-FP8-Hybrid • Updated Mar 4 • 13
KVCache-ai/DeepSeek-V3-GGML-FP8-Hybrid • Updated Feb 24 • 1
datasets: 0 (none public yet)