IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse Paper • 2603.12201 • Published 10 days ago • 51
LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory Paper • 2603.03269 • Published 19 days ago • 61
Lost in Stories: Consistency Bugs in Long Story Generation by LLMs Paper • 2603.05890 • Published 17 days ago • 91
Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence Paper • 2603.07660 • Published 14 days ago • 83
On Data Engineering for Scaling LLM Terminal Capabilities Paper • 2602.21193 • Published 26 days ago • 99
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14, 2025 • 156
A decoder-only foundation model for time-series forecasting Paper • 2310.10688 • Published Oct 14, 2023 • 15
BitDance: Scaling Autoregressive Generative Models with Binary Tokens Paper • 2602.14041 • Published Feb 15 • 53
rizkysulaeman/Gemma3N-4B-Conv-MM-Img-Audio-Text-Code-Reasoning-Q8_0-GGUF Any-to-Any • 7B • Updated Feb 19 • 166 • 1
rizkysulaeman/Gemma3N-4B-Conv-MM-Img-Audio-Text-HealthCare-Q8_0-GGUF Any-to-Any • 7B • Updated Feb 19 • 46
rizkysulaeman/Gemma3N-4B-Conv-MM-Img-Audio-Text-HealthCare-Q8_0-GGUF Any-to-Any • 7B • Updated Feb 19 • 46
rizkysulaeman/Qwen3-VL-8B-Vision-GRPO-HealthCare-Q8_0-GGUF Image-Text-to-Text • 8B • Updated Feb 17 • 52 • 1
rizkysulaeman/Qwen3-VL-8B-Vision-GRPO-HealthCare-Q8_0-GGUF Image-Text-to-Text • 8B • Updated Feb 17 • 52 • 1
rizkysulaeman/Gemma3N-4B-Conv-MM-Img-Audio-Text-Code-Reasoning-Q8_0-GGUF Any-to-Any • 7B • Updated Feb 19 • 166 • 1