Multi-LCB: Extending LiveCodeBench to Multiple Programming Languages Paper • 2606.20517 • Published 6 days ago • 57
🍎 Qwopus3.6 Collection This collection features the advanced Qwopus3.6 series of multimodal large models, which are fine-tuned from the Qwen3.6 base models with a focus on e • 10 items • Updated about 1 month ago • 68
🚀 Qwen-MTP Collection ⚡ MTP (Multi-Token Prediction) speculative decoding enables models like Qwen3.6 to generate ~1.4-2.2x faster with no change in accuracy. • 8 items • Updated 3 days ago • 29
StarVector SVG Datasets (🏆SVG-Bench) Collection Datasets for training and evaluating SVG generation models • 11 items • Updated Jan 12, 2025 • 23
Article EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios ServiceNow-AI • 19 days ago • 41
Article Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech ServiceNow-AI • 14 days ago • 44
Article Introducing RTEB: A New Standard for Retrieval Evaluation +4 fzliu, KennethEnevoldsen, Samoed, isaacchung, tomaarsen, fzoll • Oct 1, 2025 • 146
Article StarCoder: A State-of-the-Art LLM for Code lvwerra, loubnabnl • May 4, 2023 • 75
Article MosaicLeaks: Can your research agent keep a secret? ServiceNow • 5 days ago • 11
AI Engineering Collection A collection of arXiv papers from Chip Huyen's AI Engineering organized by chapter and ordered by when each appears in the book. • 239 items • Updated Mar 29, 2025 • 28