Unified Reinforcement and Imitation Learning for Vision-Language Models Paper • 2510.19307 • Published 5 days ago • 22
LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts Paper • 2510.19363 • Published 5 days ago • 54
view article Article Benchmarking Language Model Performance on 5th Gen Xeon at GCP Dec 17, 2024 • 7
view article Article Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face 11 days ago • 12
Attention Is All You Need for KV Cache in Diffusion LLMs Paper • 2510.14973 • Published 10 days ago • 36
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents Paper • 2510.14967 • Published 10 days ago • 32
Advancing End-to-End Pixel Space Generative Modeling via Self-supervised Pre-training Paper • 2510.12586 • Published 12 days ago • 106
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation Paper • 2510.08673 • Published 17 days ago • 117
Diffusion Transformers with Representation Autoencoders Paper • 2510.11690 • Published 13 days ago • 157
TC-LoRA: Temporally Modulated Conditional LoRA for Adaptive Diffusion Control Paper • 2510.09561 • Published 16 days ago • 7
Don't Just Fine-tune the Agent, Tune the Environment Paper • 2510.10197 • Published 15 days ago • 27
Training-Free Group Relative Policy Optimization Paper • 2510.08191 • Published 17 days ago • 42
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published 13 days ago • 165
Self-Improvement in Multimodal Large Language Models: A Survey Paper • 2510.02665 • Published 24 days ago • 19
Reactive Transformer (RxT) -- Stateful Real-Time Processing for Event-Driven Reactive Language Models Paper • 2510.03561 • Published 23 days ago • 23