Recycling Pretrained Checkpoints: Orthogonal Growth of Mixture-of-Experts for Efficient Large Language Model Pre-Training • Paper • 2510.08008 • Published 17 days ago • 5
Behind RoPE: How Does Causal Mask Encode Positional Information? • Paper • 2509.21042 • Published Sep 25 • 8
FilMaster: Bridging Cinematic Principles and Generative AI for Automated Film Generation • Paper • 2506.18899 • Published Jun 23 • 5
Struct-Bench: A Benchmark for Differentially Private Structured Text Generation • Paper • 2509.10696 • Published Sep 12
Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification • Paper • 2509.15591 • Published Sep 19 • 45
Magentic-One: A Generalist Multi-Agent System for Solving Complex Tasks • Paper • 2411.04468 • Published Nov 7, 2024 • 2