Thought-Augmented Policy Optimization: Bridging External Guidance and Internal Capabilities • Paper 2505.15692 • Published May 21
DReSS: Data-driven Regularized Structured Streamlining for Large Language Models • Paper 2501.17905 • Published Jan 29