When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning • Paper • 2504.01005 • Published Apr 1, 2025
Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection • Paper • 2503.12271 • Published Mar 15, 2025
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning • Paper • 2302.08560 • Published Feb 16, 2023
Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces • Paper • 2410.09918 • Published Oct 13, 2024
Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback • Paper • 2410.23022 • Published Oct 30, 2024
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning • Paper • 2502.03275 • Published Feb 5, 2025
OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows • Paper • 2412.01169 • Published Dec 2, 2024
Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories • Paper • 2210.06518 • Published Oct 12, 2022