Iterative Tool Usage Exploration for Multimodal Agents via Step-wise Preference Tuning • Paper 2504.21561 • Published Apr 30