PuzzleClone: An SMT-Powered Framework for Synthesizing Verifiable Data Paper • 2508.15180 • Published Aug 21 • 1
Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning Paper • 2507.16802 • Published Jul 22 • 8 • 4
BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs Paper • 2505.19457 • Published May 26 • 64 • 4
BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs Paper • 2505.19457 • Published May 26 • 64
BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs Paper • 2505.19457 • Published May 26 • 64 • 4
MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning Paper • 2411.03314 • Published Nov 5, 2024 • 1
Nexus-O: An Omni-Perceptive And -Interactive Model for Language, Audio, And Vision Paper • 2503.01879 • Published Feb 26 • 2
Using AI to Hack IA: A New Stealthy Spyware Against Voice Assistance Functions in Smart Phones Paper • 1805.06187 • Published May 16, 2018
MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning Paper • 2411.03314 • Published Nov 5, 2024 • 1
MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning Paper • 2502.19634 • Published Feb 26 • 63