PartNeXt: A Next-Generation Dataset for Fine-Grained and Hierarchical 3D Part Understanding Paper • 2510.20155 • Published Oct 23, 2025 • 6
Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets Paper • 2509.21245 • Published Sep 25, 2025 • 39
MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs Paper • 2508.18264 • Published Aug 25, 2025 • 25
LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries Paper • 2508.15760 • Published Aug 21, 2025 • 47
MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning Paper • 2401.10727 • Published Jan 19, 2024 • 2
TransRAC: Encoding Multi-scale Temporal Correlation with Transformers for Repetitive Action Counting Paper • 2204.01018 • Published Apr 3, 2022
Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos Paper • 2303.12370 • Published Mar 22, 2023
Efficient Post-Training Refinement of Latent Reasoning in Large Language Models Paper • 2506.08552 • Published Jun 10, 2025 • 1
Teaching Time Series to See and Speak: Forecasting with Aligned Visual and Textual Perspectives Paper • 2506.24124 • Published Jun 30, 2025 • 1
Leveraging Large Language Models for Effective Label-free Node Classification in Text-Attributed Graphs Paper • 2412.11983 • Published Dec 16, 2024 • 1
Intelligent System for Automated Molecular Patent Infringement Assessment Paper • 2412.07819 • Published Dec 10, 2024 • 2
V^3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussians Paper • 2409.13648 • Published Sep 20, 2024 • 11
ZePo: Zero-Shot Portrait Stylization with Faster Sampling Paper • 2408.05492 • Published Aug 10, 2024 • 7