INF-Retriever-v1 Collection • LLM-based dense retrieval models for EN & ZH (also effective in other languages) • 3 items • Updated 13 days ago • 4
Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning Paper • 2505.24850 • Published May 30 • 8
VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning Paper • 2504.08837 • Published Apr 10 • 43
COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values Paper • 2504.05535 • Published Apr 7 • 44
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper • 2411.04905 • Published Nov 7, 2024 • 127
CostFormer: Cost Transformer for Cost Aggregation in Multi-view Stereo Paper • 2305.10320 • Published May 17, 2023 • 1