Ziqi wang's picture

3 10

Ziqi wang

wzq016

·

https://wzq016.github.io

AI & ML interests

NLP

Organizations

upvoted 2 papers 6 months ago

MIRIX: Multi-Agent Memory System for LLM-Based Agents

Paper • 2507.07957 • Published Jul 10, 2025 • 79

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers

Paper • 2506.23918 • Published Jun 30, 2025 • 89

upvoted 2 papers 7 months ago

Saffron-1: Towards an Inference Scaling Paradigm for LLM Safety Assurance

Paper • 2506.06444 • Published Jun 6, 2025 • 73

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30, 2025 • 143

upvoted a paper 8 months ago

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published May 5, 2025 • 79

upvoted a paper 10 months ago

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26, 2025 • 82

upvoted 2 papers over 1 year ago

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30, 2024 • 54

Eliminating Position Bias of Language Models: A Mechanistic Approach

Paper • 2407.01100 • Published Jul 1, 2024 • 8

upvoted a collection over 1 year ago

Model Extrapolation Expedites Alignment

Better aligned models obtained by model extrapolation (ExPO) • 25 items • Updated May 27, 2025 • 17

upvoted a paper over 1 year ago

Weak-to-Strong Extrapolation Expedites Alignment

Paper • 2404.16792 • Published Apr 25, 2024 • 11