Zijian Wu PRO

Jakumetsu

zjwu0522

AI & ML interests

AGI

Recent Activity

commented on a paper 29 days ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

commented on a paper about 1 month ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

upvoted a paper about 1 month ago

GEM: A Gym for Agentic LLMs

upvoted 4 papers about 1 month ago

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published Oct 1 • 87

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28 • 170

Language Models Can Learn from Verbal Feedback Without Scalar Rewards

Paper • 2509.22638 • Published Sep 26 • 67

Variational Reasoning for Language Models

Paper • 2509.22637 • Published Sep 26 • 68

upvoted a paper 4 months ago

Balancing Truthfulness and Informativeness with Uncertainty-Aware Instruction Fine-Tuning

Paper • 2502.11962 • Published Feb 17 • 38

upvoted a collection 5 months ago

SynthRL

Models & Datasets of SynthRL • 10 items • Updated Jun 4 • 5

upvoted a paper 5 months ago

SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis

Paper • 2506.02096 • Published Jun 2 • 52

upvoted a collection 5 months ago

NoisyRollout

8 items • Updated May 20 • 6

upvoted a paper 6 months ago

FlowReasoner: Reinforcing Query-Level Meta-Agents

Paper • 2504.15257 • Published Apr 21 • 47

upvoted 2 papers 7 months ago

NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation

Paper • 2504.13055 • Published Apr 17 • 19

Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation

Paper • 2503.19622 • Published Mar 25 • 31

upvoted a paper about 1 year ago

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures

Paper • 2410.13754 • Published Oct 17, 2024 • 75