Jiawei Wang

Jarvis1111

https://jarvisustc.github.io/

AI & ML interests

None yet

Recent Activity

commented on a paper 10 days ago

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

authored a paper about 1 month ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

upvoted a paper about 1 month ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Organizations

None yet

commented on a paper 10 days ago

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

Paper • 2509.09265 • Published Sep 11 • 45

commented on a paper about 2 months ago

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

Paper • 2509.09265 • Published Sep 11 • 45

commented on a paper 3 months ago

WideSearch: Benchmarking Agentic Broad Info-Seeking

Paper • 2508.07999 • Published Aug 11 • 109

New activity in Jarvis1111/DoctorAgent-RL-SFT-1k-Thinking 3 months ago

Improve model card: Update pipeline tag, add `transformers` library, and enhance content with paper/code links

#1 opened 3 months ago by

New activity in Jarvis1111/DoctorAgent-RL 3 months ago

Add comprehensive model card

#1 opened 3 months ago by

commented on a paper 5 months ago

DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue

Paper • 2505.19630 • Published May 26 • 7

New activity in Jarvis1111/llava-v1.5-7b-RobustVLGuard 6 months ago

Add pipeline tag and library name

#1 opened 7 months ago by

New activity in Jarvis1111/InternVL2-8B-RobustVLGuard 6 months ago

Add base model

#2 opened 7 months ago by

New activity in Jarvis1111/MiniGPT4-RobustVLGuard 7 months ago

Add pipeline tag, library name, and project page link

#1 opened 7 months ago by

New activity in Jarvis1111/InternVL2-8B-RobustVLGuard 7 months ago

Add pipeline tag and library name

#1 opened 7 months ago by

New activity in Jarvis1111/RobustVLGuard 7 months ago

Fix paper link

#2 opened 7 months ago by

commented on 2 papers 7 months ago

Safeguarding Vision-Language Models: Mitigating Vulnerabilities to Gaussian Noise in Perturbation-based Attacks

Paper • 2504.01308 • Published Apr 2 • 14

UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis

Paper • 2503.15893 • Published Mar 20 • 2

New activity in jordyvl/DUDE_loader over 2 years ago

The difference between azure_due and azure_original

#3 opened over 2 years ago by