7 15 16

Kaixin Li

likaixin

https://likaixin2000.github.io/

likaixin2000

AI & ML interests

None yet

Recent Activity

authored a paper 5 days ago

Grounding Computer Use Agents on Human Demonstrations

upvoted a paper 6 days ago

Grounding Computer Use Agents on Human Demonstrations

liked a dataset 8 days ago

ServiceNow/GroundCUA

View all activity

Organizations

upvoted a paper 6 days ago

Grounding Computer Use Agents on Human Demonstrations

Paper • 2511.07332 • Published 8 days ago • 98

upvoted a paper 14 days ago

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Paper • 2511.02778 • Published 14 days ago • 100

upvoted a collection about 1 month ago

Qwen3-VL

Collection

37 items • Updated 17 days ago • 411

upvoted 3 papers about 1 month ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published Oct 9 • 35

MMCode: Evaluating Multi-Modal Code Large Language Models with Visually Rich Programming Problems

Paper • 2404.09486 • Published Apr 15, 2024 • 2

AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness

Paper • 2507.01702 • Published Jul 2 • 3

upvoted an article about 1 month ago

Article

BigCodeArena: Judging code generations end to end with code executions

Oct 7

•

upvoted a paper about 2 months ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28 • 171

upvoted a paper 3 months ago

MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers

Paper • 2508.14704 • Published Aug 20 • 42

upvoted 2 articles 5 months ago

Article

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

Jun 21

•

Article

GRPO for GUI Grounding Done Right

Jun 11

•

upvoted 3 papers 6 months ago

ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use

Paper • 2504.07981 • Published Apr 4 • 2

GraphOmni: A Comprehensive and Extendable Benchmark Framework for Large Language Models on Graph-theoretic Tasks

Paper • 2504.12764 • Published Apr 17 • 41

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper • 2505.21497 • Published May 27 • 109

upvoted an article 10 months ago

Article

✴️ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use

Jan 3

•

Kaixin Li

AI & ML interests

Recent Activity

Organizations

likaixin's activity

BigCodeArena: Judging code generations end to end with code executions

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

GRPO for GUI Grounding Done Right

✴️ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use