6 21 5

Siyuan Hu

h-siyuan

AI & ML interests

None yet

Recent Activity

updated a dataset 8 days ago

h-siyuan/ScreenDrag

published a dataset 10 days ago

h-siyuan/ScreenDrag

upvoted a paper 16 days ago

FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection

View all activity

Organizations

updated a dataset 8 days ago

h-siyuan/ScreenDrag

Preview • Updated 8 days ago • 231

published a dataset 10 days ago

h-siyuan/ScreenDrag

Preview • Updated 8 days ago • 231

upvoted a paper 16 days ago

FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection

Paper • 2601.03928 • Published 24 days ago • 17

upvoted a paper 17 days ago

ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands

Paper • 2512.24965 • Published about 1 month ago • 41

upvoted 2 papers about 2 months ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Paper • 2512.13281 • Published Dec 15, 2025 • 64

Glance: Accelerating Diffusion Models with 1 Sample

Paper • 2512.02899 • Published Dec 2, 2025 • 30

upvoted a paper 2 months ago

The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation

Paper • 2511.20256 • Published Nov 25, 2025 • 28

updated a Space 2 months ago

AUI

🌖

Display a gallery of images

upvoted a paper 2 months ago

Computer-Use Agents as Judges for Generative User Interface

Paper • 2511.15567 • Published Nov 19, 2025 • 53

liked a Space 2 months ago

AUI

🌖

Display a gallery of images

upvoted 3 papers 3 months ago

WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation

Paper • 2511.11434 • Published Nov 14, 2025 • 45

Grounding Computer Use Agents on Human Demonstrations

Paper • 2511.07332 • Published Nov 10, 2025 • 106

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Paper • 2511.02778 • Published Nov 4, 2025 • 102

upvoted a paper 4 months ago

Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published Oct 6, 2025 • 119

upvoted a paper 8 months ago

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper • 2505.21497 • Published May 27, 2025 • 109

upvoted 3 papers 11 months ago

upvoted 2 papers 12 months ago

WorldGUI: Dynamic Testing for Comprehensive Desktop GUI Automation

Paper • 2502.08047 • Published Feb 12, 2025 • 28

TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation

Paper • 2502.07870 • Published Feb 11, 2025 • 45

Siyuan Hu

AI & ML interests

Recent Activity

Organizations

h-siyuan's activity

AUI

AUI