Abstract
Paper2Web is a benchmark and evaluation framework for academic webpage generation, featuring PWAgent, an autonomous pipeline that enhances content and layout through MCP tools, outperforming end-to-end baselines.
Academic project websites can more effectively disseminate research when they clearly present core content and enable intuitive navigation and interaction. However, current approaches such as direct Large Language Model (LLM) generation, templates, or direct HTML conversion struggle to produce layout-aware, interactive sites, and a comprehensive evaluation suite for this task has been lacking. In this paper, we introduce Paper2Web, a benchmark dataset and multi-dimensional evaluation framework for assessing academic webpage generation. It incorporates rule-based metrics like Connectivity, Completeness and human-verified LLM-as-a-Judge (covering interactivity, aesthetics, and informativeness), and PaperQuiz, which measures paper-level knowledge retention. We further present PWAgent, an autonomous pipeline that converts scientific papers into interactive and multimedia-rich academic homepages. The agent iteratively refines both content and layout through MCP tools that enhance emphasis, balance, and presentation quality. Our experiments show that PWAgent consistently outperforms end-to-end baselines like template-based webpages and arXiv/alphaXiv versions by a large margin while maintaining low cost, achieving the Pareto-front in academic webpage generation.
Community
tldr; Transforming static academic papers into interactive, visually coherent websites for enhanced dissemination and engagement.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- PosterForest: Hierarchical Multi-Agent Collaboration for Scientific Poster Generation (2025)
- IWR-Bench: Can LVLMs reconstruct interactive webpage from a user interaction video? (2025)
- Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs (2025)
- AutoPR: Let's Automate Your Academic Promotion! (2025)
- UNIDOC-BENCH: A Unified Benchmark for Document-Centric Multimodal RAG (2025)
- WebRenderBench: Enhancing Web Interface Generation through Layout-Style Consistency and Reinforcement Learning (2025)
- RECODE-H: A Benchmark for Research Code Development with Interactive Human Feedback (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 1
Spaces citing this paper 0
No Space linking this paper