📑 arXiv 3d ago
MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation
MM-WebAgent uses hierarchical planning and iterative self-reflection to coordinate AIGC tools for webpage generation, jointly optimizing layout, multimodal content, and integration. Solves style inconsistency problems in prior approaches that generate visual elements independently, introducing a new multimodal webpage generation benchmark.