Design2Code: Benchmarking Multimodal Code Generation for Automated Front-End Engineering
https://arxiv.org/abs/2403.03163
In this work, we construct Design2Code, a benchmark for multimodal code generation in front-end engineering.
We manually curate 484 diverse real-world webpages as test cases and develop a set of automatic evaluation metrics to assess how well current multimodal LLMs can generate code that renders directly into the given reference webpages, given only the screenshots as input.
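To make the idea of an automatic metric concrete, here is a minimal sketch of one possible check: comparing the visible text of a reference page with that of a generated page. This is an illustrative assumption, not the metric suite defined in the paper; the names `visible_text` and `text_similarity` are hypothetical, and only the Python standard library is used.

```python
from difflib import SequenceMatcher
from html.parser import HTMLParser


class _VisibleText(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text outside script/style, with whitespace collapsed.
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(" ".join(data.split()))


def visible_text(html):
    """Hypothetical helper: return the page's text content as one string."""
    parser = _VisibleText()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


def text_similarity(reference_html, generated_html):
    """Hypothetical toy metric: ratio in [0, 1], where 1.0 means identical text."""
    return SequenceMatcher(
        None, visible_text(reference_html), visible_text(generated_html)
    ).ratio()
```

A text-only comparison like this ignores layout, color, and element position, so it could at most serve as one component of an evaluation; a score computed on rendered screenshots would be needed to capture visual fidelity.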
https://www.design2code.dev/