Design2Code: Benchmarking Multimodal Code Generation for Automated Front-End Engineering

In this work, we construct Design2Code

we manually curate 484 diverse real-world webpages as test cases and develop a set of automatic evaluation metrics to assess how well current multimodal LLMs can generate the code implementations that directly render into the given reference webpages, given the screenshots as input.

https://www.design2code.dev/