Design2Code: Benchmarking Multimodal Code Generation for Automated Front-End Engineering
In this work, we construct Design2Code, a benchmark for multimodal code generation in automated front-end engineering. We manually curate 484 diverse real-world webpages as test cases and develop a set of automatic evaluation metrics to assess how well current multimodal LLMs can generate code implementations that render directly into the reference webpages when given screenshots of those webpages as input.