animation2code benchmark

ground truth

Nice spinny stuff

model outputs

GPT-5.4
A 0.85 T 0.32
LLaMA 4 Scout (no output)
A T
<div class="container">
  <div class="poop lol"></div>
</div>