animation2code benchmark

For best compatibility, please view this dashboard in a Chrome browser.

← back to model results

ground truth

Exploring Bourbon

model outputs

Gemini 3 Flash Preview →

A 0.80T 0.23

Qwen3-VL-8B-Instruct →

A 0.63T 0.00

A 0.77T 0.32

Claude Sonnet 4.6 →

A 0.75T 0.31

LLaMA 4 Scout →

A 0.66T 0.26

1<div class="container"><div class="item three"></div>
2