animation2code benchmark

Zero-shot video (or image-frame) → code results on the test set, across commercial and open-source models.

Each output is tagged with two scores, A (appearance similarity) and T (temporal similarity); higher is better for both.

Items 145–152 of 214

ground truth: #CodeVember #12 - shrinking hexagon

model outputs

Gemini 3 Flash Preview: A 0.68, T 0.23
Qwen3-VL-8B-Instruct: A 0.51, T 0.25
GPT-5.4: A 0.82, T 0.25
Claude Sonnet 4.6: A 0.85, T 0.30
LLaMA 4 Scout: A 0.72, T 0.37

ground truth: Animated Dot Loaders

model outputs

Gemini 3 Flash Preview: A 0.82, T 0.19
Qwen3-VL-8B-Instruct: no output (A —, T —)
GPT-5.4: A 0.96, T 0.17
Claude Sonnet 4.6: A 0.93, T 0.16
LLaMA 4 Scout: A 0.68, T 0.16

ground truth: Animated Dot Loaders

model outputs

Gemini 3 Flash Preview: A 0.78, T 0.26
Qwen3-VL-8B-Instruct: no output (A —, T —)
GPT-5.4: A 0.92, T 0.24
Claude Sonnet 4.6: A 0.87, T 0.22
LLaMA 4 Scout: A 0.65, T 0.22

ground truth: Animated Dot Loaders

model outputs

Gemini 3 Flash Preview: A 0.82, T 0.24
Qwen3-VL-8B-Instruct: A 0.86, T 0.24
GPT-5.4: A 0.93, T 0.21
Claude Sonnet 4.6: A 0.76, T 0.24
LLaMA 4 Scout: A 0.41, T 0.00

ground truth: Tiny Single Element Loading Animations

model outputs

Gemini 3 Flash Preview: A 0.62, T 0.26
Qwen3-VL-8B-Instruct: A 0.83, T 0.23
GPT-5.4: A 0.81, T 0.22
Claude Sonnet 4.6: A 0.85, T 0.25
LLaMA 4 Scout: A 0.54, T 0.00

ground truth: Tiny Single Element Loading Animations

model outputs

Gemini 3 Flash Preview: A 0.85, T 0.33
Qwen3-VL-8B-Instruct: A 0.71, T 0.21
GPT-5.4: A 0.84, T 0.18
Claude Sonnet 4.6: A 0.88, T 0.20
LLaMA 4 Scout: A 0.55, T 0.14

ground truth: Tiny Single Element Loading Animations

model outputs

Gemini 3 Flash Preview: A 0.70, T 0.24
Qwen3-VL-8B-Instruct: A 0.77, T 0.25
GPT-5.4: A 0.92, T 0.20
Claude Sonnet 4.6: A 0.79, T 0.15
LLaMA 4 Scout: A 0.44, T 0.00

ground truth: Tiny Single Element Loading Animations

model outputs

Gemini 3 Flash Preview: A 0.76, T 0.28
Qwen3-VL-8B-Instruct: A 0.69, T 0.19
GPT-5.4: A 0.91, T 0.61
Claude Sonnet 4.6: A 0.94, T 0.26
LLaMA 4 Scout: A 0.56, T 0.00
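
The A/T tags listed for each model can be read programmatically when comparing models across items. The following is a minimal sketch, assuming a tag looks like "A 0.68T 0.23" or "A 0.68, T 0.23" and that a dash marks a missing score (the "no output" case). The names parse_tag and mean_scores are hypothetical helpers for illustration, not part of the benchmark.

```python
import re
from statistics import mean
from typing import List, Optional, Tuple

# Assumed tag shape: "A <score>T <score>", optionally with a comma before T.
# A dash ("—" or "-") stands for a missing score.
TAG_RE = re.compile(
    r"A\s*(?P<a>\d+(?:\.\d+)?|[—-])\s*,?\s*T\s*(?P<t>\d+(?:\.\d+)?|[—-])"
)


def parse_tag(tag: str) -> Tuple[Optional[float], Optional[float]]:
    """Return (appearance, temporal) scores; None where the score is missing."""
    match = TAG_RE.search(tag)
    if match is None:
        raise ValueError(f"unrecognized score tag: {tag!r}")

    def to_score(text: str) -> Optional[float]:
        return None if text in {"—", "-"} else float(text)

    return to_score(match["a"]), to_score(match["t"])


def mean_scores(tags: List[str]) -> Tuple[Optional[float], Optional[float]]:
    """Average A and T over a list of tags, skipping missing scores."""
    parsed = [parse_tag(tag) for tag in tags]
    a_values = [a for a, _ in parsed if a is not None]
    t_values = [t for _, t in parsed if t is not None]
    return (
        mean(a_values) if a_values else None,
        mean(t_values) if t_values else None,
    )


if __name__ == "__main__":
    # Qwen3-VL-8B-Instruct on the three "Animated Dot Loaders" items above:
    # two items have no output, so only the third contributes to the mean.
    print(mean_scores(["A —T —", "A —T —", "A 0.86T 0.24"]))
```

Skipping missing scores is one possible design choice; counting a "no output" item as 0 instead would penalize models that fail to produce code, and the page does not say which convention the benchmark uses.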
