Zero-shot video (or image-frame) → code results on the test set, across commercial and open-source models.
Each output is tagged with A (appearance similarity) and T (temporal similarity); higher is better for both.
Showing examples 1–8 of 214.
Each ground-truth video below is shown alongside outputs from the same five models: Gemini 3 Flash Preview, Qwen3-VL-8B-Instruct, GPT-5.4, Claude Sonnet 4.6, and LLaMA 4 Scout.
Ground-truth videos:
Merry Christmas Tree!
Road Block
SVG Draught Beer
SVG Multi-Drip
sting
CSS Direction Animation
Motion Table - Solid Rotation
Orbit 3D