← back to model results
ground truth
Animated Concepts #3
csssource ↗
model outputs
Gemini 3 Flash Preview →
A 0.84T 0.26
Qwen3-VL-8B-Instruct →
A 0.82T 0.21
GPT-5.4 →
A 0.90T 0.28
Claude Sonnet 4.6 →
A 0.86T 0.26
LLaMA 4 Scout →
A 0.70T 0.20
1<span class="load9"></span>
2