ground truth
Animated Concepts #3
css
model outputs
Gemini 3 Flash Preview: A 0.81, T 0.31
Qwen3-VL-8B-Instruct: A 0.51, T 0.30
GPT-5.4: A 0.87, T 0.29
Claude Sonnet 4.6: A 0.89, T 0.25
LLaMA 4 Scout: A 0.69, T 0.25
<span class="load1"></span>