← back to model results
ground truth
Such Spinners, Much Loading
csssource ↗
model outputs
Gemini 3 Flash Preview (no output)
A —T —
Qwen3-VL-8B-Instruct →
A 0.79T 0.16
GPT-5.4 →
A 0.79T 0.28
Claude Sonnet 4.6 →
A 0.68T 0.31
LLaMA 4 Scout →
A 0.82T 0.38
1<div class="loader loader-6"></div>
2