ground truth
Such Spinners, Much Loading
css
model outputs
Gemini 3 Flash Preview: A 0.88, T 0.39
Qwen3-VL-8B-Instruct: A 0.90, T 0.24
GPT-5.4: A 0.92, T 0.38
Claude Sonnet 4.6: A 0.90, T 0.37
LLaMA 4 Scout: A 0.78, T 0.15
<div class="loader loader-2"></div>