← back to model results
ground truth
Nice spinny stuff
jssource ↗
model outputs
Gemini 3 Flash Preview →
A 0.82T 0.27
Qwen3-VL-8B-Instruct (no output)
A —T —
GPT-5.4 →
A 0.91T 0.33
Claude Sonnet 4.6 →
A 0.77T 0.31
LLaMA 4 Scout →
A 0.54T 0.33
1<div class="container">
2<div class="verypoop lol"></div>
3</div>