ground truth
Animation
model outputs
Gemini 3 Flash Preview: A 0.87, T 0.36
Qwen3-VL-8B-Instruct: A 0.52, T 0.31
GPT-5.4: A 0.88, T 0.30
Claude Sonnet 4.6: A 0.91, T 0.36
LLaMA 4 Scout: A 0.42, T 0.18
<section>
  <div></div>
  <div></div>
  <div></div>
  <div></div>
  <div></div>
  <div></div>
  <div></div>
  <div></div>
  <div></div>
</section>