ground truth
Staggered Stair Loading
model outputs
Gemini 3 Flash Preview: A 0.92, T 0.30
Qwen3-VL-8B-Instruct: A 0.70, T 0.21
GPT-5.4: A 0.96, T 0.20
Claude Sonnet 4.6: A 0.84, T 0.29
LLaMA 4 Scout: A 0.44, T 0.19
<div class="loading">Loading</div>