ground truth
Single Element Spinner
model outputs
Gemini 3 Flash Preview: A 0.77, T 0.23
Qwen3-VL-8B-Instruct: A 0.91, T 0.29
GPT-5.4: A 0.94, T 0.31
Claude Sonnet 4.6: A 0.89, T 0.28
LLaMA 4 Scout: A 0.91, T 0.27
<div class="loader loader-4"></div>