ground truth
Simple loading indicators
css
model outputs
Gemini 3 Flash Preview: A 0.95, T 0.33
Qwen3-VL-8B-Instruct: A 0.66, T 0.30
GPT-5.4: A 0.94, T 0.27
Claude Sonnet 4.6: A 0.94, T 0.29
LLaMA 4 Scout: A 0.64, T 0.26
<div class="loading round"></div>