ground truth
Exploring Bourbon
css
model outputs
Gemini 3 Flash Preview: A 0.82, T 0.24
Qwen3-VL-8B-Instruct: A 0.70, T 0.27
GPT-5.4: A 0.79, T 0.30
Claude Sonnet 4.6: A 0.82, T 0.21
LLaMA 4 Scout: A 0.64, T 0.34
<div class="container"><div class="item four"></div>