ground truth
Only CSS: Codevember #5 Lightsaber
model outputs
Gemini 3 Flash Preview: A 0.74, T 0.25
Qwen3-VL-8B-Instruct: no output (A n/a, T n/a)
GPT-5.4: A 0.83, T 0.28
Claude Sonnet 4.6: A 0.77, T 0.22
LLaMA 4 Scout: A 0.49, T 0.20
<div class="force_field"><div class="sword"><div class="grip"></div><div class="beam"></div></div></div>