AI BENCHY Compare

ByteDance Seed: Seed-2.0-Lite vs OpenAI: GPT-5.3-Codex

Benchmarks generated from the AI BENCHY test suites on: 2026-03-12

Metric                     Seed-2.0-Lite          GPT-5.3-Codex
Variant                    Seed-2.0-Lite medium   GPT-5.3-Codex medium
Release                    2026-02-14             2026-02-05
Rank                       #3                     #4
Average score              8.5                    8.4
Consistency                8.7                    9.1
Cost per result            0.870                  4.485
Total cost                 $0.105                 $0.539
Correct tests              -                      -
Pass rate per test         87.5%                  83.3%
Flaky tests                3                      2
Total runs                 48                     48
Output tokens              2,815                  1,764
Reasoning tokens           44,618                 33,348
Response time (average)    29.39s                 16.59s
Response time (max)        168.71s                100.93s
Response time (total)      470.29s                265.39s
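
The headline figures can be turned into relative comparisons with a little arithmetic. The Python sketch below is illustrative only: the dictionary layout and the helper names `ratio` and `estimated_passing_runs` are hypothetical, not part of AI BENCHY, and the passing-run estimate assumes the pass rate is measured over all 48 runs, which the page does not state.

```python
# Summary figures copied from the comparison table (2026-03-12 snapshot).
summary = {
    "Seed-2.0-Lite": {"total_cost_usd": 0.105, "avg_response_s": 29.39,
                      "pass_rate": 0.875, "total_runs": 48},
    "GPT-5.3-Codex": {"total_cost_usd": 0.539, "avg_response_s": 16.59,
                      "pass_rate": 0.833, "total_runs": 48},
}


def ratio(metric: str, numerator: str, denominator: str) -> float:
    """Return how many times larger `metric` is for one model than the other."""
    return summary[numerator][metric] / summary[denominator][metric]


def estimated_passing_runs(model: str) -> int:
    """Estimate passing runs, ASSUMING the pass rate covers all runs."""
    row = summary[model]
    return round(row["pass_rate"] * row["total_runs"])


cost_ratio = ratio("total_cost_usd", "GPT-5.3-Codex", "Seed-2.0-Lite")
time_ratio = ratio("avg_response_s", "Seed-2.0-Lite", "GPT-5.3-Codex")

print(f"GPT-5.3-Codex total cost vs Seed-2.0-Lite: {cost_ratio:.2f}x")
print(f"Seed-2.0-Lite avg response time vs GPT-5.3-Codex: {time_ratio:.2f}x")
for name in summary:
    print(name, "estimated passing runs:", estimated_passing_runs(name))
```

On these figures GPT-5.3-Codex costs roughly five times as much in total, while Seed-2.0-Lite takes roughly 1.8 times as long per response on average.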

[Charts: Top models by score; Score vs. total cost; Response time (average); Average score vs. response time (average); Total output tokens; Average score vs. total output tokens]

Category breakdown

Category                      Model          Score  Consistency  Pass rate per test  Flaky tests  Correct tests  Response time (avg)  Output tokens  Reasoning tokens
Anti-AI techniques            Seed-2.0-Lite  10.0   10.0         100.0%              0            -              23.34s               990            7,037
                              GPT-5.3-Codex  10.0   10.0         100.0%              0            -              4.69s                216            1,421
Mixed                         Seed-2.0-Lite  10.0   10.0         100.0%              0            -              37.67s               506            4,299
                              GPT-5.3-Codex  10.0   10.0         100.0%              0            -              19.56s               364            2,731
Data parsing and extraction   Seed-2.0-Lite  9.9    10.0         100.0%              0            -              9.07s                246            1,742
                              GPT-5.3-Codex  9.9    10.0         100.0%              0            -              3.07s                234            728
Domain-specific               Seed-2.0-Lite  4.0    7.2          55.6%               1            -              88.74s               15             23,897
                              GPT-5.3-Codex  4.0    7.2          55.6%               1            -              64.31s               64             25,308
General intelligence          Seed-2.0-Lite  7.0    3.6          66.7%               1            -              18.25s               304            1,620
                              GPT-5.3-Codex  4.0    10.0         0.0%                0            -              4.87s                187            331
Instruction following         Seed-2.0-Lite  10.0   10.0         100.0%              0            -              7.26s                71             1,480
                              GPT-5.3-Codex  10.0   10.0         100.0%              0            -              3.04s                93             693
Puzzle Solving                Seed-2.0-Lite  9.3    7.9          88.9%               1            -              11.03s               461            3,532
                              GPT-5.3-Codex  9.3    7.9          88.9%               1            -              5.12s                352            1,644
Tool calling                  Seed-2.0-Lite  10.0   10.0         100.0%              0            -              12.38s               222            1,011
                              GPT-5.3-Codex  10.0   10.0         100.0%              0            -              6.37s                254            492
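
As a consistency check on the per-category rows, the following Python sketch sums the reasoning tokens per model and lists the categories where the two scores differ. It is a hedged illustration: the `breakdown` structure and variable names are hypothetical, the category labels are written in English, and only the score and reasoning-token columns are copied from the page.

```python
SEED, CODEX = "Seed-2.0-Lite", "GPT-5.3-Codex"

# (score, reasoning tokens) per category and model, copied from the breakdown.
breakdown = {
    "Anti-AI techniques":          {SEED: (10.0, 7037),  CODEX: (10.0, 1421)},
    "Mixed":                       {SEED: (10.0, 4299),  CODEX: (10.0, 2731)},
    "Data parsing and extraction": {SEED: (9.9, 1742),   CODEX: (9.9, 728)},
    "Domain-specific":             {SEED: (4.0, 23897),  CODEX: (4.0, 25308)},
    "General intelligence":        {SEED: (7.0, 1620),   CODEX: (4.0, 331)},
    "Instruction following":       {SEED: (10.0, 1480),  CODEX: (10.0, 693)},
    "Puzzle Solving":              {SEED: (9.3, 3532),   CODEX: (9.3, 1644)},
    "Tool calling":                {SEED: (10.0, 1011),  CODEX: (10.0, 492)},
}

# Categories in which the two models received different scores.
differing = [name for name, row in breakdown.items()
             if row[SEED][0] != row[CODEX][0]]

# Reasoning tokens summed across categories, to compare with the summary totals.
reasoning_totals = {
    model: sum(row[model][1] for row in breakdown.values())
    for model in (SEED, CODEX)
}

print("Categories with different scores:", differing)
print("Reasoning tokens summed over categories:", reasoning_totals)
```

The per-category reasoning tokens add up to the summary totals of 44,618 and 33,348, and General intelligence is the only category in which the two scores differ.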
