Urambazaji
AI BENCHY
Advertise here

AI BENCHY Compare

ByteDance Seed: Seed-2.0-Lite vs Owl Alpha

Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe: 2026-05-22

Kipimo Seed-2.0-Lite Seed-2.0-Lite none Toleo: 2026-02-14 Owl Alpha Owl Alpha none Toleo: 2026-04-30
Alama 5.9 5.7
Nafasi #99 #106
Uaminifu 10.0 10.0
Uthabiti 7.9 9.2
Majaribio sahihi
Kiwango cha kupita kwa kila jaribio 50.0% 41.7%
Majaribio yasiyo thabiti 5 2
Jumla ya uendeshaji 60 60
Gharama kwa matokeo 0.216 0.000
Jumla ya gharama $0.018 $0.000
Bei ya ingizo $0.250 / 1M $0.000 / 1M
Bei ya toleo $2.000 / 1M $0.000 / 1M
Tokeni za matokeo 3,164 4,864
Tokeni za hoja 0 0
Muda wa majibu (wastani) 2.44s 8.84s
Muda wa majibu (upeo) 6.70s 47.10s
Muda wa majibu (jumla) 48.71s 176.83s

Modeli bora kwa alama

Alama dhidi ya gharama ya jumla

Muda wa majibu (wastani)

Alama vs Muda wa majibu (wastani)

Jumla ya tokeni za matokeo

Alama vs Jumla ya tokeni za matokeo

Mgawanyo wa kategoria

Mbinu za kupinga AI Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Seed-2.0-Lite 3.0 5.9 16.7% 2 2.43s 709 0
Owl Alpha 3.4 7.9 16.7% 1 2.78s 57 0
Uandishi wa msimbo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Seed-2.0-Lite 6.8 9.9 50.0% 0 2.95s 404 0
Owl Alpha 7.0 9.9 50.0% 0 39.68s 3,629 0
Mchanganyiko Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Seed-2.0-Lite 3.0 10.0 0.0% 0 6.59s 498 0
Owl Alpha 3.0 10.0 0.0% 0 21.74s 315 0
Uchanganuzi na uchimbaji wa data Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Seed-2.0-Lite 10.0 10.0 100.0% 0 1.82s 246 0
Owl Alpha 10.0 10.0 100.0% 0 3.60s 246 0
Mahususi kwa domeni Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Seed-2.0-Lite 3.6 7.2 22.2% 1 1.33s 17 0
Owl Alpha 5.3 10.0 33.3% 0 3.00s 27 0
Akili ya jumla Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Seed-2.0-Lite 10.0 10.0 100.0% 0 3.45s 294 0
Owl Alpha 4.3 10.0 0.0% 0 4.61s 80 0
Ufuataji wa maagizo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Seed-2.0-Lite 10.0 10.0 100.0% 0 1.06s 73 0
Owl Alpha 6.4 10.0 50.0% 0 2.63s 63 0
Utatuzi wa mafumbo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Seed-2.0-Lite 5.2 4.4 55.6% 2 2.46s 620 0
Owl Alpha 5.9 7.2 55.6% 1 4.43s 202 0
Mwito wa zana Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Seed-2.0-Lite 10.0 10.0 100.0% 0 3.94s 292 0
Owl Alpha 10.0 10.0 100.0% 0 22.78s 231 0
Maarifa ya jumla Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Seed-2.0-Lite 3.0 10.0 0.0% 0 1.96s 11 0
Owl Alpha 3.0 10.0 0.0% 0 2.50s 14 0

Ulinganisho wa haraka

Badilisha jozi ya ulinganisho