Urambazaji
AI BENCHY
Your ad here

AI BENCHY Compare

ByteDance Seed: Seed-2.0-Lite vs Qwen: Qwen3.5-Flash

Benchmark zimetengenezwa kutoka seti za majaribio za AI BENCHY tarehe: 2026-03-17

Kipimo Seed-2.0-Lite Seed-2.0-Lite medium Toleo: 2026-02-14 Qwen3.5-Flash Qwen3.5-Flash medium Toleo: 2026-02-24
Nafasi #5 #19
Alama 8.5 8.0
Uthabiti 8.8 7.6
Gharama kwa matokeo 0.873 0.688
Jumla ya gharama $0.105 $0.076
Majaribio sahihi
Kiwango cha kupita kwa kila jaribio 82.4% 82.4%
Majaribio yasiyo thabiti 3 5
Jumla ya uendeshaji 51 51
Tokeni za matokeo 2,821 1,827
Tokeni za hoja 44,723 179,299
Muda wa majibu (wastani) 27.78s 67.96s
Muda wa majibu (upeo) 168.71s 234.29s
Muda wa majibu (jumla) 472.24s 1155.28s

Modeli bora kwa alama

Alama dhidi ya gharama ya jumla

Muda wa majibu (wastani)

Alama vs Muda wa majibu (wastani)

Jumla ya tokeni za matokeo

Alama vs Jumla ya tokeni za matokeo

Mgawanyo wa kategoria

Mbinu za kupinga AI Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Seed-2.0-Lite 8.3 10.0 75.0% 0 17.99s 996 7,142
Qwen3.5-Flash 10.0 10.0 100.0% 0 59.11s 383 32,992
Mchanganyiko Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Seed-2.0-Lite 10.0 10.0 100.0% 0 37.67s 506 4,299
Qwen3.5-Flash 10.0 10.0 100.0% 0 17.78s 483 8,270
Uchanganuzi na uchimbaji wa data Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Seed-2.0-Lite 10.0 10.0 100.0% 0 9.07s 246 1,742
Qwen3.5-Flash 7.3 5.9 83.3% 1 56.99s 235 16,237
Mahususi kwa domeni Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Seed-2.0-Lite 5.9 7.2 55.6% 1 88.74s 15 23,897
Qwen3.5-Flash 5.3 7.2 44.4% 1 146.50s 58 43,615
Akili ya jumla Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Seed-2.0-Lite 6.7 3.6 66.7% 1 18.25s 304 1,620
Qwen3.5-Flash 6.1 3.1 66.7% 1 40.05s 99 38,486
Ufuataji wa maagizo Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Seed-2.0-Lite 10.0 10.0 100.0% 0 7.26s 71 1,480
Qwen3.5-Flash 10.0 10.0 100.0% 0 63.49s 98 14,139
Puzzle Solving Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Seed-2.0-Lite 9.0 7.9 88.9% 1 11.03s 461 3,532
Qwen3.5-Flash 6.4 4.4 77.8% 2 56.74s 162 24,276
Mwito wa zana Alama Uthabiti Kiwango cha kupita kwa kila jaribio Majaribio yasiyo thabiti Majaribio sahihi Muda wa majibu (wastani) Tokeni za matokeo Tokeni za hoja
Seed-2.0-Lite 10.0 10.0 100.0% 0 12.38s 222 1,011
Qwen3.5-Flash 10.0 10.0 100.0% 0 10.33s 309 1,284

Ulinganisho wa haraka

Badilisha jozi ya ulinganisho