Clasament modele pentru Inteligență generală

Vezi ce modele AI se descurcă cel mai bine la Inteligență generală, care rămân fiabile și unde apar cele mai mari diferențe. Sortează după: Metrică ↑.

Modele afișate

Media pentru Scor Inteligență generală

6.1

Cel mai bun model

Qwen3.5-35B-A3B 2.8

Motive de eșec

Cu motivul de eșec Nu a urmat instrucțiunile78 Cu motivul de eșec Răspuns greșit59 Cu motivul de eșec Eroare API12 Cu motivul de eșec Timp expirat4

210/210

Rang	Model	Companie	Scor Inteligență generală	Scor	Cost total	Teste corecte	Timp de răspuns (mediu)
#119	Qwen3.5-35B-A3B medium	Qwen	2.8	6.2	$0.837	0/1	30.3s
Total teste 1 Teste greșite 1 Cost total $0.837 Timp de răspuns (mediu) 30.3s
#204	Qwen3.5-9B medium	Qwen	2.8	3.8	$0.036	0/1	226.4s
Total teste 1 Teste greșite 1 Cost total $0.036 Timp de răspuns (mediu) 226.4s
#135	Hy3 preview high	Tencent	3.0	5.9	$0.048	0/1	0ms
Total teste 1 Teste greșite 1 Cost total $0.048 Timp de răspuns (mediu) 0ms
#153	Hy3 preview low	Tencent	3.0	5.5	$0.015	0/1	0ms
Total teste 1 Teste greșite 1 Cost total $0.015 Timp de răspuns (mediu) 0ms
#175	Qwen3.6 Plus Preview medium	Qwen	3.0	4.9	$0.000	0/1	0ms
Total teste 1 Teste greșite 1 Cost total $0.000 Timp de răspuns (mediu) 0ms
#186	Laguna M.1 medium	Poolside	3.0	4.7	$0.033	0/1	0ms
Total teste 1 Teste greșite 1 Cost total $0.033 Timp de răspuns (mediu) 0ms
#192	Laguna M.1 none	Poolside	3.0	4.4	$0.009	0/1	0ms
Total teste 1 Teste greșite 1 Cost total $0.009 Timp de răspuns (mediu) 0ms
#198	Laguna Xs.2 medium	Poolside	3.0	4.1	$0.015	0/1	0ms
Total teste 1 Teste greșite 1 Cost total $0.015 Timp de răspuns (mediu) 0ms
#205	Laguna Xs.2 none	Poolside	3.0	3.8	$0.004	0/1	0ms
Total teste 1 Teste greșite 1 Cost total $0.004 Timp de răspuns (mediu) 0ms
#207	Nemotron 3 Nano Omni 30b A3b Reasoning medium	NVIDIA	3.0	3.4	$0.000	0/1	0ms
Total teste 1 Teste greșite 1 Cost total $0.000 Timp de răspuns (mediu) 0ms
#208	Nemotron 3 Nano Omni 30b A3b Reasoning none	NVIDIA	3.0	3.2	$0.000	0/1	0ms
Total teste 1 Teste greșite 1 Cost total $0.000 Timp de răspuns (mediu) 0ms
#76	DeepSeek V3.2 medium	DeepSeek	3.4	7.0	$0.078	0/1	58.3s
Total teste 1 Teste greșite 1 Cost total $0.078 Timp de răspuns (mediu) 58.3s
#72	Qwen3.5-122B-A10B medium	Qwen	3.4	7.1	$1.046	0/1	34.1s
Total teste 1 Teste greșite 1 Cost total $1.046 Timp de răspuns (mediu) 34.1s
#91	LongCat 2.0 low	Meituan	3.4	6.7	$0.391	0/1	22.5s
Total teste 1 Teste greșite 1 Cost total $0.391 Timp de răspuns (mediu) 22.5s
#67	Step 3.7 Flash low	Stepfun	3.4	7.3	$0.454	0/1	7.00s
Total teste 1 Teste greșite 1 Cost total $0.454 Timp de răspuns (mediu) 7.00s

Clasament Inteligență generală

Filtrează modelele

Top modele după Scor Inteligență generală

Scor Inteligență generală vs cost total

Top modele după Timp de răspuns (mediu)