#47
X AI · Released: 2026-03-12 · x-ai/grok-4.20-multi-agent-beta::medium
Flaky tests
6
Flaky tests had mixed results across runs (at least one pass and at least one fail).
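The flaky-test criterion above (at least one pass and at least one fail across runs) can be sketched as a small check. This is only an illustration: the function name `find_flaky_tests`, the input shape (test name mapped to a list of pass/fail booleans), and the sample data are assumptions, not the benchmark's actual implementation or results.

```python
from typing import Dict, List


def find_flaky_tests(runs: Dict[str, List[bool]]) -> List[str]:
    """Return names of tests with at least one pass and at least one fail."""
    return sorted(
        name
        for name, outcomes in runs.items()
        # any() is True if some run passed; not all() is True if some run failed.
        if any(outcomes) and not all(outcomes)
    )


# Hypothetical outcomes for three tests across three runs (True = pass).
example_runs = {
    "test_a": [True, True, True],     # always passes: stable
    "test_b": [True, False, True],    # mixed results: flaky
    "test_c": [False, False, False],  # always fails: stable, just wrong
}

print(find_flaky_tests(example_runs))
```

Note that under this definition a test that fails on every run is not counted as flaky, since its results are consistent.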
Charts
Average score vs Total cost
Response time (average)
Average score vs Response time (average)
Total output tokens
Average score vs Total output tokens
Quick comparison
- Grok 4.20 Multi-Agent Beta (medium) vs Seed-2.0-Lite (none)
- Grok 4.20 Multi-Agent Beta (medium) vs Qwen3.5-122B-A10B (none)
- Grok 4.20 Multi-Agent Beta (medium) vs Qwen3.5-35B-A3B (none)
- Grok 4.20 Multi-Agent Beta (medium) vs gpt-oss-120b (medium), available for free
- Grok 4.20 Multi-Agent Beta (medium) vs MiniMax M2.5 (medium)
- Grok 4.20 Multi-Agent Beta (medium) vs Gemini 3 Flash Preview (medium)
- Grok 4.20 Multi-Agent Beta (medium) vs Gemini 3.1 Pro Preview (medium)
- Grok 4.20 Multi-Agent Beta (medium) vs Step 3.5 Flash (medium), available for free
Category breakdown
| Category | Average score | Stability | Correct tests |
|---|---|---|---|
| Anti-AI tactics | 4.0 | 4.4 | |
| Mixed | 10.0 | 10.0 | |
| Data parsing and extraction | 9.9 | 10.0 | |
| Domain-specific | 10.0 | 7.2 | |
| General intelligence | 4.0 | 2.8 | |
| Instruction following | 9.0 | 10.0 | |
| Puzzle solving | 6.3 | 5.1 | |
| Tool calling | 10.0 | 10.0 | |