#15 Grok 4.20 Beta
medium- Cost
- $0.034
- Time
- 91.0s
- Tokens
- 13,523 tok
Muhtasari
Grok 4.20 Beta hupata alama 8.2 kwenye AI BENCHY na iko nafasi ya #15. Ina reliability Haipo, pass rate 79.6%, gharama jumla $0.633, na wastani wa response time 9.81s.
Modeli iliyohifadhiwa: modeli hii haitasasishwa tena wala kujaribiwa kwenye majaribio mapya.
Kidokezo cha utambulisho
Grok 4.20 Beta ilikuwa toleo la awali la xAI: Grok 4.20.
8.2
Uthabiti
9.1
Haipo
Jumla ya tokeni za matokeo
93,477
Jumla ya tokeni za ingizo
0
Bei ya ingizo
$0.000 / 1M
Bei ya toleo
$0.000 / 1M
Majaribio yasiyo thabiti
2
Majaribio yasiyo thabiti yalikuwa na matokeo mchanganyiko kati ya run (angalau kupita moja na kufeli moja).
Generation showcase
Prompt: Create a detailed SVG illustration of a hamster playing table tennis.
Historia ya uendeshaji
| Imepimwa tarehe | Alama | Uaminifu | Majaribio sahihi | Jumla ya gharama | Linganisha |
|---|---|---|---|---|---|
| 2026-05-06 14:15 Jaribio tena | 8.5 | Haipo | $0.750 ↑ | Linganisha | |
| 2026-05-06 14:15 Jaribio tena | 8.2 | Haipo | $0.633 | Linganisha | |
| 2026-05-06 14:15 Jaribio tena | 8.2 | Haipo | $0.633 | Uendeshaji wa sasa | |
| 2026-05-06 14:15 Suite imebadilika | 8.2 | Haipo | $0.633 | Linganisha | |
| 2026-04-11 01:19 Jaribio la kwanza lililorekodiwa | 8.0 | Haipo | $0.633 | Linganisha |
Ulinganisho wa uendeshaji
| Uendeshaji | Alama | Uthabiti | Uaminifu | Majaribio sahihi | Majaribio yasiyo thabiti | Jumla ya tokeni za matokeo | Jumla ya tokeni za ingizo | Jumla ya gharama | Muda wa majibu (wastani) |
|---|---|---|---|---|---|---|---|---|---|
| 2026-05-06 14:15 · Jaribio tena | 8.2 | 9.1 | Haipo | 13/18 | 2 | 93,477 | 0 | $0.633 | 9.81s |
| 2026-05-06 14:15 · Jaribio tena | 8.2 | 9.1 | Haipo | 13/18 | 2 | 93,477 | 0 | $0.633 | 9.81s |
| Tofauti | 0.0 | 0.0 | 0 | 0 | 0 | 0 | $0.000 | 0ms |
Chagua modeli ya kwanza, kisha bofya modeli ya pili kufungua ukurasa wa kulinganisha bega kwa bega.
| Kategoria | Alama | Uthabiti | Majaribio sahihi |
|---|---|---|---|
| Mbinu za kupinga AI | 8.7 | 7.9 | |
| Uandishi wa msimbo | 10.0 | 10.0 | |
| Mchanganyiko | 10.0 | 10.0 | |
| Uchanganuzi na uchimbaji wa data | 10.0 | 10.0 | |
| Mahususi kwa domeni | 5.3 | 10.0 | |
| Akili ya jumla | 10.0 | 10.0 | |
| Ufuataji wa maagizo | 9.8 | 10.0 | |
| Utatuzi wa mafumbo | 8.2 | 7.2 | |
| Mwito wa zana | 3.0 | 10.0 |