#29 Grok 4.20 Beta
medium
- Cost: $0.034
- Time: 91.0s
- Tokens: 13,523 tok
Summary
Grok 4.20 Beta scores 8.0 on AI BENCHY and ranks #29. Its reliability is N/A, its pass rate is 74.1%, its total cost is $0.633, and its average response time is 9.81s.
What makes Grok 4.20 Beta stand out: it is strongest in Coding, where it ranks #1, while Instruction following is its weakest area at #15.
Archived model: this model will no longer be updated or run against new tests.
Identity hint
Grok 4.20 Beta was an earlier version of xAI: Grok 4.20.
Score: 8.0
Consistency: 9.1
Reliability: N/A
Total output tokens: 93,477
Total input tokens: 0
Input price: $0.000 / 1M
Output price: $0.000 / 1M
Flaky tests: 2
Flaky tests had mixed results across runs (at least one pass and one fail).
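The flaky-test definition above (at least one pass and one fail across runs) can be sketched in a few lines. This is a minimal illustration, not the benchmark's actual implementation: the function name `find_flaky_tests`, the test names, and the per-run outcomes are all hypothetical.

```python
from typing import Dict, List


def find_flaky_tests(results: Dict[str, List[bool]]) -> List[str]:
    """Return the names of tests with at least one pass and one fail.

    `results` maps a test name to its outcomes across runs (True = pass).
    This data shape is an assumption made for illustration.
    """
    return sorted(
        name
        for name, runs in results.items()
        # any(runs): at least one pass; not all(runs): at least one fail.
        if any(runs) and not all(runs)
    )


# Hypothetical outcomes for four tests across three runs.
example_results = {
    "test_a": [True, True, True],     # always passes: stable
    "test_b": [True, False, True],    # mixed: flaky
    "test_c": [False, False, False],  # always fails: stable
    "test_d": [False, True, True],    # mixed: flaky
}

flaky = find_flaky_tests(example_results)
print(flaky)  # ['test_b', 'test_d']
```

A test that always fails is not counted as flaky under this definition; it is consistently wrong rather than inconsistent.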
Generation showcase
Prompt: Create a detailed SVG illustration of a hamster playing table tennis.
Run history
| Tested on | Score | Reliability | Correct tests | Total cost | Compare |
|---|---|---|---|---|---|
| 2026-05-06 14:15 Retest | 8.5 | N/A | | $0.750 ↑ | Compare |
| 2026-05-06 14:15 Retest | 8.2 | N/A | | $0.633 | Compare |
| 2026-05-06 14:15 Retest | 8.2 | N/A | | $0.633 | Compare |
| 2026-05-06 14:15 Suite changed | 8.2 | N/A | | $0.633 | Compare |
| 2026-04-11 01:19 First recorded run | 8.0 | N/A | | $0.633 | Current run |
| Category | Score | Consistency | Correct tests |
|---|---|---|---|
| Anti-AI tricks | 8.7 | 7.9 | |
| Coding | 10.0 | 10.0 | |
| Mixed | 10.0 | 10.0 | |
| Data parsing and extraction | 10.0 | 10.0 | |
| Domain-specific | 5.3 | 10.0 | |
| General intelligence | 10.0 | 10.0 | |
| Instruction following | 8.3 | 10.0 | |
| Puzzle solving | 8.2 | 7.2 | |
| Tool calling | 3.0 | 10.0 | |