#46 xAI: Grok Build 0.1
medium- Gharama
- $0.028
- Muda
- 81.3s
- Tokeni
- 14,009 tok
Muhtasari
Grok Build 0.1 hupata alama 7.6 kwenye AI BENCHY na iko nafasi ya #46. Ina reliability 10.0, pass rate 70.0%, gharama jumla $0.633, na wastani wa response time 26.36s.
Kinachofanya Grok Build 0.1 iwe ya kipekee: Inaonekana zaidi kwenye Mahususi kwa domeni, ambako iko #3; huku Uandishi wa msimbo ikiwa eneo lake dhaifu zaidi kwenye #16. Inatumia reasoning tokens nyingi isivyo kawaida, jambo linaloweza kueleza runs polepole au ghali zaidi.
7.6
Uthabiti
8.5
10.0
Jumla ya tokeni za matokeo
295,603
Jumla ya tokeni za ingizo
0
Bei ya ingizo
$1.000 / 1M
Bei ya toleo
$2.000 / 1M
Majaribio yasiyo thabiti
4
Majaribio yasiyo thabiti yalikuwa na matokeo mchanganyiko kati ya run (angalau kupita moja na kufeli moja).
Onyesho la kizazi
Prompt: Create a detailed SVG illustration of a hamster playing table tennis.
Historia ya uendeshaji
| Imepimwa tarehe | Alama | Uaminifu | Majaribio sahihi | Jumla ya gharama | Linganisha |
|---|---|---|---|---|---|
| 2026-06-04 14:22 Jaribio jipya limeongezwa | 7.4 | 10.0 | $0.927 | Linganisha | |
| 2026-05-26 13:30 Jaribio tena | 7.7 | 10.0 | $0.729 | Linganisha | |
| 2026-05-22 00:36 Suite imebadilika | 7.6 | 10.0 | $0.633 | Uendeshaji wa sasa |
Uendeshaji huu ulitumia suite tofauti ya benchmark. Zingatia mabadiliko ya suite unapochambua mwenendo wa kihistoria.
Ulinganisho wa uendeshaji
| Uendeshaji | Alama | Uthabiti | Uaminifu | Majaribio sahihi | Majaribio yasiyo thabiti | Jumla ya tokeni za matokeo | Jumla ya tokeni za ingizo | Jumla ya gharama | Muda wa majibu (wastani) |
|---|---|---|---|---|---|---|---|---|---|
| 2026-05-22 00:36 · Suite imebadilika | 7.6 | 8.5 | 10.0 | 12/20 | 4 | 295,603 | 0 | $0.633 | 26.36s |
| 2026-05-26 13:30 · Jaribio tena | 7.7 | 9.9 | 10.0 | 13/20 | 0 | 343,639 | 0 | $0.729 | 42.39s |
| Tofauti | -0.1 | -1.4 | 0.0 | -1 | +4 | -48036 | 0 | -$0.097 | -16029ms |
Chagua modeli ya kwanza, kisha bofya modeli ya pili kufungua ukurasa wa kulinganisha bega kwa bega.
| Kategoria | Alama | Uthabiti | Majaribio sahihi |
|---|---|---|---|
| Mbinu za kupinga AI | 10.0 | 10.0 | |
| Uandishi wa msimbo | 5.3 | 2.9 | |
| Mchanganyiko | 10.0 | 10.0 | |
| Uchanganuzi na uchimbaji wa data | 10.0 | 10.0 | |
| Mahususi kwa domeni | 5.3 | 10.0 | |
| Akili ya jumla | 3.8 | 2.5 | |
| Ufuataji wa maagizo | 9.8 | 10.0 | |
| Utatuzi wa mafumbo | 6.2 | 7.5 | |
| Mwito wa zana | 10.0 | 10.0 | |
| Maarifa ya jumla | 3.0 | 10.0 |