#51 MoonshotAI: Kimi K2.5
medium
- Cost: $0.030
- Time: 58.6s
- Tokens: 8,683 tok
Summary
Kimi K2.5 scores 7.0 on AI BENCHY and ranks #51. Its reliability is N/A, its pass rate is 72.2%, its total cost is $0.220, and its average response time is 72.43s.
What makes Kimi K2.5 stand out: it does best in General intelligence, where it ranks #4, while Coding is its weakest area at #15.
7.0
Consistency
6.8
N/A
Total output tokens: 127,046
Total input tokens: 0
Input price: $0.440 / 1M tokens
Output price: $2.000 / 1M tokens
Flaky tests: 7
Flaky tests had mixed results across runs (at least one pass and at least one fail).
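The flaky-test definition above can be sketched in code. This is only a minimal illustration, assuming each test's per-run outcomes are available as a list of booleans; the names `is_flaky` and `count_flaky` and the sample data are hypothetical and not taken from AI BENCHY.

```python
from typing import Dict, List


def is_flaky(outcomes: List[bool]) -> bool:
    """Return True when the runs include at least one pass and at least one fail."""
    return any(outcomes) and not all(outcomes)


def count_flaky(results: Dict[str, List[bool]]) -> int:
    """Count the tests whose outcomes are mixed across runs."""
    return sum(1 for outcomes in results.values() if is_flaky(outcomes))


# Hypothetical sample data: test name -> pass/fail outcome per run.
sample_results = {
    "test_a": [True, True, True],     # always passes: stable
    "test_b": [True, False, True],    # mixed outcomes: flaky
    "test_c": [False, False, False],  # always fails: stable, not flaky
}

flaky_total = count_flaky(sample_results)
```

Under this reading, a test that fails on every run counts as a consistent failure rather than a flaky test.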
Generation showcase
Prompt: Create a detailed SVG illustration of a hamster playing table tennis.
Run history
| Tested on | Score | Reliability | Correct tests | Total cost |
|---|---|---|---|---|
| 2026-06-04 13:43 (new test added) | 6.8 | 10.0 | | $0.328 ↓ |
| 2026-05-22 00:12 (suite changed) | 6.7 | 10.0 | | $0.314 |
| 2026-04-20 17:48 (first recorded run, current run) | 7.0 | N/A | | $0.220 |
| Category | Score | Consistency | Correct tests |
|---|---|---|---|
| Anti-AI tricks | 7.3 | 5.8 | |
| Coding | 4.7 | 1.6 | |
| Mixed | 10.0 | 10.0 | |
| Data parsing and extraction | 10.0 | 10.0 | |
| Domain-specific | 3.5 | 4.4 | |
| General intelligence | 6.5 | 3.4 | |
| Instruction following | 10.0 | 10.0 | |
| Puzzle solving | 5.3 | 7.3 | |
| Tool calling | 10.0 | 10.0 | |