| Mbinu za kupinga AI | Jaribio huhesabiwa kuwa limepita kikamilifu tu ikiwa marudio yake yote yamepita. Hakuna majibu yaliyoshindwa. Muda wa majibu (wastani) 4687ms Muda wa majibu (upeo) 6680ms Muda wa majibu (jumla) 14061ms Jaribio huhesabiwa kuwa limepita kikamilifu tu ikiwa marudio yake yote yamepita. Nafasi: #6/29 82% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 100.0% Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 100.0% Google: Gemini 3 Pro Preview - Uchambuzi (medium) 100.0% Qwen: Qwen3.5 Plus 2026-02-15 - Uchambuzi (medium) 100.0% Google: Gemini 3 Flash Preview - Uchambuzi (low) 100.0% OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 100.0% MoonshotAI: Kimi K2.5 - Bila uchambuzi 0.0% 0.0% 100.0% | 10.00 Wastani wa alama katika majaribio yote ya benchmark. Nafasi: #6/29 82% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 10.00 Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 10.00 Google: Gemini 3 Pro Preview - Uchambuzi (medium) 10.00 Qwen: Qwen3.5 Plus 2026-02-15 - Uchambuzi (medium) 10.00 Google: Gemini 3 Flash Preview - Uchambuzi (low) 10.00 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 10.00 Z.ai: GLM 4.7 Flash - Bila uchambuzi 1.00 1.00 10.00 | 10.00 Alama ya uthabiti inaonyesha utulivu kati ya marudio (10 = thabiti sana, hata ikiwa ni makosa mfululizo). Nafasi: #6/29 82% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 10.00 Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 10.00 Google: Gemini 3 Pro Preview - Uchambuzi (medium) 10.00 Qwen: Qwen3.5 Plus 2026-02-15 - Uchambuzi (medium) 10.00 Google: Gemini 3 Flash Preview - Uchambuzi (low) 10.00 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 10.00 Anthropic: Claude Opus 4.6 - Uchambuzi (medium) 4.41 4.41 10.00 | 100.0% Kiwango cha kupita kwa kila jaribio = majaribio yaliyopita / jumla ya majaribio katika marudio yote. Nafasi: #6/29 82% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 100.0% Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 100.0% Google: Gemini 3 Pro Preview - Uchambuzi (medium) 100.0% Qwen: Qwen3.5 Plus 2026-02-15 - Uchambuzi (medium) 100.0% Google: Gemini 3 Flash Preview - Uchambuzi (low) 100.0% OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 100.0% xAI: Grok 4.1 Fast - Bila uchambuzi 0.0% 0.0% 100.0% | 0 Majaribio yasiyo thabiti yalikuwa na matokeo mchanganyiko kati ya marudio (angalau kupita moja na kufeli moja). Nafasi: #6/29 82% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 0 Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 0 Google: Gemini 3 Pro Preview - Uchambuzi (medium) 0 Qwen: Qwen3.5 Plus 2026-02-15 - Uchambuzi (medium) 0 Google: Gemini 3 Flash Preview - Uchambuzi (low) 0 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 0 Anthropic: Claude Opus 4.6 - Uchambuzi (medium) 2 0 2 | 6.00 Hupima uwazi, ufanisi na uthabiti wa hoja bila kutegemea usahihi wa jibu la mwisho. Nafasi: #15/19 22% Anthropic: Claude Opus 4.6 - Uchambuzi (medium) 10.00 OpenAI: gpt-oss-120b - Uchambuzi (medium) 10.00 Anthropic: Claude Sonnet 4.6 - Uchambuzi (medium) 9.89 Z.ai: GLM 5 - Uchambuzi (medium) 9.83 StepFun: Step 3.5 Flash - Uchambuzi (medium) 9.83 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 6.00 Qwen: Qwen3 Coder Next - Uchambuzi (medium) 4.00 4.00 10.00 | 4687ms | $0.02371 Jumla ya gharama Nafasi: #24/29 18% StepFun: Step 3.5 Flash - Uchambuzi (medium) $0.00000 OpenAI: GPT-4o-mini - Bila uchambuzi $0.00018 Z.ai: GLM 4.7 Flash - Bila uchambuzi $0.00020 Xiaomi: MiMo-V2-Flash - Bila uchambuzi $0.00024 xAI: Grok 4.1 Fast - Bila uchambuzi $0.00049 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) $0.02371 Anthropic: Claude Opus 4.6 - Uchambuzi (medium) $0.05049 $0.00000 $0.05049 |
| Uchanganuzi na uchimbaji wa data | Jaribio huhesabiwa kuwa limepita kikamilifu tu ikiwa marudio yake yote yamepita. Hakuna majibu yaliyoshindwa. Muda wa majibu (wastani) 3180ms Muda wa majibu (upeo) 3585ms Muda wa majibu (jumla) 6360ms Jaribio huhesabiwa kuwa limepita kikamilifu tu ikiwa marudio yake yote yamepita. Nafasi: #6/29 82% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 100.0% Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 100.0% Google: Gemini 3 Pro Preview - Uchambuzi (medium) 100.0% Qwen: Qwen3.5 Plus 2026-02-15 - Uchambuzi (medium) 100.0% Google: Gemini 3 Flash Preview - Uchambuzi (low) 100.0% OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 100.0% Z.ai: GLM 4.7 Flash - Bila uchambuzi 0.0% 0.0% 100.0% | 10.00 Wastani wa alama katika majaribio yote ya benchmark. Nafasi: #6/29 82% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 10.00 Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 10.00 Google: Gemini 3 Pro Preview - Uchambuzi (medium) 10.00 Qwen: Qwen3.5 Plus 2026-02-15 - Uchambuzi (medium) 10.00 Google: Gemini 3 Flash Preview - Uchambuzi (low) 10.00 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 10.00 Z.ai: GLM 4.7 Flash - Bila uchambuzi 0.50 0.50 10.00 | 10.00 Alama ya uthabiti inaonyesha utulivu kati ya marudio (10 = thabiti sana, hata ikiwa ni makosa mfululizo). Nafasi: #6/29 82% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 10.00 Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 10.00 Google: Gemini 3 Pro Preview - Uchambuzi (medium) 10.00 Qwen: Qwen3.5 Plus 2026-02-15 - Uchambuzi (medium) 10.00 Google: Gemini 3 Flash Preview - Uchambuzi (low) 10.00 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 10.00 Z.ai: GLM 5 - Uchambuzi (medium) 5.56 5.56 10.00 | 100.0% Kiwango cha kupita kwa kila jaribio = majaribio yaliyopita / jumla ya majaribio katika marudio yote. Nafasi: #6/29 82% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 100.0% Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 100.0% Google: Gemini 3 Pro Preview - Uchambuzi (medium) 100.0% Qwen: Qwen3.5 Plus 2026-02-15 - Uchambuzi (medium) 100.0% Google: Gemini 3 Flash Preview - Uchambuzi (low) 100.0% OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 100.0% Xiaomi: MiMo-V2-Flash - Bila uchambuzi 16.7% 0.0% 100.0% | 0 Majaribio yasiyo thabiti yalikuwa na matokeo mchanganyiko kati ya marudio (angalau kupita moja na kufeli moja). Nafasi: #6/29 82% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 0 Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 0 Google: Gemini 3 Pro Preview - Uchambuzi (medium) 0 Qwen: Qwen3.5 Plus 2026-02-15 - Uchambuzi (medium) 0 Google: Gemini 3 Flash Preview - Uchambuzi (low) 0 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 0 Z.ai: GLM 5 - Uchambuzi (medium) 1 0 1 | 1.25 Hupima uwazi, ufanisi na uthabiti wa hoja bila kutegemea usahihi wa jibu la mwisho. Nafasi: #19/19 0% OpenAI: gpt-oss-120b - Uchambuzi (medium) 10.00 Z.ai: GLM 4.7 Flash - Uchambuzi (medium) 9.87 Anthropic: Claude Sonnet 4.6 - Uchambuzi (medium) 9.83 Anthropic: Claude Opus 4.6 - Uchambuzi (medium) 9.83 Z.ai: GLM 5 - Uchambuzi (medium) 9.80 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 1.25 1.25 10.00 | 3180ms | $0.02600 Jumla ya gharama Nafasi: #23/29 21% StepFun: Step 3.5 Flash - Uchambuzi (medium) $0.00000 Xiaomi: MiMo-V2-Flash - Uchambuzi (medium) $0.00029 Xiaomi: MiMo-V2-Flash - Bila uchambuzi $0.00029 Z.ai: GLM 4.7 Flash - Bila uchambuzi $0.00050 OpenAI: gpt-oss-120b - Uchambuzi (medium) $0.00052 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) $0.02600 Anthropic: Claude Opus 4.6 - Uchambuzi (medium) $0.07755 $0.00000 $0.07755 |
| Mahususi kwa domeni | Jaribio huhesabiwa kuwa limepita kikamilifu tu ikiwa marudio yake yote yamepita. Jibu lisilo sahihi: 2 Muda wa majibu (wastani) 64314ms Muda wa majibu (upeo) 100927ms Muda wa majibu (jumla) 192942ms Jaribio huhesabiwa kuwa limepita kikamilifu tu ikiwa marudio yake yote yamepita. Nafasi: #9/29 71% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 100.0% Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 66.7% Google: Gemini 3 Flash Preview - Bila uchambuzi 66.7% Anthropic: Claude Sonnet 4.6 - Bila uchambuzi 66.7% Z.ai: GLM 4.7 Flash - Bila uchambuzi 66.7% OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 33.3% Anthropic: Claude Sonnet 4.6 - Uchambuzi (medium) 0.0% 0.0% 100.0% | 4.00 Wastani wa alama katika majaribio yote ya benchmark. Nafasi: #9/29 71% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 10.00 Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 7.00 Google: Gemini 3 Flash Preview - Bila uchambuzi 7.00 Anthropic: Claude Sonnet 4.6 - Bila uchambuzi 7.00 Z.ai: GLM 4.7 Flash - Bila uchambuzi 7.00 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 4.00 Anthropic: Claude Sonnet 4.6 - Uchambuzi (medium) 1.00 1.00 10.00 | 7.21 Alama ya uthabiti inaonyesha utulivu kati ya marudio (10 = thabiti sana, hata ikiwa ni makosa mfululizo). Nafasi: #15/29 50% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 10.00 Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 10.00 Google: Gemini 3 Pro Preview - Uchambuzi (medium) 10.00 Google: Gemini 3 Flash Preview - Bila uchambuzi 10.00 Anthropic: Claude Sonnet 4.6 - Bila uchambuzi 10.00 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 7.21 Google: Gemini 3 Flash Preview - Uchambuzi (low) 4.41 4.41 10.00 | 55.6% Kiwango cha kupita kwa kila jaribio = majaribio yaliyopita / jumla ya majaribio katika marudio yote. Nafasi: #6/29 82% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 100.0% Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 66.7% Google: Gemini 3 Flash Preview - Bila uchambuzi 66.7% Anthropic: Claude Sonnet 4.6 - Bila uchambuzi 66.7% Z.ai: GLM 4.7 Flash - Bila uchambuzi 66.7% OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 55.6% Z.ai: GLM 5 - Bila uchambuzi 0.0% 0.0% 100.0% | 1 Majaribio yasiyo thabiti yalikuwa na matokeo mchanganyiko kati ya marudio (angalau kupita moja na kufeli moja). Nafasi: #15/29 50% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 0 Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 0 Google: Gemini 3 Pro Preview - Uchambuzi (medium) 0 Google: Gemini 3 Flash Preview - Bila uchambuzi 0 Anthropic: Claude Sonnet 4.6 - Bila uchambuzi 0 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 1 Google: Gemini 3 Flash Preview - Uchambuzi (low) 2 0 2 | 1.00 Hupima uwazi, ufanisi na uthabiti wa hoja bila kutegemea usahihi wa jibu la mwisho. Nafasi: #19/19 0% Xiaomi: MiMo-V2-Flash - Uchambuzi (medium) 8.72 OpenAI: gpt-oss-120b - Uchambuzi (medium) 8.53 StepFun: Step 3.5 Flash - Uchambuzi (medium) 8.44 Z.ai: GLM 5 - Uchambuzi (medium) 8.43 Z.ai: GLM 4.7 Flash - Uchambuzi (medium) 8.21 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 1.00 1.00 8.72 | 64314ms | $0.35664 Jumla ya gharama Nafasi: #27/29 7% StepFun: Step 3.5 Flash - Uchambuzi (medium) $0.00000 Z.ai: GLM 4.7 Flash - Bila uchambuzi $0.00005 Xiaomi: MiMo-V2-Flash - Bila uchambuzi $0.00008 Qwen: Qwen3 Coder Next - Bila uchambuzi $0.00010 Qwen: Qwen3 Coder Next - Uchambuzi (medium) $0.00010 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) $0.35664 Anthropic: Claude Sonnet 4.6 - Uchambuzi (medium) $0.64205 $0.00000 $0.64205 |
| Ufuataji wa maagizo | Jaribio huhesabiwa kuwa limepita kikamilifu tu ikiwa marudio yake yote yamepita. Hakufuata maelekezo: 1 Muda wa majibu (wastani) 3037ms Muda wa majibu (upeo) 3436ms Muda wa majibu (jumla) 6074ms Jaribio huhesabiwa kuwa limepita kikamilifu tu ikiwa marudio yake yote yamepita. Nafasi: #15/29 50% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 100.0% Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 100.0% Google: Gemini 3 Pro Preview - Uchambuzi (medium) 100.0% Qwen: Qwen3.5 Plus 2026-02-15 - Uchambuzi (medium) 100.0% OpenAI: GPT-5.2 - Uchambuzi (medium) 100.0% OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 50.0% xAI: Grok 4.1 Fast - Bila uchambuzi 0.0% 0.0% 100.0% | 9.00 Wastani wa alama katika majaribio yote ya benchmark. Nafasi: #14/29 54% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 10.00 Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 10.00 Anthropic: Claude Sonnet 4.6 - Uchambuzi (medium) 10.00 Z.ai: GLM 5 - Bila uchambuzi 10.00 OpenAI: gpt-oss-120b - Uchambuzi (medium) 10.00 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 9.00 xAI: Grok 4.1 Fast - Bila uchambuzi 1.00 1.00 10.00 | 10.00 Alama ya uthabiti inaonyesha utulivu kati ya marudio (10 = thabiti sana, hata ikiwa ni makosa mfululizo). Nafasi: #4/29 89% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 10.00 Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 10.00 Google: Gemini 3 Pro Preview - Uchambuzi (medium) 10.00 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 10.00 OpenAI: GPT-5.2 - Uchambuzi (medium) 10.00 Xiaomi: MiMo-V2-Flash - Uchambuzi (medium) 5.80 5.80 10.00 | 50.0% Kiwango cha kupita kwa kila jaribio = majaribio yaliyopita / jumla ya majaribio katika marudio yote. Nafasi: #20/29 32% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 100.0% Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 100.0% Google: Gemini 3 Pro Preview - Uchambuzi (medium) 100.0% Qwen: Qwen3.5 Plus 2026-02-15 - Uchambuzi (medium) 100.0% OpenAI: GPT-5.2 - Uchambuzi (medium) 100.0% OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 50.0% xAI: Grok 4.1 Fast - Bila uchambuzi 0.0% 0.0% 100.0% | 0 Majaribio yasiyo thabiti yalikuwa na matokeo mchanganyiko kati ya marudio (angalau kupita moja na kufeli moja). Nafasi: #6/29 82% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 0 Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 0 Google: Gemini 3 Pro Preview - Uchambuzi (medium) 0 Qwen: Qwen3.5 Plus 2026-02-15 - Uchambuzi (medium) 0 Google: Gemini 3 Flash Preview - Uchambuzi (low) 0 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 0 Google: Gemini 3 Flash Preview - Bila uchambuzi 1 0 1 | 1.00 Hupima uwazi, ufanisi na uthabiti wa hoja bila kutegemea usahihi wa jibu la mwisho. Nafasi: #19/19 0% Anthropic: Claude Sonnet 4.6 - Uchambuzi (medium) 10.00 Z.ai: GLM 5 - Uchambuzi (medium) 9.75 StepFun: Step 3.5 Flash - Uchambuzi (medium) 9.67 Anthropic: Claude Opus 4.6 - Uchambuzi (medium) 9.50 OpenAI: gpt-oss-120b - Uchambuzi (medium) 9.50 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 1.00 1.00 10.00 | 3037ms | $0.01216 Jumla ya gharama Nafasi: #23/29 21% StepFun: Step 3.5 Flash - Uchambuzi (medium) $0.00000 Z.ai: GLM 4.7 Flash - Bila uchambuzi $0.00006 Xiaomi: MiMo-V2-Flash - Bila uchambuzi $0.00008 Qwen: Qwen3 Coder Next - Bila uchambuzi $0.00013 Qwen: Qwen3 Coder Next - Uchambuzi (medium) $0.00014 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) $0.01216 Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) $0.03134 $0.00000 $0.03134 |
| Puzzle Solving | Jaribio huhesabiwa kuwa limepita kikamilifu tu ikiwa marudio yake yote yamepita. Hakufuata maelekezo: 1 Muda wa majibu (wastani) 4610ms Muda wa majibu (upeo) 7191ms Muda wa majibu (jumla) 13830ms Jaribio huhesabiwa kuwa limepita kikamilifu tu ikiwa marudio yake yote yamepita. Nafasi: #8/29 75% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 100.0% Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 100.0% Google: Gemini 3 Pro Preview - Uchambuzi (medium) 100.0% Qwen: Qwen3.5 Plus 2026-02-15 - Uchambuzi (medium) 100.0% Google: Gemini 3 Flash Preview - Uchambuzi (low) 100.0% OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 66.7% StepFun: Step 3.5 Flash - Uchambuzi (medium) 0.0% 0.0% 100.0% | 7.00 Wastani wa alama katika majaribio yote ya benchmark. Nafasi: #9/29 71% Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 10.00 Google: Gemini 3 Pro Preview - Uchambuzi (medium) 10.00 Qwen: Qwen3.5 Plus 2026-02-15 - Uchambuzi (medium) 10.00 Google: Gemini 3 Flash Preview - Uchambuzi (low) 10.00 Anthropic: Claude Sonnet 4.6 - Uchambuzi (medium) 10.00 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 7.00 Xiaomi: MiMo-V2-Flash - Uchambuzi (medium) 1.00 1.00 10.00 | 7.38 Alama ya uthabiti inaonyesha utulivu kati ya marudio (10 = thabiti sana, hata ikiwa ni makosa mfululizo). Nafasi: #20/29 32% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 10.00 Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 10.00 Google: Gemini 3 Pro Preview - Uchambuzi (medium) 10.00 Qwen: Qwen3.5 Plus 2026-02-15 - Uchambuzi (medium) 10.00 Google: Gemini 3 Flash Preview - Uchambuzi (low) 10.00 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 7.38 MiniMax: MiniMax M2.5 - Uchambuzi (medium) 4.79 4.79 10.00 | 77.8% Kiwango cha kupita kwa kila jaribio = majaribio yaliyopita / jumla ya majaribio katika marudio yote. Nafasi: #8/29 75% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 100.0% Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 100.0% Google: Gemini 3 Pro Preview - Uchambuzi (medium) 100.0% Qwen: Qwen3.5 Plus 2026-02-15 - Uchambuzi (medium) 100.0% Google: Gemini 3 Flash Preview - Uchambuzi (low) 100.0% OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 77.8% OpenAI: GPT-4o-mini - Bila uchambuzi 0.0% 0.0% 100.0% | 1 Majaribio yasiyo thabiti yalikuwa na matokeo mchanganyiko kati ya marudio (angalau kupita moja na kufeli moja). Nafasi: #18/29 39% Google: Gemini 3 Flash Preview - Uchambuzi (medium) 0 Google: Gemini 3.1 Pro Preview - Uchambuzi (medium) 0 Google: Gemini 3 Pro Preview - Uchambuzi (medium) 0 Qwen: Qwen3.5 Plus 2026-02-15 - Uchambuzi (medium) 0 Google: Gemini 3 Flash Preview - Uchambuzi (low) 0 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 1 OpenAI: GPT-5 Nano - Uchambuzi (medium) 2 0 2 | 6.00 Hupima uwazi, ufanisi na uthabiti wa hoja bila kutegemea usahihi wa jibu la mwisho. Nafasi: #18/19 6% Z.ai: GLM 5 - Uchambuzi (medium) 9.50 Anthropic: Claude Sonnet 4.6 - Uchambuzi (medium) 9.44 Anthropic: Claude Opus 4.6 - Uchambuzi (medium) 9.44 MoonshotAI: Kimi K2.5 - Uchambuzi (medium) 9.26 StepFun: Step 3.5 Flash - Uchambuzi (medium) 9.22 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) 6.00 Qwen: Qwen3 Coder Next - Uchambuzi (medium) 4.33 4.33 9.50 | 4610ms | $0.02559 Jumla ya gharama Nafasi: #25/29 14% StepFun: Step 3.5 Flash - Uchambuzi (medium) $0.00000 Z.ai: GLM 4.7 Flash - Bila uchambuzi $0.00008 OpenAI: GPT-4o-mini - Bila uchambuzi $0.00028 xAI: Grok 4.1 Fast - Bila uchambuzi $0.00053 Qwen: Qwen3 Coder Next - Uchambuzi (medium) $0.00058 OpenAI: GPT-5.3-Codex - Uchambuzi (medium) $0.02559 Qwen: Qwen3.5 Plus 2026-02-15 - Uchambuzi (medium) $0.05508 $0.00000 $0.05508 |