| Anti-AI techniques | 100.0% (rank #5/29, 86%). A test counts as fully passed only if all of its repetitions passed. No failed responses. Response time: average 3496ms, maximum 4305ms, total 10487ms. | 10.00 (rank #5/29, 86%). Average score across all benchmark tests. | 10.00 (rank #5/29, 86%). The consistency score reflects stability across repetitions (10 = very consistent, even if consistently wrong). | 100.0% (rank #5/29, 86%). Per-test pass rate = passed tests / total tests across all repetitions. | 0 (rank #5/29, 86%). Flaky tests had mixed results across repetitions (at least one pass and one fail). | 6.23 (rank #14/19, 28%). Measures the clarity, efficiency and consistency of the reasoning, independent of the correctness of the final answer. Note: for some Gemini models the reasoning text is only partially available, so the reasoning score may appear lower. | 3496ms | $0.00844 total cost (rank #16/29, 46%) |
| Data parsing and extraction | 100.0% (rank #5/29, 86%). No failed responses. Response time: average 9460ms, maximum 14717ms, total 18919ms. | 10.00 (rank #5/29, 86%) | 10.00 (rank #5/29, 86%) | 100.0% (rank #5/29, 86%) | 0 (rank #5/29, 86%) | 4.73 (rank #17/19, 11%) | 9460ms | $0.01354 total cost (rank #18/29, 39%) |
| Domain-specific | 33.3% (rank #8/29, 75%). Incorrect answer: 2. Response time: average 8314ms, maximum 14399ms, total 24941ms. | 4.00 (rank #8/29, 75%) | 4.41 (rank #23/29, 21%) | 55.5% (rank #11/29, 64%) | 2 (rank #23/29, 21%) | 1.83 (rank #18/19, 6%) | 8314ms | $0.01993 total cost (rank #18/29, 39%) |
| Instruction following | 50.0% (rank #14/29, 54%). Did not follow instructions: 1. Response time: average 7016ms, maximum 7350ms, total 14031ms. | 7.50 (rank #15/29, 50%) | 9.99 (rank #17/29, 43%) | 50.0% (rank #19/29, 36%) | 0 (rank #5/29, 86%) | 5.00 (rank #17/19, 11%) | 7016ms | $0.00878 total cost (rank #20/29, 32%) |
| Puzzle Solving | 100.0% (rank #5/29, 86%). No failed responses. Response time: average 6440ms, maximum 10274ms, total 19319ms. | 10.00 (rank #4/29, 89%) | 10.00 (rank #5/29, 86%) | 100.0% (rank #5/29, 86%) | 0 (rank #5/29, 86%) | 7.50 (rank #13/19, 33%) | 6440ms | $0.01105 total cost (rank #17/29, 43%) |
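
The repetition-based metrics in the table (fully passed, per-test pass rate, flaky tests) can be illustrated with a minimal sketch. The function name `summarize`, the input layout (one list of pass/fail results per test), and the sample data are assumptions made for illustration; the benchmark's actual implementation is not shown in the source.

```python
from typing import Dict, List, Union


def summarize(runs: Dict[str, List[bool]]) -> Dict[str, Union[int, float]]:
    """Compute repetition-based metrics from per-test pass/fail lists.

    `runs` maps a test name to the results of its repetitions
    (True = passed). This layout is a hypothetical one.
    """
    total_tests = len(runs)
    total_runs = sum(len(results) for results in runs.values())

    # Fully passed: every repetition of the test passed.
    fully_passed = sum(1 for results in runs.values() if all(results))

    # Flaky: at least one pass and at least one fail across repetitions.
    flaky = sum(
        1 for results in runs.values() if any(results) and not all(results)
    )

    # Pass rate: passed repetitions / total repetitions across all tests.
    passed_runs = sum(sum(results) for results in runs.values())

    return {
        "fully_passed_pct": 100.0 * fully_passed / total_tests,
        "pass_rate_pct": 100.0 * passed_runs / total_runs,
        "flaky_tests": flaky,
    }


# Hypothetical data: three tests, three repetitions each.
example = {
    "test_a": [True, True, True],
    "test_b": [True, False, False],
    "test_c": [False, True, False],
}
result = summarize(example)
print(result)
```

In this sketch a test that fails every repetition is neither fully passed nor flaky, which matches the table's note that consistency can be high even when the results are consistently wrong.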