Cobuddy (medium) vs Granite 4.1 8B

Recommended model Cobuddy (medium)

It has the strongest score in this comparison (4.7) and the best overall balance of cost and response time across all 2 models.

Detailed comparison

Metric	Cobuddy Cobuddy medium Release: 2026-05-06	Granite 4.1 8B Granite 4.1 8B none Release: 2026-05-01

Metric	Cobuddy Cobuddy medium Release: 2026-05-06	Granite 4.1 8B Granite 4.1 8B none Release: 2026-05-01
Score	4.7	4.0
Rank	#210	#224
Reliability	10.0	10.0
Consistency	7.2	10.0
Tests Correct
Attempt pass rate	45.5%	9.1%
Flaky tests	6	0
Total Runs	63	66
Cost per result	0.000	0.315
Total Cost	$0.000	$0.007
Input Price	$0.000 / 1M	$0.050 / 1M
Output Price	$0.000 / 1M	$0.100 / 1M
Total Input Tokens	37,449	113,827
Output Tokens	1,677	5,996
Reasoning Tokens	116,703	0
Response Time (avg)	39.90s	1.45s
Response Time (max)	309.02s	16.67s
Response Time (total)	797.98s	31.96s

Prompt: Create a detailed SVG illustration of a hamster playing table tennis.

medium

No endpoints found for baidu/cobuddy:free.

none

Category:

Anti-AI Tricks	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Cobuddy	8.7	7.9	91.7%	1		10.00s	453	98	4,666
Granite 4.1 8B	4.9	10.0	25.0%	0		844ms	645	903	0

Coding	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Cobuddy	3.7	6.7	22.2%	1		79.17s	4,726	358	30,138
Granite 4.1 8B	4.5	10.0	0.0%	0		775ms	8,344	525	0

Combined	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Cobuddy	1.5	5.0	0.0%	0		47.38s	18,324	465	7,265
Granite 4.1 8B	3.0	10.0	0.0%	0		9.28s	86,631	3,481	0

Data parsing and extraction	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Cobuddy	6.3	5.8	66.7%	1		17.36s	8,181	275	5,591
Granite 4.1 8B	3.0	10.0	0.0%	0		575ms	7,617	195	0

Domain specific	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Cobuddy	2.9	4.4	22.2%	2		128.15s	540	10	49,454
Granite 4.1 8B	3.0	10.0	0.0%	0		357ms	768	24	0

General Intelligence	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Cobuddy	4.2	9.9	0.0%	0		23.23s	498	76	3,782
Granite 4.1 8B	4.0	10.0	0.0%	0		499ms	528	115	0

Instructions following	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Cobuddy	9.8	10.0	100.0%	0		11.60s	508	64	2,842
Granite 4.1 8B	3.6	9.9	0.0%	0		344ms	687	66	0

Puzzle Solving	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Cobuddy	3.6	7.2	22.2%	1		12.83s	561	189	5,808
Granite 4.1 8B	3.2	10.0	0.0%	0		608ms	672	432	0

Tool Calling	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Cobuddy	10.0	10.0	100.0%	0		11.19s	3,505	133	294
Granite 4.1 8B	10.0	10.0	100.0%	0		2.17s	7,719	243	0

Trivia	Score	Consistency	Attempt pass rate	Flaky tests	Tests Correct	Response Time (avg)	Input Tokens	Output Tokens	Reasoning Tokens
Cobuddy	3.0	10.0	0.0%	0		36.98s	153	9	6,863
Granite 4.1 8B	3.0	10.0	0.0%	0		306ms	216	12	0

Switch Comparison Pair