AI BENCHY

#117

Laguna M.1

Poolside
Release: 2026-04-28
Tested on: 2026-04-28 22:25
Model: poolside/laguna-m.1::none (medium) (none)

Score: 5.1
Consistency: 8.7
Total Cost: $0.000
Total Output Tokens: 2,870
Input Price: $0.000 / 1M
Output Price: $0.000 / 1M

Tests Correct
Wrong Tests: 14
Attempt pass rate: 33.3%
Flaky Tests: 3
Flaky tests had mixed outcomes across runs (at least one pass and one fail).
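The flaky-test definition above (mixed outcomes across runs) can be sketched in a few lines of Python. The test names and per-run results below are invented for illustration; they are not data from this benchmark.

```python
def classify(results: dict[str, list[bool]]) -> dict[str, str]:
    """Label each test 'pass', 'fail', or 'flaky' from its per-run outcomes."""
    labels = {}
    for test, runs in results.items():
        if all(runs):
            labels[test] = "pass"       # passed every run
        elif not any(runs):
            labels[test] = "fail"       # failed every run
        else:
            labels[test] = "flaky"      # at least one pass and one fail
    return labels

# Hypothetical run data:
runs = {
    "parse_csv": [True, True, True],
    "tool_call": [True, False, True],   # mixed outcomes -> flaky
    "riddle_7":  [False, False, False],
}
print(classify(runs))
# -> {'parse_csv': 'pass', 'tool_call': 'flaky', 'riddle_7': 'fail'}
```

Under this definition, a test counts toward the flaky total only when it is neither consistently passing nor consistently failing.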

Response Time (avg): 2.79s
Response Time (max): 15.42s
Response Time (total): 50.24s

Charts


[Charts: Total Output Tokens; Score vs Total Output Tokens]


Category Breakdown

Category                       Score   Consistency
Anti-AI Tricks                   3.4           7.9
Coding                           7.5           3.8
Combined                         3.0          10.0
Data parsing and extraction     10.0          10.0
Domain specific                  3.6           7.2
General Intelligence             4.0          10.0
Instructions following           6.3          10.0
Puzzle Solving                   3.2          10.0
Tool Calling                    10.0          10.0
