#118
Poolside
Release: 2026-04-28
Tested on: 2026-04-28 22:51
poolside/laguna-xs.2::none
(medium)
(none)
Input Price
$0.000 / 1M
Output Price
$0.000 / 1M
Flaky tests
0
Flaky tests had mixed outcomes across runs (at least one pass and one fail).
Charts
Choose the first model, then click a second model to open a side-by-side page.
Score vs Total Cost
Response Time (avg)
Score vs Response Time (avg)
Total Output Tokens
Score vs Total Output Tokens
Quick Compare
Laguna Xs.2noneFree AvailablevsElephant AlphanoneLaguna Xs.2noneFree AvailablevsQwen3 Coder NextnoneLaguna Xs.2noneFree Availablevsgpt-oss-120bnoneFree AvailableLaguna Xs.2noneFree AvailablevsMiMo-V2.5noneLaguna Xs.2noneFree AvailablevsMistral Small 4noneLaguna Xs.2noneFree AvailablevsQwen3.6 35B A3BnoneLaguna Xs.2noneFree AvailablevsGemini 3 Flash PreviewmediumLaguna Xs.2noneFree AvailablevsGemini 3.1 Pro PreviewmediumLaguna Xs.2noneFree AvailablevsHY3 PreviewhighFree Available
Category Breakdown
| Category | Score | Consistency | Tests Correct |
|---|---|---|---|
| Anti-AI Tricks | 3.2 | 10.0 | |
| Coding | 2.5 | 10.0 | |
| Combined | 3.0 | 10.0 | |
| Data parsing and extraction | 10.0 | 10.0 | |
| Domain specific | 5.3 | 10.0 | |
| General Intelligence | 5.0 | 10.0 | |
| Instructions following | 6.5 | 10.0 | |
| Puzzle Solving | 5.4 | 10.0 | |
| Tool Calling | 3.0 | 10.0 |