2026-04-23
- New Models Tested: inclusionai/ling-2.6-1t:free - Added benchmark coverage for InclusionAI Ling 2.6 1T Free.
- New Feature: Run history - Model pages now show historical public runs and a side-by-side run comparison table. (Example model page)
- UX: The leaderboard now supports URL-backed pagination, filters, and direct compare actions from the ranking list.
- Bug Fix: Homepage search, filter counts, and pagination state now stay consistent across the full dataset.
- Re-test: z-ai/glm-5.1 - Reran the full benchmark suite and cleaned up the public run-history snapshot for this model.
- Bug Fix: Stopped unrelated models from receiving a fresh tested_at timestamp when they were not actually retested.