AI BENCHY
AD
Track all your projects in one dashboard. Get 📊stats, 🔥heatmaps and 👀recordings in one self-hosted dashboard.
uxwizz.com

AI BENCHY

Changelog

A simple log of product and benchmark updates, grouped by date. We use it to note newly tested models, re-tests, benchmark changes, and shipped UX/product work.

2026-04-23

  • New Models Tested: inclusionai/ling-2.6-1t:free Added benchmark coverage for InclusionAI Ling 2.6 1T Free.
  • New Feature: Run history - Model pages now show historical public runs and a side-by-side run comparison table. (Example model page)
  • UX: The leaderboard now supports URL-backed pagination, filters, and direct compare actions from the ranking list.
  • Bug Fix: Homepage search, filter counts, and pagination state now stay consistent across the full dataset.
  • Re-test: z-ai/glm-5.1 Reran the full benchmark suite and cleaned up the public run-history snapshot for this model.
  • Bug Fix: Stopped unrelated models from receiving a fresh tested_at timestamp when they were not actually retested.

Changelog page created

We started this changelog after launch, so some older updates are missing.

2026-02-15

  • Initial release