HAKARI-bench leaderboard

🚧 WIP: This leaderboard is currently under active implementation, so specifications and data may change significantly.

Compare multilingual retrieval models, inspect compression variants, and audit reranking and Nano subset diagnostics from the DuckDB result warehouse.

Benchmark coverage

Result warehouse size and coverage visible in this viewer.

Latest result: 2026-05-10T06:14:02.838924+00:00

Models

36

Benchmarks

5

Tasks

70

Languages

21

Base rows

2,520

Variant rows

29,190

Loading leaderboard...