MNanoBEIR
Overview
MNanoBEIR is the multilingual NanoBEIR group: a grid of compact BEIR-style retrieval tasks across Arabic, German, Spanish, French, Italian, Japanese, Korean, Norwegian, Portuguese, Serbian, Swedish, Thai, and Vietnamese. Each language variant contains the same thirteen source tasks, so the group separates two questions that are often mixed together: whether a model understands the underlying BEIR retrieval relation, and whether that behavior survives in non-English text.
The source task mix is deliberately heterogeneous. Some tasks retrieve duplicate questions, some retrieve Wikipedia evidence, some retrieve biomedical or scientific documents, some retrieve debate arguments, and some retrieve answer-bearing web passages. A single average score is therefore not enough to understand the group. The useful reading is by language, task family, and retrieval profile. BM25 exposes exact-term and named-entity dependence; dense retrieval exposes semantic transfer and paraphrase handling; reranking_hybrid shows where sparse and dense candidates complement each other.
What This Group Measures
BEIR: A Heterogeneous Benchmark for Zero-shot Evaluation of Information Retrieval Models introduced a benchmark philosophy based on many retrieval relations rather than one passage-search task. MNanoBEIR inherits that philosophy from compact NanoBEIR tasks and applies it to multilingual evaluation. The result is a regular 13-by-13 grid: thirteen languages and thirteen BEIR-derived source tasks.
This group measures multilingual robustness under changing relevance semantics. For example, a relevant Quora document should be a duplicate question, a relevant FEVER document should support or refute a claim, a relevant ArguAna document should work as a counterargument, and a relevant SCIDOCS document should be scientifically related. A retriever that treats all rows as generic semantic similarity will miss much of what the group tests.
Task Families
- Argument retrieval:
NanoArguAnaandNanoTouche2020evaluate counterargument and debate-passage retrieval. They often have long text and stance-sensitive negatives. - Evidence retrieval:
NanoClimateFEVER,NanoFEVER, andNanoSciFactevaluate claim-to-evidence retrieval. Exact named entities matter, but the target passage must also express the right evidence relation. - Open-domain QA retrieval:
NanoFiQA2018,NanoHotpotQA,NanoMSMARCO, andNanoNQevaluate answer-bearing retrieval for finance, multi-hop QA, web search, and natural questions. - Entity and duplicate retrieval:
NanoDBPediaandNanoQuoraRetrievalstress entity-page matching and duplicate-question intent matching. - Scientific and biomedical retrieval:
NanoNFCorpusandNanoSCIDOCSare domain-specific and multi-positive, with many lexical traps around medical terms, paper titles, and related-work language.
Dataset Shape
MNanoBEIR contains 169 task pages, 8,437 queries, 737,399 split-local documents, and 61,048 positive qrel rows. Each language has 13 task pages and 649 queries: most base tasks have 50 queries, while NanoTouche2020 has 49. The document count is a sum over task-local pools, not a deduplicated multilingual corpus.
The group is highly multi-positive. NanoArguAna and NanoMSMARCO are single-positive in this grid, but NanoDBPedia, NanoNFCorpus, NanoSCIDOCS, and NanoTouche2020 contain many positives per query. This means hit@10 can look good while nDCG@10 or Recall@100 still shows ranking quality differences. Query and document length also vary by task: ArguAna and Touche have long argumentative text, Quora has short question text, NFCorpus has short medical queries and many positives, and FEVER-like tasks depend on claim-evidence alignment.
Retrieval Behavior
BM25 Profile
BM25 is strongest where exact words, named entities, titles, technical terms, or many acceptable positives dominate. In this group, Quora, FEVER, DBPedia, HotpotQA, and some Touche splits often give sparse retrieval a clear path. BM25 is weaker on finance, scientific related-paper retrieval, and some climate evidence tasks where the relevant document may use different wording from the query.
Language differences should not be reduced to script alone. Latin-script European languages can still differ because of translation choices and morphology; Japanese, Korean, Thai, Arabic, Serbian, and Vietnamese introduce additional tokenization and segmentation considerations. A BM25-competitive split is a sign that exact lexical anchoring remains central, not that the task is easy.
Dense Profile
Dense retrieval is the leading nDCG@10 profile for many MNanoBEIR tasks. It is most informative when the target relation depends on paraphrase, answerability, or relatedness rather than direct term overlap. This is visible in duplicate question retrieval, NQ-style passage retrieval, FEVER-style evidence selection, and some finance and scientific tasks.
Dense retrieval can still lose rare exact anchors. Entity-heavy tasks, medical terms, numeric facts, paper titles, and translated names can be underweighted if the embedding model smooths them into broader topical similarity. This makes the BM25-versus-dense comparison useful: it tells whether a task is mainly lexical, mainly semantic, or dependent on both.
Reranking Hybrid Profile
reranking_hybrid is the practical reranker-candidate view of MNanoBEIR. It often performs best when BM25 and dense retrieval find different relevant documents. ClimateFEVER, DBPedia, Touche, SCIDOCS, and several multilingual open-domain QA splits show this complementarity.
When reranking_hybrid is best by nDCG@10, the task is signaling that neither exact matching nor embedding similarity alone is sufficient. When it is not the best top-rank sorter but has stronger Recall@100, it is still useful for reranker experiments because it preserves positives that either first-stage method would otherwise drop.
Language Summary
| Language | Tasks | Queries | Docs | Positives | BM25 nDCG@10 | Dense nDCG@10 | Reranking hybrid nDCG@10 |
ar | 13 | 649 | 56,723 | 4,696 | 0.4412 | 0.4867 | 0.4919 |
de | 13 | 649 | 56,723 | 4,696 | 0.4476 | 0.5313 | 0.5099 |
es | 13 | 649 | 56,723 | 4,696 | 0.5025 | 0.5309 | 0.5318 |
fr | 13 | 649 | 56,723 | 4,696 | 0.5125 | 0.5565 | 0.5620 |
it | 13 | 649 | 56,723 | 4,696 | 0.4830 | 0.5324 | 0.5304 |
ja | 13 | 649 | 56,723 | 4,696 | 0.4661 | 0.5122 | 0.5061 |
ko | 13 | 649 | 56,723 | 4,696 | 0.4479 | 0.4989 | 0.5023 |
no | 13 | 649 | 56,723 | 4,696 | 0.4197 | 0.5140 | 0.4756 |
pt | 13 | 649 | 56,723 | 4,696 | 0.4816 | 0.5334 | 0.5311 |
sr | 13 | 649 | 56,723 | 4,696 | 0.3944 | 0.5032 | 0.4841 |
sv | 13 | 649 | 56,723 | 4,696 | 0.4184 | 0.5170 | 0.4899 |
th | 13 | 649 | 56,723 | 4,696 | 0.4356 | 0.5081 | 0.4944 |
vi | 13 | 649 | 56,723 | 4,696 | 0.4603 | 0.5336 | 0.5244 |
Task Navigation
| Base task | Family | Language pages |
| NanoArguAna | Argument retrieval | ar, de, es, fr, it, ja, ko, no, pt, sr, sv, th, vi |
| NanoClimateFEVER | Evidence retrieval | ar, de, es, fr, it, ja, ko, no, pt, sr, sv, th, vi |
| NanoDBPedia | Entity retrieval | ar, de, es, fr, it, ja, ko, no, pt, sr, sv, th, vi |
| NanoFEVER | Evidence retrieval | ar, de, es, fr, it, ja, ko, no, pt, sr, sv, th, vi |
| NanoFiQA2018 | Open-domain QA retrieval | ar, de, es, fr, it, ja, ko, no, pt, sr, sv, th, vi |
| NanoHotpotQA | Open-domain QA retrieval | ar, de, es, fr, it, ja, ko, no, pt, sr, sv, th, vi |
| NanoMSMARCO | Open-domain QA retrieval | ar, de, es, fr, it, ja, ko, no, pt, sr, sv, th, vi |
| NanoNFCorpus | Biomedical retrieval | ar, de, es, fr, it, ja, ko, no, pt, sr, sv, th, vi |
| NanoNQ | Open-domain QA retrieval | ar, de, es, fr, it, ja, ko, no, pt, sr, sv, th, vi |
| NanoQuoraRetrieval | Duplicate question retrieval | ar, de, es, fr, it, ja, ko, no, pt, sr, sv, th, vi |
| NanoSCIDOCS | Scientific related-paper retrieval | ar, de, es, fr, it, ja, ko, no, pt, sr, sv, th, vi |
| NanoSciFact | Evidence retrieval | ar, de, es, fr, it, ja, ko, no, pt, sr, sv, th, vi |
| NanoTouche2020 | Argument retrieval | ar, de, es, fr, it, ja, ko, no, pt, sr, sv, th, vi |
Interpretation Notes for Model Researchers
The strongest use of MNanoBEIR is controlled comparison. Because every language uses the same task grid, score differences can be read as language robustness only after accounting for the task family. A model may be excellent on translated Quora duplicates yet weak on translated finance QA or scientific related-paper retrieval. Family-level breakdowns are therefore more useful than one global average.
Pay special attention to profile changes. BM25-led rows suggest exact strings, entities, or many positives. Dense-led rows suggest paraphrase and semantic answerability. Hybrid-led rows suggest that sparse and dense retrieval recover different useful candidates. For reranker research, reranking_hybrid is often the most relevant candidate pool even when dense has the best top-rank nDCG.
Training and Leakage Notes
Useful training data should be both multilingual and task-matched: translated MS MARCO-style query-passage data for web retrieval, multilingual FEVER-style claim-evidence pairs for fact checking, duplicate-question pairs for Quora-like tasks, argument-counterargument pairs for ArguAna, and biomedical or scientific retrieval data for NFCorpus and SCIDOCS. Mixing all tasks into one generic similarity objective will blur the distinctions this group is designed to test.
Leakage risk is high because the underlying BEIR tasks are common in retrieval training mixtures. Exclude exact MNanoBEIR queries, qrels, positives, translated documents, and direct translations of evaluation text. Audit overlap with MS MARCO, NQ, FEVER, Quora, NFCorpus, SCIDOCS, SciFact, and Touche-style corpora before using them for training or synthetic-data seeding.
Source Reference Table
| Source | Year | Type | URL |
| BEIR: A Heterogeneous Benchmark for Zero-shot Evaluation of Information Retrieval Models | 2021 | paper | https://arxiv.org/abs/2104.08663 |
| MMTEB: Massive Multilingual Text Embedding Benchmark | 2025 | paper | https://arxiv.org/abs/2502.13595 |
| NanoBEIR collection | dataset collection | https://huggingface.co/collections/zeta-alpha-ai/nanobeir | |
| MTEB benchmark | project | https://github.com/embeddings-benchmark/mteb |
Metadata Summary
| Field | Value |
| Task pages | 169 |
| Queries | 8,437 |
| Split-local documents | 737,399 |
| Positive qrels | 72,319 |
| Languages | ar, de, es, fr, it, ja, ko, multilingual, no, pt, sr, sv, th, vi |
| Categories | natural_language |
| Positives / query avg | 8.57 |
Task Metadata Summary
| Task | Backing dataset | Lang | Category | Queries | Docs | Positives | BM25 nDCG@10 | Dense nDCG@10 | Reranking hybrid nDCG@10 | Best profile |
| NanoArguAna | NanoBEIR-ar | ar | natural_language | 50 | 3,635 | 50 | 0.3619 | 0.4295 | 0.4188 | Dense |
| NanoArguAna | NanoBEIR-de | de | natural_language | 50 | 3,635 | 50 | 0.3453 | 0.4738 | 0.4422 | Dense |
| NanoArguAna | NanoBEIR-es | es | natural_language | 50 | 3,635 | 50 | 0.4133 | 0.4808 | 0.4365 | Dense |
| NanoArguAna | NanoBEIR-fr | fr | natural_language | 50 | 3,635 | 50 | 0.3948 | 0.5200 | 0.4611 | Dense |
| NanoArguAna | NanoBEIR-it | it | natural_language | 50 | 3,635 | 50 | 0.3934 | 0.4706 | 0.4138 | Dense |
| NanoArguAna | NanoBEIR-ja | ja | natural_language | 50 | 3,635 | 50 | 0.3620 | 0.4239 | 0.4022 | Dense |
| NanoArguAna | NanoBEIR-ko | ko | natural_language | 50 | 3,635 | 50 | 0.3661 | 0.4082 | 0.4217 | Reranking hybrid |
| NanoArguAna | NanoBEIR-no | no | natural_language | 50 | 3,635 | 50 | 0.3096 | 0.3985 | 0.3656 | Dense |
| NanoArguAna | NanoBEIR-pt | pt | natural_language | 50 | 3,635 | 50 | 0.4131 | 0.4918 | 0.4474 | Dense |
| NanoArguAna | NanoBEIR-sr | sr | natural_language | 50 | 3,635 | 50 | 0.2817 | 0.4187 | 0.3625 | Dense |
| NanoArguAna | NanoBEIR-sv | sv | natural_language | 50 | 3,635 | 50 | 0.3185 | 0.4108 | 0.3784 | Dense |
| NanoArguAna | NanoBEIR-th | th | natural_language | 50 | 3,635 | 50 | 0.4051 | 0.3721 | 0.4349 | Reranking hybrid |
| NanoArguAna | NanoBEIR-vi | vi | natural_language | 50 | 3,635 | 50 | 0.4275 | 0.4529 | 0.4701 | Reranking hybrid |
| NanoClimateFEVER | NanoBEIR-ar | ar | natural_language | 50 | 3,408 | 148 | 0.2400 | 0.2899 | 0.2948 | Reranking hybrid |
| NanoClimateFEVER | NanoBEIR-de | de | natural_language | 50 | 3,408 | 148 | 0.2481 | 0.2625 | 0.3310 | Reranking hybrid |
| NanoClimateFEVER | NanoBEIR-es | es | natural_language | 50 | 3,408 | 148 | 0.2849 | 0.3097 | 0.3090 | Dense |
| NanoClimateFEVER | NanoBEIR-fr | fr | natural_language | 50 | 3,408 | 148 | 0.3063 | 0.3114 | 0.3531 | Reranking hybrid |
| NanoClimateFEVER | NanoBEIR-it | it | natural_language | 50 | 3,408 | 148 | 0.2700 | 0.3389 | 0.3255 | Dense |
| NanoClimateFEVER | NanoBEIR-ja | ja | natural_language | 50 | 3,408 | 148 | 0.2672 | 0.2839 | 0.3100 | Reranking hybrid |
| NanoClimateFEVER | NanoBEIR-ko | ko | natural_language | 50 | 3,408 | 148 | 0.2457 | 0.3003 | 0.2983 | Dense |
| NanoClimateFEVER | NanoBEIR-no | no | natural_language | 50 | 3,408 | 148 | 0.2099 | 0.3053 | 0.2862 | Dense |
| NanoClimateFEVER | NanoBEIR-pt | pt | natural_language | 50 | 3,408 | 148 | 0.2631 | 0.2508 | 0.2958 | Reranking hybrid |
| NanoClimateFEVER | NanoBEIR-sr | sr | natural_language | 50 | 3,408 | 148 | 0.2389 | 0.2946 | 0.3266 | Reranking hybrid |
| NanoClimateFEVER | NanoBEIR-sv | sv | natural_language | 50 | 3,408 | 148 | 0.2388 | 0.2633 | 0.3202 | Reranking hybrid |
| NanoClimateFEVER | NanoBEIR-th | th | natural_language | 50 | 3,408 | 148 | 0.2368 | 0.3444 | 0.3015 | Dense |
| NanoClimateFEVER | NanoBEIR-vi | vi | natural_language | 50 | 3,408 | 148 | 0.2668 | 0.3539 | 0.3558 | Reranking hybrid |
| NanoDBPedia | NanoBEIR-ar | ar | natural_language | 50 | 6,045 | 1,158 | 0.5272 | 0.5204 | 0.5277 | Reranking hybrid |
| NanoDBPedia | NanoBEIR-de | multilingual | natural_language | 50 | 6,045 | 1,158 | 0.5747 | 0.6152 | 0.6157 | Reranking hybrid |
| NanoDBPedia | NanoBEIR-es | es | natural_language | 50 | 6,045 | 1,158 | 0.6140 | 0.6099 | 0.6210 | Reranking hybrid |
| NanoDBPedia | NanoBEIR-fr | fr | natural_language | 50 | 6,045 | 1,158 | 0.5827 | 0.5581 | 0.5710 | BM25 |
| NanoDBPedia | NanoBEIR-it | it | natural_language | 50 | 6,045 | 1,158 | 0.5339 | 0.6427 | 0.6069 | Dense |
| NanoDBPedia | NanoBEIR-ja | ja | natural_language | 50 | 6,045 | 1,158 | 0.5843 | 0.6098 | 0.6081 | Dense |
| NanoDBPedia | NanoBEIR-ko | ko | natural_language | 50 | 6,045 | 1,158 | 0.5322 | 0.5928 | 0.5787 | Dense |
| NanoDBPedia | NanoBEIR-no | no | natural_language | 50 | 6,045 | 1,158 | 0.4684 | 0.5506 | 0.5184 | Dense |
| NanoDBPedia | NanoBEIR-pt | pt | natural_language | 50 | 6,045 | 1,158 | 0.5110 | 0.5816 | 0.5620 | Dense |
| NanoDBPedia | NanoBEIR-sr | sr | natural_language | 50 | 6,045 | 1,158 | 0.4704 | 0.5693 | 0.5567 | Dense |
| NanoDBPedia | NanoBEIR-sv | sv | natural_language | 50 | 6,045 | 1,158 | 0.4342 | 0.5906 | 0.5073 | Dense |
| NanoDBPedia | NanoBEIR-th | th | natural_language | 50 | 6,045 | 1,158 | 0.5043 | 0.5468 | 0.5482 | Reranking hybrid |
| NanoDBPedia | NanoBEIR-vi | vi | natural_language | 50 | 6,045 | 1,158 | 0.4949 | 0.5676 | 0.5628 | Dense |
| NanoFEVER | NanoBEIR-ar | ar | natural_language | 50 | 4,996 | 57 | 0.6665 | 0.8243 | 0.7767 | Dense |
| NanoFEVER | NanoBEIR-de | de | natural_language | 50 | 4,996 | 57 | 0.7362 | 0.8449 | 0.8004 | Dense |
| NanoFEVER | NanoBEIR-es | es | natural_language | 50 | 4,996 | 57 | 0.7803 | 0.8427 | 0.8029 | Dense |
| NanoFEVER | NanoBEIR-fr | fr | natural_language | 50 | 4,996 | 57 | 0.7469 | 0.8194 | 0.7803 | Dense |
| NanoFEVER | NanoBEIR-it | it | natural_language | 50 | 4,996 | 57 | 0.7776 | 0.7972 | 0.7977 | Reranking hybrid |
| NanoFEVER | NanoBEIR-ja | ja | natural_language | 50 | 4,996 | 57 | 0.6797 | 0.7141 | 0.6482 | Dense |
| NanoFEVER | NanoBEIR-ko | ko | natural_language | 50 | 4,996 | 57 | 0.5723 | 0.7335 | 0.7001 | Dense |
| NanoFEVER | NanoBEIR-no | no | natural_language | 50 | 4,996 | 57 | 0.7396 | 0.8416 | 0.7934 | Dense |
| NanoFEVER | NanoBEIR-pt | pt | natural_language | 50 | 4,996 | 57 | 0.8043 | 0.8461 | 0.8511 | Reranking hybrid |
| NanoFEVER | NanoBEIR-sr | sr | natural_language | 50 | 4,996 | 57 | 0.6486 | 0.7611 | 0.7191 | Dense |
| NanoFEVER | NanoBEIR-sv | sv | natural_language | 50 | 4,996 | 57 | 0.7512 | 0.8570 | 0.8153 | Dense |
| NanoFEVER | NanoBEIR-th | th | natural_language | 50 | 4,996 | 57 | 0.7001 | 0.8663 | 0.7768 | Dense |
| NanoFEVER | NanoBEIR-vi | vi | natural_language | 50 | 4,996 | 57 | 0.6109 | 0.8304 | 0.6964 | Dense |
| NanoFiQA2018 | NanoBEIR-ar | ar | natural_language | 50 | 4,598 | 123 | 0.3196 | 0.3934 | 0.3900 | Dense |
| NanoFiQA2018 | NanoBEIR-de | de | natural_language | 50 | 4,598 | 123 | 0.1864 | 0.3977 | 0.2704 | Dense |
| NanoFiQA2018 | NanoBEIR-es | es | natural_language | 50 | 4,598 | 123 | 0.3205 | 0.3819 | 0.4174 | Reranking hybrid |
| NanoFiQA2018 | NanoBEIR-fr | fr | natural_language | 50 | 4,598 | 123 | 0.3403 | 0.3881 | 0.4341 | Reranking hybrid |
| NanoFiQA2018 | NanoBEIR-it | it | natural_language | 50 | 4,598 | 123 | 0.2633 | 0.3443 | 0.3405 | Dense |
| NanoFiQA2018 | NanoBEIR-ja | ja | natural_language | 50 | 4,598 | 123 | 0.3288 | 0.3762 | 0.4041 | Reranking hybrid |
| NanoFiQA2018 | NanoBEIR-ko | ko | natural_language | 50 | 4,598 | 123 | 0.3415 | 0.3713 | 0.4291 | Reranking hybrid |
| NanoFiQA2018 | NanoBEIR-no | no | natural_language | 50 | 4,598 | 123 | 0.1955 | 0.4205 | 0.3330 | Dense |
| NanoFiQA2018 | NanoBEIR-pt | pt | natural_language | 50 | 4,598 | 123 | 0.2621 | 0.3853 | 0.3478 | Dense |
| NanoFiQA2018 | NanoBEIR-sr | sr | natural_language | 50 | 4,598 | 123 | 0.1904 | 0.3094 | 0.3183 | Reranking hybrid |
| NanoFiQA2018 | NanoBEIR-sv | sv | natural_language | 50 | 4,598 | 123 | 0.1159 | 0.3435 | 0.2256 | Dense |
| NanoFiQA2018 | NanoBEIR-th | th | natural_language | 50 | 4,598 | 123 | 0.2726 | 0.4085 | 0.3911 | Dense |
| NanoFiQA2018 | NanoBEIR-vi | vi | natural_language | 50 | 4,598 | 123 | 0.3300 | 0.3306 | 0.3693 | Reranking hybrid |
| NanoHotpotQA | NanoBEIR-ar | ar | natural_language | 50 | 5,090 | 100 | 0.6837 | 0.7365 | 0.7798 | Reranking hybrid |
| NanoHotpotQA | NanoBEIR-de | de | natural_language | 50 | 5,090 | 100 | 0.7904 | 0.7388 | 0.7941 | Reranking hybrid |
| NanoHotpotQA | NanoBEIR-es | es | natural_language | 50 | 5,090 | 100 | 0.7466 | 0.7101 | 0.7209 | BM25 |
| NanoHotpotQA | NanoBEIR-fr | fr | natural_language | 50 | 5,090 | 100 | 0.7258 | 0.7564 | 0.7834 | Reranking hybrid |
| NanoHotpotQA | NanoBEIR-it | it | natural_language | 50 | 5,090 | 100 | 0.7275 | 0.7540 | 0.7762 | Reranking hybrid |
| NanoHotpotQA | NanoBEIR-ja | ja | natural_language | 50 | 5,090 | 100 | 0.5296 | 0.6885 | 0.6354 | Dense |
| NanoHotpotQA | NanoBEIR-ko | ko | natural_language | 50 | 5,090 | 100 | 0.5966 | 0.6269 | 0.6316 | Reranking hybrid |
| NanoHotpotQA | NanoBEIR-no | no | natural_language | 50 | 5,090 | 100 | 0.7728 | 0.7574 | 0.8168 | Reranking hybrid |
| NanoHotpotQA | NanoBEIR-pt | pt | natural_language | 50 | 5,090 | 100 | 0.7604 | 0.7948 | 0.8145 | Reranking hybrid |
| NanoHotpotQA | NanoBEIR-sr | sr | natural_language | 50 | 5,090 | 100 | 0.6327 | 0.7516 | 0.7414 | Dense |
| NanoHotpotQA | NanoBEIR-sv | sv | natural_language | 50 | 5,090 | 100 | 0.7413 | 0.7617 | 0.8233 | Reranking hybrid |
| NanoHotpotQA | NanoBEIR-th | th | natural_language | 50 | 5,090 | 100 | 0.5523 | 0.6880 | 0.6652 | Dense |
| NanoHotpotQA | NanoBEIR-vi | vi | natural_language | 50 | 5,090 | 100 | 0.6311 | 0.7552 | 0.7089 | Dense |
| NanoMSMARCO | NanoBEIR-ar | ar | natural_language | 50 | 5,043 | 50 | 0.2732 | 0.3625 | 0.3212 | Dense |
| NanoMSMARCO | NanoBEIR-de | de | natural_language | 50 | 5,043 | 50 | 0.3502 | 0.4728 | 0.4388 | Dense |
| NanoMSMARCO | NanoBEIR-es | es | natural_language | 50 | 5,043 | 50 | 0.4039 | 0.4987 | 0.4523 | Dense |
| NanoMSMARCO | NanoBEIR-fr | fr | natural_language | 50 | 5,043 | 50 | 0.4608 | 0.5748 | 0.5312 | Dense |
| NanoMSMARCO | NanoBEIR-it | it | natural_language | 50 | 5,043 | 50 | 0.3957 | 0.5087 | 0.4781 | Dense |
| NanoMSMARCO | NanoBEIR-ja | ja | natural_language | 50 | 5,043 | 50 | 0.3318 | 0.4748 | 0.3743 | Dense |
| NanoMSMARCO | NanoBEIR-ko | ko | natural_language | 50 | 5,043 | 50 | 0.3320 | 0.4164 | 0.4371 | Reranking hybrid |
| NanoMSMARCO | NanoBEIR-no | no | natural_language | 50 | 5,043 | 50 | 0.3249 | 0.4174 | 0.3728 | Dense |
| NanoMSMARCO | NanoBEIR-pt | pt | natural_language | 50 | 5,043 | 50 | 0.3494 | 0.5121 | 0.4873 | Dense |
| NanoMSMARCO | NanoBEIR-sr | sr | natural_language | 50 | 5,043 | 50 | 0.2833 | 0.4541 | 0.4072 | Dense |
| NanoMSMARCO | NanoBEIR-sv | sv | natural_language | 50 | 5,043 | 50 | 0.3777 | 0.4559 | 0.4737 | Reranking hybrid |
| NanoMSMARCO | NanoBEIR-th | th | natural_language | 50 | 5,043 | 50 | 0.2907 | 0.4265 | 0.3653 | Dense |
| NanoMSMARCO | NanoBEIR-vi | vi | natural_language | 50 | 5,043 | 50 | 0.3423 | 0.4934 | 0.4454 | Dense |
| NanoNFCorpus | NanoBEIR-ar | ar | natural_language | 50 | 2,953 | 2,518 | 0.1663 | 0.2067 | 0.2000 | Dense |
| NanoNFCorpus | NanoBEIR-de | de | natural_language | 50 | 2,953 | 2,518 | 0.2260 | 0.2810 | 0.2582 | Dense |
| NanoNFCorpus | NanoBEIR-es | es | natural_language | 50 | 2,953 | 2,518 | 0.3140 | 0.3053 | 0.3212 | Reranking hybrid |
| NanoNFCorpus | NanoBEIR-fr | fr | natural_language | 50 | 2,953 | 2,518 | 0.3235 | 0.3287 | 0.3265 | Dense |
| NanoNFCorpus | NanoBEIR-it | it | natural_language | 50 | 2,953 | 2,518 | 0.3347 | 0.2634 | 0.3377 | Reranking hybrid |
| NanoNFCorpus | NanoBEIR-ja | ja | natural_language | 50 | 2,953 | 2,518 | 0.2684 | 0.2681 | 0.3139 | Reranking hybrid |
| NanoNFCorpus | NanoBEIR-ko | ko | natural_language | 50 | 2,953 | 2,518 | 0.2719 | 0.2515 | 0.2745 | Reranking hybrid |
| NanoNFCorpus | NanoBEIR-no | no | natural_language | 50 | 2,953 | 2,518 | 0.3007 | 0.2663 | 0.3172 | Reranking hybrid |
| NanoNFCorpus | NanoBEIR-pt | pt | natural_language | 50 | 2,953 | 2,518 | 0.3389 | 0.3298 | 0.3519 | Reranking hybrid |
| NanoNFCorpus | NanoBEIR-sr | sr | natural_language | 50 | 2,953 | 2,518 | 0.1776 | 0.2411 | 0.2301 | Dense |
| NanoNFCorpus | NanoBEIR-sv | sv | natural_language | 50 | 2,953 | 2,518 | 0.2793 | 0.2688 | 0.3145 | Reranking hybrid |
| NanoNFCorpus | NanoBEIR-th | th | natural_language | 50 | 2,953 | 2,518 | 0.2663 | 0.2409 | 0.2743 | Reranking hybrid |
| NanoNFCorpus | NanoBEIR-vi | vi | natural_language | 50 | 2,953 | 2,518 | 0.2486 | 0.2756 | 0.2879 | Reranking hybrid |
| NanoNQ | NanoBEIR-ar | ar | natural_language | 50 | 5,035 | 57 | 0.3555 | 0.4600 | 0.4247 | Dense |
| NanoNQ | NanoBEIR-de | de | natural_language | 50 | 5,035 | 57 | 0.3757 | 0.5266 | 0.4491 | Dense |
| NanoNQ | NanoBEIR-es | es | natural_language | 50 | 5,035 | 57 | 0.3197 | 0.5059 | 0.4130 | Dense |
| NanoNQ | NanoBEIR-fr | fr | natural_language | 50 | 5,035 | 57 | 0.4460 | 0.5970 | 0.5556 | Dense |
| NanoNQ | NanoBEIR-it | it | natural_language | 50 | 5,035 | 57 | 0.3750 | 0.5133 | 0.4545 | Dense |
| NanoNQ | NanoBEIR-ja | ja | natural_language | 50 | 5,035 | 57 | 0.4473 | 0.6165 | 0.5569 | Dense |
| NanoNQ | NanoBEIR-ko | ko | natural_language | 50 | 5,035 | 57 | 0.4301 | 0.5805 | 0.5033 | Dense |
| NanoNQ | NanoBEIR-no | no | natural_language | 50 | 5,035 | 57 | 0.3011 | 0.5490 | 0.3641 | Dense |
| NanoNQ | NanoBEIR-pt | pt | natural_language | 50 | 5,035 | 57 | 0.3645 | 0.5103 | 0.4522 | Dense |
| NanoNQ | NanoBEIR-sr | sr | natural_language | 50 | 5,035 | 57 | 0.2624 | 0.5343 | 0.4228 | Dense |
| NanoNQ | NanoBEIR-sv | sv | natural_language | 50 | 5,035 | 57 | 0.3026 | 0.5147 | 0.3660 | Dense |
| NanoNQ | NanoBEIR-th | th | natural_language | 50 | 5,035 | 57 | 0.3191 | 0.5367 | 0.4246 | Dense |
| NanoNQ | NanoBEIR-vi | vi | natural_language | 50 | 5,035 | 57 | 0.3848 | 0.5939 | 0.5351 | Dense |
| NanoQuoraRetrieval | NanoBEIR-ar | ar | natural_language | 50 | 5,046 | 70 | 0.7238 | 0.8170 | 0.7728 | Dense |
| NanoQuoraRetrieval | NanoBEIR-de | de | natural_language | 50 | 5,046 | 70 | 0.7177 | 0.8323 | 0.7982 | Dense |
| NanoQuoraRetrieval | NanoBEIR-es | es | natural_language | 50 | 5,046 | 70 | 0.7912 | 0.8661 | 0.8425 | Dense |
| NanoQuoraRetrieval | NanoBEIR-fr | fr | natural_language | 50 | 5,046 | 70 | 0.7657 | 0.8593 | 0.8376 | Dense |
| NanoQuoraRetrieval | NanoBEIR-it | it | natural_language | 50 | 5,046 | 70 | 0.7130 | 0.8699 | 0.8038 | Dense |
| NanoQuoraRetrieval | NanoBEIR-ja | ja | natural_language | 50 | 5,046 | 70 | 0.7391 | 0.7722 | 0.7417 | Dense |
| NanoQuoraRetrieval | NanoBEIR-ko | ko | natural_language | 50 | 5,046 | 70 | 0.7062 | 0.8133 | 0.7632 | Dense |
| NanoQuoraRetrieval | NanoBEIR-no | no | natural_language | 50 | 5,046 | 70 | 0.6347 | 0.7829 | 0.6988 | Dense |
| NanoQuoraRetrieval | NanoBEIR-pt | pt | natural_language | 50 | 5,046 | 70 | 0.7247 | 0.8172 | 0.7634 | Dense |
| NanoQuoraRetrieval | NanoBEIR-sr | sr | natural_language | 50 | 5,046 | 70 | 0.5837 | 0.8100 | 0.7129 | Dense |
| NanoQuoraRetrieval | NanoBEIR-sv | sv | natural_language | 50 | 5,046 | 70 | 0.6474 | 0.8274 | 0.7341 | Dense |
| NanoQuoraRetrieval | NanoBEIR-th | th | natural_language | 50 | 5,046 | 70 | 0.7267 | 0.8859 | 0.7928 | Dense |
| NanoQuoraRetrieval | NanoBEIR-vi | vi | natural_language | 50 | 5,046 | 70 | 0.7238 | 0.8646 | 0.7903 | Dense |
| NanoSCIDOCS | NanoBEIR-ar | multilingual | natural_language | 50 | 2,210 | 244 | 0.2488 | 0.2996 | 0.2939 | Dense |
| NanoSCIDOCS | NanoBEIR-de | de | natural_language | 50 | 2,210 | 244 | 0.1913 | 0.3711 | 0.3004 | Dense |
| NanoSCIDOCS | NanoBEIR-es | es | natural_language | 50 | 2,210 | 244 | 0.2978 | 0.3551 | 0.3554 | Reranking hybrid |
| NanoSCIDOCS | NanoBEIR-fr | fr | natural_language | 50 | 2,210 | 244 | 0.3129 | 0.3734 | 0.3787 | Reranking hybrid |
| NanoSCIDOCS | NanoBEIR-it | it | natural_language | 50 | 2,210 | 244 | 0.2867 | 0.3378 | 0.3499 | Reranking hybrid |
| NanoSCIDOCS | NanoBEIR-ja | ja | natural_language | 50 | 2,210 | 244 | 0.3116 | 0.3498 | 0.3710 | Reranking hybrid |
| NanoSCIDOCS | NanoBEIR-ko | ko | natural_language | 50 | 2,210 | 244 | 0.2673 | 0.3310 | 0.3380 | Reranking hybrid |
| NanoSCIDOCS | NanoBEIR-no | no | natural_language | 50 | 2,210 | 244 | 0.2153 | 0.3412 | 0.2467 | Dense |
| NanoSCIDOCS | NanoBEIR-pt | pt | natural_language | 50 | 2,210 | 244 | 0.2911 | 0.3163 | 0.3201 | Reranking hybrid |
| NanoSCIDOCS | NanoBEIR-sr | sr | natural_language | 50 | 2,210 | 244 | 0.2561 | 0.3382 | 0.3229 | Dense |
| NanoSCIDOCS | NanoBEIR-sv | sv | natural_language | 50 | 2,210 | 244 | 0.1892 | 0.3472 | 0.2844 | Dense |
| NanoSCIDOCS | NanoBEIR-th | th | natural_language | 50 | 2,210 | 244 | 0.2641 | 0.2915 | 0.3165 | Reranking hybrid |
| NanoSCIDOCS | NanoBEIR-vi | vi | natural_language | 50 | 2,210 | 244 | 0.2839 | 0.3201 | 0.3202 | Reranking hybrid |
| NanoSciFact | NanoBEIR-ar | ar | natural_language | 50 | 2,919 | 56 | 0.5755 | 0.5807 | 0.6340 | Reranking hybrid |
| NanoSciFact | NanoBEIR-de | de | natural_language | 50 | 2,919 | 56 | 0.6212 | 0.7017 | 0.6577 | Dense |
| NanoSciFact | NanoBEIR-es | es | natural_language | 50 | 2,919 | 56 | 0.7176 | 0.6480 | 0.7280 | Reranking hybrid |
| NanoSciFact | NanoBEIR-fr | fr | natural_language | 50 | 2,919 | 56 | 0.7182 | 0.6965 | 0.7342 | Reranking hybrid |
| NanoSciFact | NanoBEIR-it | it | natural_language | 50 | 2,919 | 56 | 0.6714 | 0.6381 | 0.6766 | Reranking hybrid |
| NanoSciFact | NanoBEIR-ja | ja | natural_language | 50 | 2,919 | 56 | 0.7023 | 0.6751 | 0.7232 | Reranking hybrid |
| NanoSciFact | NanoBEIR-ko | ko | natural_language | 50 | 2,919 | 56 | 0.6835 | 0.6207 | 0.6838 | Reranking hybrid |
| NanoSciFact | NanoBEIR-no | no | natural_language | 50 | 2,919 | 56 | 0.5652 | 0.6217 | 0.6137 | Dense |
| NanoSciFact | NanoBEIR-pt | pt | natural_language | 50 | 2,919 | 56 | 0.6827 | 0.6801 | 0.7118 | Reranking hybrid |
| NanoSciFact | NanoBEIR-sr | sr | natural_language | 50 | 2,919 | 56 | 0.6468 | 0.6223 | 0.6834 | Reranking hybrid |
| NanoSciFact | NanoBEIR-sv | sv | natural_language | 50 | 2,919 | 56 | 0.6539 | 0.6730 | 0.7181 | Reranking hybrid |
| NanoSciFact | NanoBEIR-th | th | natural_language | 50 | 2,919 | 56 | 0.6334 | 0.5713 | 0.6206 | BM25 |
| NanoSciFact | NanoBEIR-vi | vi | natural_language | 50 | 2,919 | 56 | 0.7134 | 0.6644 | 0.7632 | Reranking hybrid |
| NanoTouche2020 | NanoBEIR-ar | ar | natural_language | 49 | 5,745 | 932 | 0.5263 | 0.4155 | 0.5262 | BM25 |
| NanoTouche2020 | NanoBEIR-de | de | natural_language | 49 | 5,745 | 932 | 0.4824 | 0.4155 | 0.5012 | Reranking hybrid |
| NanoTouche2020 | NanoBEIR-es | es | natural_language | 49 | 5,745 | 932 | 0.5732 | 0.4143 | 0.5306 | BM25 |
| NanoTouche2020 | NanoBEIR-fr | fr | natural_language | 49 | 5,745 | 932 | 0.5609 | 0.4733 | 0.5763 | Reranking hybrid |
| NanoTouche2020 | NanoBEIR-it | it | natural_language | 49 | 5,745 | 932 | 0.5714 | 0.4599 | 0.5717 | Reranking hybrid |
| NanoTouche2020 | NanoBEIR-ja | ja | natural_language | 49 | 5,745 | 932 | 0.5361 | 0.4354 | 0.5296 | BM25 |
| NanoTouche2020 | NanoBEIR-ko | ko | natural_language | 49 | 5,745 | 932 | 0.5033 | 0.4564 | 0.5013 | BM25 |
| NanoTouche2020 | NanoBEIR-no | no | natural_language | 49 | 5,745 | 932 | 0.4586 | 0.4613 | 0.5021 | Reranking hybrid |
| NanoTouche2020 | NanoBEIR-pt | pt | natural_language | 49 | 5,745 | 932 | 0.5366 | 0.4496 | 0.5371 | Reranking hybrid |
| NanoTouche2020 | NanoBEIR-sr | sr | natural_language | 49 | 5,745 | 932 | 0.4741 | 0.4605 | 0.5246 | Reranking hybrid |
| NanoTouche2020 | NanoBEIR-sv | sv | natural_language | 49 | 5,745 | 932 | 0.4296 | 0.4427 | 0.4707 | Reranking hybrid |
| NanoTouche2020 | NanoBEIR-th | th | natural_language | 49 | 5,745 | 932 | 0.5108 | 0.4534 | 0.5380 | Reranking hybrid |
| NanoTouche2020 | NanoBEIR-vi | vi | natural_language | 49 | 5,745 | 932 | 0.5444 | 0.4636 | 0.5368 | BM25 |