Benchmark
-
BEIR
Benchmarking IR; a heterogeneous benchmark of 18 retrieval datasets spanning 9 task types (e.g., fact checking, question answering, biomedical IR) that evaluates zero-shot generalization of retrieval models, typically ones trained on MS MARCO.
-
MS MARCO
Microsoft MAchine Reading COmprehension dataset; the dominant benchmark for passage retrieval and document ranking, with 8.8M passages, about 1M queries sampled from Bing search logs, and sparse binary relevance judgments (typically about one labeled relevant passage per query).