Dense-Retrieval

Retrieval-Augmented Generation

Grounding language model generation in retrieved external documents; reduces hallucination and enables knowledge updates without retraining.
Hybrid Search

Combining dense vector similarity and sparse term-matching scores to balance semantic understanding with keyword precision.
ANCE

Approximate Nearest Neighbor Negative Contrastive Estimation; improves dense retrieval training by dynamically refreshing hard negatives from the current model’s ANN index.
ColBERTv2

Improved ColBERT with cross-encoder distillation and residual compression; dramatically reduces index size while matching or exceeding v1 effectiveness.
Contriever

Unsupervised dense retrieval model trained with contrastive learning on unlabeled text; no labeled query-passage pairs required.
DPR (Dense Passage Retrieval)

Dual BERT encoder model that retrieves passages by embedding queries and documents into a shared dense vector space; foundational bi-encoder for open-domain QA.
DSSM

Deep Structured Semantic Model (2013); the original neural dual-encoder for web search, using word-hash trigram inputs and MLP towers to learn query-document semantic similarity.
Hard Negative Mining

Strategy for selecting training negatives that are difficult for the current model to distinguish from positives; critical for dense retrieval model quality beyond in-batch random negatives.
HyDE

Hypothetical Document Embeddings; generates a hypothetical answer to a query using an LLM and embeds that instead of the original query for zero-shot dense retrieval improvement.
Query2Doc

Expands queries by prepending LLM-generated pseudo-documents before retrieval; improves both sparse and dense retrieval without modifying the index or retrieval model.
SimCSE

Simple Contrastive Sentence Embeddings; learns high-quality sentence representations via dropout-based augmentation (unsupervised) or NLI entailment pairs (supervised).
TAS-B

Topic-Aware Sampling with BERT; dense retrieval model trained via balanced topic-aware sampling and cross-encoder distillation, achieving strong recall with efficient inference.
ColBERT

Contextualized Late Interaction over BERT; late-interaction ranking using per-token embeddings with MaxSim scoring for efficient dense retrieval.
Bi-Encoder

Neural architecture encoding query and document independently into separate embeddings, enabling fast retrieval via approximate nearest-neighbour search.
Approximate Nearest Neighbour

Fast nearest-neighbour search algorithm sacrificing exactness for speed; enables practical dense retrieval at scale. Abbreviated ANN.
Comparing BM25 and Dense Retrieval for a Product Catalogue

A side-by-side evaluation of keyword search and embedding-based search on a realistic product dataset, showing where each approach wins and how hybrid search splits the difference.