Training

Contrastive Loss

Training objective that pulls similar pairs together and pushes dissimilar pairs apart in embedding space; the dominant loss function for dense retrieval and sentence embedding models.
Hard Negative Mining

Strategy for selecting training negatives that are difficult for the current model to distinguish from positives; critical for dense retrieval model quality beyond in-batch random negatives.
In-Batch Negatives

Training technique where other (query, passage) pairs within the same mini-batch serve as negatives; free negative supervision that scales with batch size.
Knowledge Distillation for IR

Training a fast bi-encoder (student) to mimic the ranking scores of a slow cross-encoder (teacher); the dominant approach for improving dense retrieval without cross-encoder latency.