Statistical
-
N-Gram Language Model
Language model that estimates the probability of each token given the preceding n-1 tokens, using observed n-gram counts; the foundation of statistical NLP before neural methods.
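A minimal sketch of the idea for n = 2 (a bigram model), assuming unsmoothed maximum-likelihood estimates and whitespace tokenization. The function name `train_bigram_model` and the toy sentence are illustrative only and do not come from any particular library.

```python
from collections import Counter


def train_bigram_model(tokens):
    """Return a function giving the maximum-likelihood estimate P(word | prev)."""
    # Count every adjacent pair, and every token that appears as a left context.
    bigram_counts = Counter(zip(tokens, tokens[1:]))
    context_counts = Counter(tokens[:-1])

    def prob(prev, word):
        if context_counts[prev] == 0:
            # Unseen context: a practical model would smooth or back off here.
            return 0.0
        return bigram_counts[(prev, word)] / context_counts[prev]

    return prob


# Toy corpus (hypothetical example data).
tokens = "the cat sat on the mat".split()
prob = train_bigram_model(tokens)
print(prob("the", "cat"))  # 0.5: "the" is followed once by "cat" and once by "mat"
```

Raw counts assign zero probability to any n-gram missing from the training data, which is why practical n-gram models add smoothing or back-off.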
-
Collocation
Co-occurrence of words at a rate significantly higher than chance would predict (e.g. “strong tea”, “black coffee”); signals a conventional phrase rather than an accidental pairing.
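One common way to score how far a word pair departs from chance is pointwise mutual information (PMI), which compares the observed pair probability with the product of the individual word probabilities. The sketch below assumes adjacent word pairs only and whitespace tokenization. The function name `pmi_scores` and the toy corpus are illustrative, and PMI is only one of several association measures that could be used here.

```python
import math
from collections import Counter


def pmi_scores(tokens):
    """Return {(w1, w2): PMI in bits} for every adjacent word pair in tokens."""
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    n_unigrams = len(tokens)
    n_bigrams = len(tokens) - 1

    scores = {}
    for (w1, w2), count in bigrams.items():
        p_pair = count / n_bigrams
        p_w1 = unigrams[w1] / n_unigrams
        p_w2 = unigrams[w2] / n_unigrams
        # PMI > 0 means the pair occurs more often than independence predicts.
        scores[(w1, w2)] = math.log2(p_pair / (p_w1 * p_w2))
    return scores


# Toy corpus (hypothetical example data).
tokens = "strong tea with strong tea and black coffee".split()
scores = pmi_scores(tokens)
for pair, score in sorted(scores.items(), key=lambda item: -item[1]):
    print(pair, round(score, 2))
```

PMI tends to overrate pairs built from rare words, so it is usually combined with a minimum frequency threshold before candidates are treated as collocations.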