- Your task has a fixed output format (classification, token labeling, span extraction).
- You need fast inference (for classification, encoder-only models are faster than encoder-decoder models because they need a single forward pass and have no decoder to run).
- Your fine-tuning dataset is small and you want maximum parameter efficiency.
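
To make the "fixed output format" point concrete, here is a minimal sketch in plain Python of a classification head: one linear layer maps a single pooled encoder vector to a fixed set of label scores in one pass. The toy vector, the weights, and the names `classify` and `head_param_count` are hypothetical, chosen only for illustration; in a real model the pooled vector would come from the encoder.

```python
import math

def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def classify(pooled, weights, bias):
    """Apply a linear classification head to one pooled encoder vector.

    pooled:  list of hidden_size floats (stand-in for the encoder output)
    weights: num_labels rows, each of hidden_size floats
    bias:    list of num_labels floats
    Returns (predicted_label_index, probabilities).
    """
    logits = [
        sum(w * x for w, x in zip(row, pooled)) + b
        for row, b in zip(weights, bias)
    ]
    probs = softmax(logits)
    return probs.index(max(probs)), probs

def head_param_count(hidden_size, num_labels):
    """Number of trainable parameters in the linear head alone."""
    return hidden_size * num_labels + num_labels

# Hypothetical toy values: hidden_size = 4, num_labels = 3.
pooled = [0.5, -1.0, 0.25, 2.0]
weights = [
    [0.1, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 1.0],
    [0.0, 1.0, 0.0, 0.0],
]
bias = [0.0, 0.0, 0.0]

label, probs = classify(pooled, weights, bias)
```

The output always has exactly `num_labels` entries regardless of input length, which is what makes the format fixed. The head itself adds only `hidden_size * num_labels + num_labels` parameters on top of the encoder.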