The fine-tuning and alignment stages (SFT, RLHF) after pretraining that shape a model's behavior and usefulness.
← All terms