The Definitive Guide to AI Data Centers

RLHF · Reinforcement Learning from Human Feedback

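Fine-tuning a model with human preference rankings of its outputs, typically by training a reward model on those rankings and then optimizing the model against it with reinforcement learning, so that its outputs become more helpful and better aligned with human intent.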
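
As a rough illustration of how rankings can become a training signal, the sketch below computes a pairwise preference loss of the Bradley-Terry kind often used when fitting a reward model. The function name `preference_loss` and the scalar reward inputs are assumptions for illustration; this entry does not prescribe a specific loss, and real systems compute it over batches of model-produced scores.

```python
import math


def preference_loss(reward_chosen: float, reward_rejected: float) -> float:
    """Negative log-probability that the human-preferred response wins.

    Hypothetical helper: assumes each response has already been scored
    by a reward model, yielding one scalar per response.
    """
    margin = reward_chosen - reward_rejected
    # -log(sigmoid(margin)), written in a numerically stable form.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


if __name__ == "__main__":
    # Loss is small when the preferred response scores higher, large otherwise.
    print(preference_loss(2.0, 0.0))
    print(preference_loss(0.0, 2.0))
```

Minimizing this loss pushes the reward model to score preferred responses above rejected ones; a later reinforcement learning stage can then optimize the model against those scores.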
