Back to glossary

    How Models Learn

    RLHF (Reinforcement Learning from Human Feedback)

    Definition

    Human reviewers rate AI responses, and the model learns to produce more of the preferred style. This is how models become helpful and conversational.

    Why it matters

    RLHF is why modern AI assistants feel so natural.