Reinforcement Learning from AI Feedback (RLAIF). In contrast to RLHF, there is no longer a human in the loop: preference labels are produced by an AI model instead.
Reinforcement learning from human feedback (RLHF) is effective at aligning large language models (LLMs) to human preferences, but gathering high quality human preference labels is a key bottleneck. We conduct a head-to-head comparison of RLHF vs. RL from AI Feedback (RLAIF) – a technique where prefe…
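The core mechanical difference is in how preference pairs are produced: an off-the-shelf LLM, rather than a human annotator, judges which of two responses is better. A minimal sketch of that labeling step is below; `judge` is a hypothetical callable standing in for the LLM judge, and the prompt wording is illustrative, not taken from the paper.

```python
def ai_preference_label(prompt, response_a, response_b, judge):
    """Label which of two responses is preferred, using an AI judge
    in place of a human annotator (the core idea of RLAIF).

    `judge` is a hypothetical stand-in for an off-the-shelf LLM call:
    it takes a rating prompt string and returns the text "A" or "B".
    """
    rating_prompt = (
        "Which response better answers the user?\n"
        f"User: {prompt}\n"
        f"Response A: {response_a}\n"
        f"Response B: {response_b}\n"
        "Answer with A or B."
    )
    verdict = judge(rating_prompt).strip().upper()
    # Map the judge's verdict to a (chosen, rejected) pair that a
    # reward model or RL step could consume downstream.
    if verdict == "A":
        return response_a, response_b
    return response_b, response_a
```

In practice the judge would be a real LLM API call and its outputs would need parsing and position-bias mitigation (e.g. averaging over both response orderings), but the resulting (chosen, rejected) pairs feed the same RL pipeline that RLHF uses.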