altered-bot

altered-bot t1_izo2xqd wrote on December 10, 2022 at 3:45 PM

Reply to [R] Illustrating Reinforcement Learning from Human Feedback (RLHF) by robotphilanthropist

Really insightful, thanks.