altered-bot t1_izo2xqd wrote on December 10, 2022 at 3:45 PM
Reply to [R] Illustrating Reinforcement Learning from Human Feedback (RLHF) by robotphilanthropist
Really insightful, thanks.