Comments on: Illustrating Reinforcement Learning from Human Feedback (RLHF) https://russian.lifeboat.com/blog/2023/11/illustrating-reinforcement-learning-from-human-feedback-rlhf Safeguarding Humanity Fri, 24 Nov 2023 02:42:40 +0000 hourly 1 https://wordpress.org/?v=6.6.1 By: Lance https://russian.lifeboat.com/blog/2023/11/illustrating-reinforcement-learning-from-human-feedback-rlhf#comment-497371 Fri, 24 Nov 2023 02:42:40 +0000 https://lifeboat.com/blog/2023/11/illustrating-reinforcement-learning-from-human-feedback-rlhf#comment-497371 What happens when “it” causes you to do something you don’t want to do?

]]>