Reinforcement Learning from Human Feedback (RLHF) in Notebooks github.com 72 points by ash_at_hny a day ago
Hl
[dead]
[dead]