Next.js Hacker News
top
|
new
|
ask
|
show
|
jobs
|
GitHub
RLHF from Scratch
61 points by
onurkanbkrc
10 hours ago |
2 comments
add comment
fauria
RLHF: Reinforcement learning from human feedback -
https://en.wikipedia.org/wiki/Reinforcement_learning_from_hu...
alansaber
Looks good. I am a big advocate for these hands on demos as being the best way for beginners to learn ML