Next.js Hacker News
  • top|
  • new|
  • ask|
  • show|
  • jobs|
  • GitHub
Reinforcement Learning from Human Feedback
83 points by onurkanbkrc 7 hours ago | 5 comments
  • dang
    Related. Others?

    RLHF Book - https://news.ycombinator.com/item?id=42902936 - Feb 2025 (37 comments)

  • verdverm
    Last time I saw Nathan say something about the book, he's actively working on the next version and looking for feedback, check his socials
    • leggerss
      You could say he's also learning from human feedback
  • klelatti
    Web version with links, etc:

    https://rlhfbook.com/

    • dang
      Thanks! We've switched to that above from https://arxiv.org/abs/2504.12501, and put the latter in the toptext.
  • iisweetheartii
    [dead]
Guidelines | FAQ | Support | API | Security | Lists | Bookmarklet | Legal | Apply to YC | Contact