Next.js Hacker News
  • top|
  • new|
  • ask|
  • show|
  • jobs|
  • GitHub
A complete Llama2 inference engine that fits in 1356 bytes of x86 assembly
27 points by monax 4 days ago | discuss
    Guidelines | FAQ | Support | API | Security | Lists | Bookmarklet | Legal | Apply to YC | Contact