Next.js Hacker News
top
|
new
|
ask
|
show
|
jobs
|
GitHub
A complete Llama2 inference engine that fits in 1356 bytes of x86 assembly
27 points by
monax
4 days ago |
discuss
add comment