N
Hacker Next
new
past
show
ask
show
jobs
submit
login
▲
FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels
(
arxiv.org
)
23 points by
PaulHoule
1 days ago
|
1 comment
add comment
Rendered at 23:39:07 GMT+0000 (Coordinated Universal Time) with Cloudflare Workers.
Reubend 24 hours ago
[-]
Paper looks great. No GitHub link that I can find though. Maybe I'll take a crack at an implementation if I've got some extra free time.