← heapsort
NEWS↑ trending44

Cloudflare open-sources lossless LLM compression tool

Reddit r/LocalLLaMAΒ·April 18, 2026

Cloudflare released Unweight, a lossless compression system that reduces LLM size by 15-22% without sacrificing output accuracy. The tool, which saves roughly 3 GB of VRAM on Nvidia H100 GPUs for Llama-3.1-8B, has been open-sourced on GitHub with plans to extend compression.

Read original β†—