← heapsort
RESEARCH↑ trending38

QWEN3.6 + ik_llama is fast af

Reddit r/LocalLLaMAΒ·April 19, 2026
QWEN3.6 + ik_llama is fast af

A user reported running the Qwen3.6 + ik_llama model at over 50 tokens/second with a 200k context window on 16GB VRAM and 32GB RAM. This marks a significant performance benchmark for large language models.

Read original β†—