← heapsort-ai

LLM benchmarking

2 items

ARTICLE↑ trendingReddit r/LocalLLaMA·4/17/2026

Qwen 3.6 35B crushes Gemma 4 26B on my tests

The author conducted a personal benchmark where Qwen 3.6 35B significantly outperformed Gemma 4 26B across tests evaluating agentic capabilities, coding, image-to-text synthesis, instruction following, and reasoning. Qwen fixed more issues, showed fewer regressions, and completed the tasks in less time, indicating superior overall performance.

42