← heapsort-ai

local inference

16 items

CASE↑ trendingReddit r/LocalLLaMA·4/23/2026

Qwen 3.6 27B is a BEAST

A user reports that Qwen 3.6 27B, run locally on a laptop, excels at data science tasks like tool calls and data transformation debugging. Its performance was so impressive that they are considering canceling cloud subscriptions, finding it perfect for pyspark/python work.

56
ARTICLE↑ trendingReddit r/LocalLLaMA·4/22/2026

Qwen3 TTS is seriously underrated - I got it running locally in real-time and it's one of the most expressive open TTS models I've tried

The author revisited an old real-time, local ASR->LLM->TTS pipeline project and was pleasantly surprised by Qwen3 TTS. After significant experimentation, they managed to get Qwen3 TTS working reliably for local streaming, praising its expressiveness and suitable architecture.

Qwen3 TTS is seriously underrated - I got it running locally in real-time and it's one of the most expressive open TTS models I've tried
43
ARTICLE↑ trendingReddit r/LocalLLaMA·4/19/2026

Is anyone getting real coding work done with Qwen3.6-35B-A3B-UD-Q4_K_M on a 32GB Mac in opencode, claude code or similar?

A user is attempting to perform real coding tasks with Qwen3.6-35B on a 32GB M2 Macbook Pro, encountering memory exhaustion and context window management issues. Despite the model identifying the essence of a bug, it struggles with implementation as critical information is lost during context compaction.

39
ARTICLE↑ trendingReddit r/LocalLLaMA·4/15/2026

Gemma4 26b & E4B are crazy good, and replaced Qwen for me!

The user describes their previous AI setup before switching to Gemma4, detailing the hardware configuration (GPUs and RAM) and the specific Qwen models used for various tasks. They explain the roles of different Qwen versions (3.5 4B, 30b, 27b, 80B, 122b) for semantic routing, general chat, reasoning, code generation, and knowledge retrieval, based on their quantization and context needs.

36
NEWSDEV.to AI·4/19/2026

Gemini App Launches on Mac

Google has launched the Gemini App for macOS, representing its first major desktop expansion and a strategic shift towards local AI execution. This allows users to run Gemini models directly on their machines for faster local inference, reduced cloud dependency, and improved privacy and performance.

31