← heapsort-ai

code generation

107 items

ARTICLE↑ trendingReddit r/LocalLLaMA·4/23/2026

Qwen3.6 can code

A user, frustrated with OpenAI models, tried Qwen3.6-27b for Svelte 5 code generation and got a perfect result, despite it taking longer. They anticipate interesting developments in the next 12 months, despite the informal nature of the evaluation.

52
CASE↑ trendingReddit r/LocalLLaMA·4/17/2026

Qwen3.6. This is it.

A user recounts their experience with the Qwen3.6 model, which successfully built and tested a tower defense game, demonstrating the ability to identify and fix its own bugs. The AI confirmed builds using screenshots, astonishing the user with its advanced capabilities.

Qwen3.6. This is it.
43
RESEARCH↑ trendingReddit r/MachineLearning·5/4/2026

AutoBe benchmark: structured harness narrows frontier-vs-local gap in backend generation [D]

AutoBe is a new benchmark for end-to-end backend generation, where natural language requests produce six structured outputs via structured function calls. The benchmark reveals that backend quality is more influenced by harness design than model prestige, with local models performing comparably to frontier models at a significantly lower cost.

43
RESEARCH↑ trendingReddit r/MachineLearning·5/7/2026

META Superintelligence Lab Presents: ProgramBench: Can SOTA AI Recreate Real Executable Programs(ffmpeg, SQLite, ripgrep) From Scratch Without The Internet?

Meta Superintelligence Lab introduces ProgramBench, an initiative testing the ability of advanced AIs to recreate executable programs like ffmpeg and SQLite from scratch, without internet access. This study aims to explore the limits of AI code generation. The research focuses on evaluating the autonomy and completeness of AI models in complex software synthesis.

42
CASE↑ trendingReddit r/LocalLLaMA·4/23/2026

Been using PI Coding Agent with local Qwen3.6 35b for a while now and its actually insane

The user reports an extremely positive and effective experience with the PI Coding Agent, utilizing a local Qwen3.6 35b model for production projects. Success was attributed to a custom "plan-first skill file" that enforces a structured planning workflow, ensuring step-by-step execution and plan approval before any coding.

42
ARTICLE↑ trendingReddit r/LocalLLaMA·4/19/2026

Is anyone getting real coding work done with Qwen3.6-35B-A3B-UD-Q4_K_M on a 32GB Mac in opencode, claude code or similar?

A user is attempting to perform real coding tasks with Qwen3.6-35B on a 32GB M2 Macbook Pro, encountering memory exhaustion and context window management issues. Despite the model identifying the essence of a bug, it struggles with implementation as critical information is lost during context compaction.

39
RESEARCHarXiv CS.AI·5d ago

StepPRM-RTL: Stepwise Process-Reward Guided LLM Fine-Tuning for Enhanced RTL Synthesis

StepPRM-RTL is a novel framework that enhances LLM-based RTL code generation by combining stepwise trajectory modeling, process-reward modeling (PRM), and retrieval-augmented fine-tuning (RAFT). It uses dense feedback from a PRM to guide reinforcement-style updates and Monte Carlo Tree Search (MCTS) to enrich the training dataset.

33