← heapsort-ai

Software engineering

157 items

RESEARCHarXiv CS.LG·4/21/2026

Beyond Verifiable Rewards: Rubric-Based GRM for Reinforced Fine-Tuning SWE Agents

This research introduces a rubric-based Generative Reward Model (GRM) to enhance Reinforced Fine-Tuning (RFT) for LLM Agents in Software Engineering (SWE) tasks. By providing richer learning signals beyond binary terminal rewards, this approach shapes intermediate behaviors and significantly improves the quality of the resolution process.

31
ARTICLEDEV.to AI·4/19/2026

What if I told you that the future of software development hinges not on human expertise but on AI efficiency?

The author shares a transformative experience witnessing AI-generated code rapidly replace a micro-SaaS service, challenging previous doubts about LLMs' impact on SaaS. This economic and efficiency shift promises a new era in software creation, drastically cutting development time and demanding adaptation from the industry.

29
ARTICLEDEV.to AI·16d ago

Stop Engineering Prompts: How an Eval-First Harness Let Us Ship 25 Algorithm Versions Autonomously

This article details the creation of an eval-first AI harness that enabled the autonomous shipment of 25 algorithm versions in 13 days. The methodology focuses on immutable test sets and independent reviews to ensure changes do not cause regressions. The author emphasizes that the harness, rather than just prompt engineering or full automation, was key to the pace and safety of development.

28
ARTICLEDEV.to AI·4/23/2026

Top 10 Vibe Coding Agency Companies to Watch in 2026

The article warns against the "vibe coding hangover," where AI-generated MVPs, created by describing goals in natural language to LLMs, lead to spaghetti code, unauthenticated APIs, and hardcoded secrets due to a lack of disciplined engineering. It highlights the rapid adoption of AI coding tools but stresses the importance of underlying engineering rigor to avoid pitfalls.

28