Quality Assurance

20 items

ARTICLEDEV.to AI·1d ago

AI-Driven Test Automation Is Not a Testing Strategy, It's a Decision Shift

AI-assisted development transforms the nature of testing, shifting the bottleneck to verification and risk judgment rather than merely increasing the volume of tests. Successful teams are deliberate about what to test and review, establishing clear boundaries for AI's role in the workflow.

development workflow AI Software Testing test automation

ARTICLE↑ trendingReddit r/MachineLearning·4/27/2026

How do you test AI agents in production? The unpredictability is overwhelming.[D]

A QA professional highlights the overwhelming challenges of testing non-deterministic LLM-based AI agents in production, where traditional quality assurance methods fail. They struggle with the variability of outputs and reasoning chains, finding existing approaches like snapshot testing and human evaluation insufficient or unscalable.

production AI testing Quality Assurance LLM

ARTICLEDEV.to AI·4/22/2026

What an AI Publishing Pipeline Learns When Image Generation and Editorial QA Run on Different Clocks: Practical Notes for Builders

This article explores the challenges in AI publishing pipelines, highlighting that problems arise in ensuring editorial QA, preserving source truth, and handling platform-specific variants, rather than just draft generation speed. It emphasizes that system design is crucial to guarantee the final content matches the original intent, even when image generation and editorial QA run on different clocks.

AI publishing System design workflow automation content management

ARTICLEDEV.to AI·3d ago

OpenClaw Diff Artifacts: Review Agent Edits Before They Ship

This article highlights the risks of unreviewed AI agent changes in production and introduces OpenClaw's diffs plugin. The plugin generates read-only diff artifacts from before-and-after text or patches, enabling thorough human inspection before deployment.

diff artifacts code review Quality Assurance AI agents

ARTICLEDEV.to AI·4/19/2026

AI Doesn't Fix Bad Engineering — It Amplifies It (Here's What To Do Instead)

This content argues that AI tools amplify existing engineering quality, making good teams faster and bad teams even worse by accelerating poor practices. It emphasizes that AI success should be measured by quality improvements rather than mere velocity, urging for well-defined tasks and clear prompts.

prompt engineering productivity Software engineering AI development

DOCDEV.to AI·4/18/2026

Your AI Assistant is Not a Proofreader: A Quality Assurance Framework for Self-Publishers

The content warns that AI automates execution, not judgment, especially in self-publishing formatting. It emphasizes the need for human quality assurance and introduces a three-step framework for reviewing AI-generated output.

self-publishing AI Quality Assurance

RESEARCHarXiv CS.CL·4/7/2026

Are Arabic Benchmarks Reliable? QIMMA's Quality-First Approach to LLM Evaluation

QIMMA é uma nova plataforma de avaliação de LLMs em árabe que prioriza a qualidade, realizando validação sistemática de benchmarks. Ela resolve problemas de qualidade em benchmarks existentes através de revisão automatizada e humana, resultando em um conjunto de avaliação reprodutível e multi-tarefa com mais de 52 mil amostras.

Arabic LLM NLP Benchmarks Quality Assurance

ARTICLEDEV.to AI·4d ago

Your Test Suite Is Lying To You

This article discusses the danger in AI-assisted development where AI-generated test suites, written after the code, can fail to identify bugs, instead documenting existing behavior. This leads to passing tests and shipped bugs, masking real problems and silently violating specifications.

bugs CI/CD Software Testing AI development

ARTICLEDEV.to AI·10d ago

Claude Code Hooks I Ship in Every Project: 6 Patterns

This article details six essential 'code hooks' that the author integrates into every AI project, specifically with Claude, to proactively catch errors before content goes live. These hooks address limitations of Claude's memory files by automating checks for brand compliance, layout, accessibility, SEO, and post-publish verification, ensuring high-quality output.

code hooks Claude AI automation AI development

DOCDEV.to AI·5/2/2026

AI as Your eBook QA Partner: Mastering Reflowable Layouts

This content explores how AI can act as an eBook QA partner, helping self-publishers master reflowable layouts. It details how to leverage AI automation to apply and validate CSS rules, ensuring a perfect reading experience across various devices.

Publishing self-publishing AI eBooks

ARTICLEDEV.to AI·24d ago

One AI code review pass isn't enough. Here's the loop that actually catches bugs.

A single pass of AI code review, despite giving a "LGTM" response, is often inadequate and statistically worse than a human's initial review, leading to costly production bugs. While AI effectively catches minor issues, it frequently misses critical problems like cross-file invariants, race conditions, and silent regressions that require a more robust review process.

Software Development code quality bug detection AI code review

ARTICLEDEV.to AI·5/8/2026

Record-and-Playback Test Automation Is Not Enough for the AI Era

Record-and-playback test automation, while historically useful, is no longer a sufficient core product strategy in the AI era. It creates a painful workflow and falls behind AI-native testing workflows.

Software Development AI test automation Quality Assurance

ARTICLEDEV.to AI·5/8/2026

The QA and Code Review Checklist for AI-Generated PRs That Nobody Wrote

This article discusses the challenges of reviewing AI-generated pull requests, which can introduce subtle bugs and misleadingly coherent code. The author developed a specialized review playbook after experiencing issues with AI-assisted code in production, highlighting how AI breaks traditional code review assumptions.

code review Software engineering developer tools AI development

DOCDEV.to AI·5/8/2026

Your AI-Powered Pre-Publish Checklist: From Automation to Assurance

This content discusses leveraging AI for eBook formatting while emphasizing the critical need for human review in quality assurance. It outlines a three-step framework for auditing AI output, not the process, to ensure publication readiness. The article positions AI as a powerful tool for structural tasks, requiring strategic oversight and a meticulous final review from the author.

self-publishing learning AI tools publishing workflow

NEWSAWS Machine Learning Blog·5/4/2026

Introducing agent quality optimization in AgentCore, now in preview

AgentCore introduces a new agent quality optimization feature, now in preview, to help maintain AI agent performance over time. It allows users to generate recommendations from production traces, validate them through batch evaluation and A/B testing, and confidently deploy improvements.

development Performance optimization Quality Assurance AI agents

CASEOpenAI Blog·18d ago

How Virgin Atlantic ships faster with Codex

Virgin Atlantic successfully used Codex to launch its revamped mobile app ahead of a fixed holiday travel deadline. This implementation achieved near-total unit test coverage and zero P1 defects.

Software Development DevOps mobile app development project success

ARTICLEDEV.to AI·4/9/2026

Manual testing isn't dying, but manual testers need to change

O autor, CEO de uma empresa de QA, argumenta que o teste manual não está morrendo, apesar da pressão por automação total. Ele defende que, embora testes repetitivos devam ser automatizados, há uma demanda crescente por testadores manuais qualificados para tarefas complexas.

Manual Testing Software Testing automation Quality Assurance

DOCDEV.to AI·20d ago

Software Testing Life Cycle Explained for Modern Development Teams

The Software Testing Life Cycle (STLC) is a structured process crucial for modern development teams, helping to identify issues early and ensure software quality. It organizes testing into multiple phases to validate that an application works as expected before release.

Software Development agile STLC Software Testing

DOCDEV.to AI·5/3/2026

Testing Localization at Scale: A Deep Dive with TestSprite

This content delves into testing localization at scale, providing a deep dive with the TestSprite tool. It explores methodologies and challenges associated with ensuring quality in globalized products.

Testing TestSprite localization Quality Assurance

DOCGoogle for Developers (YouTube)·27d ago

3 tips for stopping flaky tests

This document provides three essential tips for addressing flaky tests, which are tests that produce inconsistent results without code changes. It focuses on strategies to enhance test reliability and ensure more stable software development cycles.

Testing Best Practices Software Testing Flaky Tests Quality Assurance