← heapsort-ai

LLM

612 items

ARTICLEDEV.to AI·4/16/2026

Complete Guide to AI-Powered Zero-Day Vulnerability Discovery — Claude Opus 4.6's 500+ Zero-Days and the Security Paradigm Shift

This article analyzes how Claude Opus 4.6 discovered over 500 zero-day vulnerabilities, including a 23-year-old Linux kernel bug, transforming LLMs into autonomous security research agents. It explores the technical mechanisms and DevSecOps implications of this AI-driven vulnerability discovery.

27
ARTICLEDEV.to AI·5/8/2026

Web4.0 Is Coming

This article explores the rarity of experienced "LLM integration developer" roles, even as AI emerges as a revolutionary computing platform. It points out the challenge for companies to find engineers with complete AI development experience given the rapid rise of LLM development.

27
ARTICLEDEV.to AI·5/8/2026

Stop Rereading Your Documents. Let the AI Study Them Once.

This content highlights the inefficiency of naive RAG workflows that repeatedly re-synthesize answers for static knowledge, incurring costs and inconsistencies. It advocates for compiling knowledge at ingest time, a pattern proposed by Andrej Karpathy (llm-wiki.md), where an LLM reads a document once to build structured wiki pages. Zenii reportedly implements this optimized pattern out-of-the-box.

27
RESEARCHarXiv CS.CL·4/13/2026

SynDocDis: A Metadata-Driven Framework for Generating Synthetic Physician Discussions Using Large Language Models

SynDocDis is a novel framework that utilizes Large Language Models and de-identified case metadata to generate clinically accurate synthetic physician-to-physician dialogues. This approach addresses the scarcity of real discussion data due to privacy concerns, aiming to enrich AI agents with valuable clinical knowledge.

27
ARTICLEDEV.to AI·15d ago

Most people starting with local LLMs jump straight to 4-bit quantization because it's fast and uses

This article compares 16-bit, 8-bit, and 4-bit LLM quantization, revealing that 4-bit, while faster, significantly compromises quality on reasoning and math tasks. The real trade-off is between the task and required precision, with 8-bit being optimal for precision-demanding tasks, offering minimal quality loss with only a slight speed reduction. Quantization choice should be based on the task and hardware considerations, not solely on hardware.

27
RESEARCHarXiv CS.LG·4/22/2026

Easy Samples Are All You Need: Self-Evolving LLMs via Data-Efficient Reinforcement Learning

This research introduces EasyRL, a novel data-efficient reinforcement learning approach for self-evolving LLMs, designed to overcome high annotation costs and performance issues in existing methods. Inspired by cognitive learning theory, EasyRL integrates knowledge transfer from easy labeled data with a progressive divide-and-conquer strategy for difficult unlabeled data.

27
RESEARCHarXiv CS.AI·5/6/2026

Programmatic Context Augmentation for LLM-based Symbolic Regression

This paper introduces a novel LLM-based evolutionary search framework for symbolic regression, addressing the limitations of existing methods that rely solely on scalar evaluation metrics. It incorporates programmatic context augmentation to enable code-based data analysis and richer information extraction, aiming to improve the discovery of mathematical expressions.

27
RESEARCHarXiv CS.CL·14d ago

CroCo: Cross-Lingual Contrastive Preference Tuning on Self-Generations

This work introduces CroCo, a method for cross-lingual contrastive preference tuning on self-generated responses from LLMs, demonstrating effective transfer across 14 languages without language-specific preference annotations. An English-trained reward model yields useful rankings across most languages, improving existing models and preventing catastrophic forgetting, provided on-policy data is used.

27