← heapsort-ai

LLMs

723 items

DOCDEV.to AI·13d ago

99. Build a Chatbot With Memory

This content explains how to build a chatbot with memory, overcoming the stateless nature of LLMs. It details patterns like conversation history, sliding window memory, summary memory, and entity memory, including using LangChain to build a multi-turn chatbot and persist memory across sessions.

27
ARTICLEDEV.to AI·4/6/2026

AI Citation Registries as Information Infrastructure for AI Systems

O conteúdo aborda como sistemas de IA podem deturpar a fonte de informação, como a autoridade emissora de um aviso, ao processar fragmentos de texto e perder o contexto original. Isso ressalta a necessidade de "AI Citation Registries" para preservar atributos cruciais de jurisdição e autoria, garantindo a precisão e a integridade dos dados gerados.

27
ARTICLEDEV.to AI·5/5/2026

Building Agent Memory: Episodic vs Semantic Stores

The text discusses the concept of "agent memory" in AI systems, highlighting the challenge of agents retaining context from previous sessions due to fresh message arrays. This leads to issues where agents forget user preferences, increasing costs and latency when attempts are made to compensate with lengthy system prompts.

27
RESEARCHarXiv CS.CL·4/15/2026

Leveraging Weighted Syntactic and Semantic Context Assessment Summary (wSSAS) Towards Text Categorization Using LLMs

This paper introduces the Weighted Syntactic and Semantic Context Assessment Summary (wSSAS), a deterministic framework to optimize text categorization using LLMs. It addresses LLM limitations by organizing text hierarchically and employing a Signal-to-Noise Ratio (SNR) to focus on high-value semantic features.

27
RESEARCHarXiv CS.LG·4/15/2026

When Reasoning Models Hurt Behavioral Simulation: A Solver-Sampler Mismatch in Multi-Agent LLM Negotiation

This paper investigates how enhanced reasoning in language models can harm the fidelity of behavioral simulations, particularly when the goal is to sample boundedly rational behavior rather than solve a strategic problem. The authors identify a "solver-sampler mismatch" where LLMs over-optimize, collapsing compromise-oriented behavior and leading to diversity without fidelity in outcomes.

27
RESEARCHarXiv CS.LG·4/15/2026

Schema-Adaptive Tabular Representation Learning with LLMs for Generalizable Multimodal Clinical Reasoning

This research introduces "Schema-Adaptive Tabular Representation Learning," a novel method using Large Language Models (LLMs) to generate transferable tabular embeddings. By semantically encoding structured variables into natural language, it enables zero-shot alignment across varying EHR schemas in clinical medicine without manual feature engineering.

27
RESEARCHarXiv CS.LG·4/14/2026

Deliberative Alignment is Deep, but Uncertainty Remains: Inference time safety improvement in reasoning via attribution of unsafe behavior to base model

This research investigates Deliberative Alignment in LLMs, a method designed to improve safety by distilling reasoning capabilities from stronger models. It uncovers an alignment gap between teacher and student models, showing that student models can retain unsafe behaviors from the base model despite learning advanced reasoning patterns. The paper proposes a BoN sampling method to address these challenges.

27
RESEARCHarXiv CS.CL·5/5/2026

Can AI Debias the News? LLM Interventions Improve Cross-Partisan Receptivity but LLMs Overestimate Their Own Effectiveness

This research paper explores whether LLMs can debias partisan news to improve cross-partisan receptivity among conservative readers. It found that a substantive reframing by LLMs significantly increased conservatives' trust and willingness to engage with liberal news headlines, though LLMs overestimate their own effectiveness.

27