Retrieval Augmented Generation

13 items

RESEARCH · arXiv CS.CL · 20h ago

Retrieval Augmented Generation Framework for the Nepali Legal Domain Question Answering

This study presents the first application of a Retrieval Augmented Generation (RAG) model for Nepali legal question answering, addressing data scarcity in low-resource languages. Using BM25 on chunked documents, the RAG pipeline achieved high precision and truthfulness, demonstrating its effectiveness in the Nepali legal domain.

Retrieval Augmented Generation Legal AI Question Answering natural language processing

RESEARCH · arXiv CS.CL · 4/23/2026

Cognis: Context-Aware Memory for Conversational AI Agents

Lyzr Cognis introduces a unified memory architecture for conversational AI agents, addressing the lack of persistent memory through a multi-stage retrieval pipeline. It combines a dual-store backend, context-aware ingestion, and temporal boosting, achieving state-of-the-art performance on two independent benchmarks.

Retrieval Augmented Generation research memory Conversational AI

RESEARCH · arXiv CS.AI · 4/15/2026

Memory as Metabolism: A Design for Companion Knowledge Systems

This paper proposes a companion-specific governance profile for single-user knowledge wikis, addressing the unique failure mode of entrenchment under user-coupled drift. It discusses emerging personal AI memory architectures from 2026, including RAG-based systems and wiki-style designs, alongside established academic and production memory systems.

Retrieval Augmented Generation LLMs Companion AI knowledge systems

ARTICLE · DEV.to AI · 25d ago

Why your local LLM knowledge base gives bad answers (and how to fix it)

Local LLMs often give poor answers from personal knowledge bases because of problems in the retrieval layer rather than in the model itself. This article examines the problem and explains how the retrieval pipeline works.

Retrieval Augmented Generation knowledge base Local AI LLM

RESEARCH · arXiv CS.CL · 4/6/2026

Principled and Scalable Diversity-Aware Retrieval via Cardinality-Constrained Binary Quadratic Programming

This work proposes a rigorous formulation of diversity-aware retrieval for Retrieval-Augmented Generation (RAG), addressing the lack of theoretical guarantees and the limited scalability of existing methods. The solution uses cardinality-constrained binary quadratic programming (CCBQP) and a Frank-Wolfe-based algorithm, demonstrating superior performance on the relevance-diversity Pareto frontier and greater speed.

Retrieval Augmented Generation Scalability Diversity-aware Retrieval Binary Quadratic Programming

RESEARCH · arXiv CS.AI · 5/1/2026

Think it, Run it: Autonomous ML pipeline generation via self-healing multi-agent AI

This paper proposes a unified multi-agent AI architecture to automate end-to-end machine learning (ML) pipeline generation from datasets and natural-language goals. The five-agent system integrates RAG, an explainable hybrid recommender, and an LLM-based self-healing mechanism, achieving an 84.7% success rate and improved robustness.

Retrieval Augmented Generation multi-agent AI large language models ML Automation

DOC · DEV.to AI · 4/27/2026

From Static Data to Conversational AI: Building a RAG-Powered Customer Agent (Part 2)

Part 2 of this series focuses on building the interface and reasoning engine for a RAG-powered customer agent. It details how to connect a messaging front-end (Telegram Bot API) to a vector database (Pinecone) and an LLM using Make.com to provide accurate, real-time responses.

Retrieval Augmented Generation LLMs Vector Databases customer service AI

DOC · DEV.to AI · 5/1/2026

Prompt engineering techniques

This document describes a prompt engineering technique that improves AI model response quality by replacing static examples with semantically similar ones retrieved from a vector database. It involves indexing successful conversations and injecting the most relevant examples into the system prompt for complex tasks.

Retrieval Augmented Generation AI models prompt engineering Vector Databases

RESEARCH · arXiv CS.CL · 4/15/2026

Benchmarking Deflection and Hallucination in Large Vision-Language Models

This paper introduces VLM-DeflectionBench, a new benchmark for Large Vision-Language Models (LVLMs) focusing on deflection and hallucination when dealing with conflicting or insufficient evidence. It also proposes a dynamic data curation pipeline to maintain benchmark difficulty over time and a fine-grained evaluation protocol to disentangle model behavior.

Retrieval Augmented Generation hallucination Benchmarking LVLM

RESEARCH · arXiv CS.CL · 4/30/2026

CogRAG+: Cognitive-Level Guided Diagnosis and Remediation of Memory and Reasoning Deficiencies in Professional Exam QA

CogRAG+ is a training-free framework designed to diagnose and remediate memory and reasoning deficiencies in large language models for professional exam QA. It decouples and aligns retrieval and reasoning with human cognitive hierarchies, employing Reinforced Retrieval and cognition-stratified Constrained Reasoning to enhance accuracy and consistency.

Retrieval Augmented Generation natural language processing AI Reasoning large language models

RESEARCH · arXiv CS.CL · 5/8/2026

AdaGATE: Adaptive Gap-Aware Token-Efficient Evidence Assembly for Multi-Hop Retrieval-Augmented Generation

AdaGATE is a training-free evidence controller for multi-hop Retrieval-Augmented Generation (RAG) designed to address noisy or redundant retrieved evidence in limited contexts. It frames evidence selection as a token-constrained repair problem, combining entity-centric gap tracking and targeted micro-query generation to balance coverage, corroboration, and novelty.

Retrieval Augmented Generation AI models Multi-hop RAG Evidence Selection

RESEARCH · arXiv CS.CL · 8d ago

CanLegalRAGBench: Evaluating Retrieval-Augmented Generation on Canadian Case Law

This paper introduces CanLegalRAGBench, a new Canadian legal QA benchmark for evaluating Retrieval-Augmented Generation (RAG) systems using realistic queries and expert-annotated case law answers. It highlights the sensitivity of retrieval performance, the competitiveness of open-source embedding models, the limitations of automatic evaluation, and LLM hallucinations in generated responses.

Retrieval Augmented Generation LLMs evaluation Legal AI

RESEARCH · arXiv CS.CL · 4/30/2026

Generative AI-Based Virtual Assistant using Retrieval-Augmented Generation: An evaluation study for bachelor projects

This paper evaluates a Generative AI-based virtual assistant utilizing Retrieval-Augmented Generation (RAG) to support Maastricht University students with project regulations. The system aims to address challenges like hallucinations and provide accurate, context-specific responses by integrating domain-specific knowledge.

Retrieval Augmented Generation education Virtual Assistants large language models