← heapsort-ai

Retrieval Augmented Generation

13 items

RESEARCHarXiv CS.CL·20h ago

Retrieval Augmented Generation Framework for the Nepali Legal Domain Question Answering

This study presents the first application of a Retrieval Augmented Generation (RAG) model for Nepali legal question answering, addressing data scarcity in low-resource languages. Using BM25 on chunked documents, the RAG pipeline achieved high precision and truthfulness, demonstrating its effectiveness in the Nepali legal domain.

54
RESEARCHarXiv CS.AI·4/15/2026

Memory as Metabolism: A Design for Companion Knowledge Systems

This paper proposes a companion-specific governance profile for single-user knowledge wikis, addressing the unique failure mode of entrenchment under user-coupled drift. It discusses emerging personal AI memory architectures from 2026, including RAG-based systems and wiki-style designs, alongside established academic and production memory systems.

28
RESEARCHarXiv CS.CL·4/6/2026

Principled and Scalable Diversity-Aware Retrieval via Cardinality-Constrained Binary Quadratic Programming

Este trabalho propõe uma formulação rigorosa para a recuperação consciente da diversidade em Geração Aumentada por Recuperação (RAG), abordando a falta de garantias teóricas e escalabilidade dos métodos existentes. A solução utiliza programação quadrática binária com restrição de cardinalidade (CCBQP) e um algoritmo baseado em Frank-Wolfe, demonstrando desempenho superior na fronteira de Pareto de relevância-diversidade e maior velocidade.

28
RESEARCHarXiv CS.AI·5/1/2026

Think it, Run it: Autonomous ML pipeline generation via self-healing multi-agent AI

This paper proposes a unified multi-agent AI architecture to automate end-to-end machine learning (ML) pipeline generation from datasets and natural-language goals. The five-agent system integrates RAG, an explainable hybrid recommender, and an LLM-based self-healing mechanism, achieving an 84.7% success rate and improved robustness.

27
RESEARCHarXiv CS.CL·4/15/2026

Benchmarking Deflection and Hallucination in Large Vision-Language Models

This paper introduces VLM-DeflectionBench, a new benchmark for Large Vision-Language Models (LVLMs) focusing on deflection and hallucination when dealing with conflicting or insufficient evidence. It also proposes a dynamic data curation pipeline to maintain benchmark difficulty over time and a fine-grained evaluation protocol to disentangle model behavior.

27
RESEARCHarXiv CS.CL·4/30/2026

CogRAG+: Cognitive-Level Guided Diagnosis and Remediation of Memory and Reasoning Deficiencies in Professional Exam QA

CogRAG+ is a training-free framework designed to diagnose and remediate memory and reasoning deficiencies in large language models for professional exam QA. It decouples and aligns retrieval and reasoning with human cognitive hierarchies, employing Reinforced Retrieval and cognition-stratified Constrained Reasoning to enhance accuracy and consistency.

27
RESEARCHarXiv CS.CL·5/8/2026

AdaGATE: Adaptive Gap-Aware Token-Efficient Evidence Assembly for Multi-Hop Retrieval-Augmented Generation

AdaGATE is a training-free evidence controller for multi-hop Retrieval-Augmented Generation (RAG) designed to address noisy or redundant retrieved evidence in limited contexts. It frames evidence selection as a token-constrained repair problem, combining entity-centric gap tracking and targeted micro-query generation to balance coverage, corroboration, and novelty.

27
RESEARCHarXiv CS.CL·8d ago

CanLegalRAGBench: Evaluating Retrieval-Augmented Generation on Canadian Case Law

This paper introduces CanLegalRAGBench, a new Canadian legal QA benchmark for evaluating Retrieval-Augmented Generation (RAG) systems using realistic queries and expert-annotated case law answers. It highlights the sensitivity of retrieval performance, the competitiveness of open-source embedding models, and the limitations of automatic evaluations and LLM hallucinations in generated responses.

27
RESEARCHarXiv CS.CL·4/30/2026

Generative AI-Based Virtual Assistant using Retrieval-Augmented Generation: An evaluation study for bachelor projects

This paper evaluates a Generative AI-based virtual assistant utilizing Retrieval-Augmented Generation (RAG) to support Maastricht University students with project regulations. The system aims to address challenges like hallucinations and provide accurate, context-specific responses by integrating domain-specific knowledge.

27