← heapsort-ai

Legal AI

15 items

RESEARCHarXiv CS.CL·20h ago

Retrieval Augmented Generation Framework for the Nepali Legal Domain Question Answering

This study presents the first application of a Retrieval Augmented Generation (RAG) model for Nepali legal question answering, addressing data scarcity in low-resource languages. Using BM25 on chunked documents, the RAG pipeline achieved high precision and truthfulness, demonstrating its effectiveness in the Nepali legal domain.

54
NEWS↑ trendingReddit r/LocalLLaMA·4/12/2026

Weekend project with Intel B70s

A user is building a high-end system with Intel Arc B70 GPUs and a Gigabyte B850 AI Top motherboard. The goal is to test the Gemma 4 model in legal RAG applications, utilizing a Hermes agent.

38
DOCDEV.to AI·9d ago

AI Automation for Ai For Solo Criminal Defense Attorneys How To Automate Discovery Document Summarization And Timeline Creati...

This quick guide offers solo criminal defense attorneys advice on leveraging AI to automate repetitive tasks like discovery document summarization and timeline creation. It recommends identifying automatable tasks, using free tools, building workflows, and utilizing prompts to standardize outputs.

27
RESEARCHarXiv CS.CL·5/8/2026

A Few Good Clauses: Comparing LLMs vs Domain-Trained Small Language Models on Structured Contract Extraction

This paper evaluates whether a domain-trained Small Language Model (SLM) can outperform frontier Large Language Models on structured contract extraction at radically lower cost. Olava Extract achieved the strongest aggregate performance and highest precision scores, reducing inference cost by 78% to 97% compared with the frontier models tested.

27
RESEARCHarXiv CS.CL·5/4/2026

ViLegalNLI: Natural Language Inference for Vietnamese Legal Texts

This article introduces ViLegalNLI, the first large-scale Vietnamese Natural Language Inference (NLI) dataset specifically constructed for the legal domain. It consists of 42,012 premise-hypothesis pairs derived from official statutory documents, developed using a semi-automatic framework that integrates large language models for hypothesis generation and quality validation.

27
RESEARCHarXiv CS.CL·21d ago

Exploring Lightweight Large Language Models for Court View Generation

The research explores the capabilities of lightweight Large Language Models (LLMs) in Criminal Court View Generation (CVG) and their impact on charge prediction within Legal AI. It systematically investigates architectural differences, model size, and comparison with Deep Neural Networks, introducing the CVGEvalKit framework for evaluation.

27
RESEARCHarXiv CS.CL·8d ago

CanLegalRAGBench: Evaluating Retrieval-Augmented Generation on Canadian Case Law

This paper introduces CanLegalRAGBench, a new Canadian legal QA benchmark for evaluating Retrieval-Augmented Generation (RAG) systems using realistic queries and expert-annotated case law answers. It highlights the sensitivity of retrieval performance, the competitiveness of open-source embedding models, and the limitations of automatic evaluations and LLM hallucinations in generated responses.

27