RESEARCHarXiv CS.CL·5/4/2026
ViLegalNLI: Natural Language Inference for Vietnamese Legal Texts
This article introduces ViLegalNLI, the first large-scale Vietnamese Natural Language Inference (NLI) dataset specifically constructed for the legal domain. It consists of 42,012 premise-hypothesis pairs derived from official statutory documents, developed using a semi-automatic framework that integrates large language models for hypothesis generation and quality validation.
27