
ViLegalNLI: Natural Language Inference for Vietnamese Legal Texts

arXiv cs.CL · May 4, 2026

This article introduces ViLegalNLI, the first large-scale Vietnamese Natural Language Inference (NLI) dataset specifically constructed for the legal domain. It consists of 42,012 premise-hypothesis pairs derived from official statutory documents, developed using a semi-automatic framework that integrates large language models for hypothesis generation and quality validation.
