RESEARCH27

A Few Good Clauses: Comparing LLMs vs Domain-Trained Small Language Models on Structured Contract Extraction

arXiv CS.CL·May 8, 2026

This paper evaluates whether a domain-trained Small Language Model (SLM) can outperform frontier Large Language Models on structured contract extraction at radically lower cost. Olava Extract achieved the strongest aggregate performance and highest precision scores, reducing inference cost by 78% to 97% compared with the frontier models tested.

LLMs Legal AI SLMs Benchmarking contract extraction

Read original ↗