A Few Good Clauses: Comparing LLMs vs Domain-Trained Small Language Models on Structured Contract Extraction
This paper evaluates whether a domain-trained Small Language Model (SLM) can outperform frontier Large Language Models (LLMs) on structured contract extraction at a fraction of the cost. Olava Extract, the domain-trained SLM, achieved the strongest aggregate performance and the highest precision scores while reducing inference cost by 78% to 97% compared with the frontier models tested.