← heapsort-ai

legal tech

47 items

CASE↑ trendingReddit r/MachineLearning·4/10/2026

[D] Large scale OCR [D]

Um usuário busca a forma mais econômica e rápida (1 semana) de realizar OCR em 50 milhões de páginas de documentos legais, focando apenas no texto e sem se preocupar com o layout. Este é um desafio prático de processamento de documentos em larga escala com restrições de tempo e custo.

36
RESEARCHarXiv CS.CL·1d ago

HKJudge: A Legal Discourse-Annotated Corpus for Interpreting What Courts Find, How They Reason, and What They Rule

The HKJudge project introduces the first sentence-level, expert-annotated legal discourse corpus of Hong Kong criminal judgments, comprising approximately 290k sentences. It utilizes a two-tier discourse schema to identify what courts find, how they reason, and what they rule, with high inter-annotator agreement.

36
RESEARCHarXiv CS.CL·4/24/2026

AITP: Traffic Accident Responsibility Allocation via Multimodal Large Language Models

AITP is introduced as a multimodal large language model designed for traffic accident responsibility allocation, enhancing reasoning through Multimodal Chain-of-Thought and integrating legal knowledge via Retrieval-Augmented Generation. The research also presents DecaTARA, a comprehensive decathlon-style benchmark with 67,941 annotated videos and 195,821 question-answer pairs.

29