
Confidence Estimation in Automatic Short Answer Grading with LLMs

arXiv cs.CL · May 4, 2026

This work investigates confidence estimation for Large Language Models (LLMs) in Automatic Short Answer Grading (ASAG), a capability essential for human-AI collaboration in education. It compares model-based confidence estimation strategies and proposes a hybrid framework to address their limitations.

education · LLMs · AI grading · human-AI interaction · confidence estimation
