RESEARCHarXiv CS.CL·5/4/2026
Confidence Estimation in Automatic Short Answer Grading with LLMs
This work investigates confidence estimation in Automatic Short Answer Grading (ASAG) with Large Language Models (LLMs), essential for human-AI collaboration in education. It compares model-based confidence estimation strategies and proposes a hybrid framework to address their limitations.
27