RESEARCH27
Confidence Estimation in Automatic Short Answer Grading with LLMs
arXiv CS.CLΒ·May 4, 2026
This work investigates confidence estimation in Automatic Short Answer Grading (ASAG) with Large Language Models (LLMs), essential for human-AI collaboration in education. It compares model-based confidence estimation strategies and proposes a hybrid framework to address their limitations.
Read original β