AI grading — AI articles, news & research

RESEARCHarXiv CS.CL·5/4/2026

Confidence Estimation in Automatic Short Answer Grading with LLMs

This work investigates confidence estimation in Automatic Short Answer Grading (ASAG) with Large Language Models (LLMs), essential for human-AI collaboration in education. It compares model-based confidence estimation strategies and proposes a hybrid framework to address their limitations.

education LLMs AI grading human-AI interaction