← heapsort
RESEARCH27

Neural networks for Text-to-Speech evaluation

arXiv CS.CL · April 13, 2026

This research introduces novel neural models to automate the evaluation of Text-to-Speech (TTS) system quality, addressing the limitations of traditional human subjective assessments. It proposes NeuralSBS for relative evaluations and enhancements to MOSNet and WhisperBert for absolute assessments, aiming to approximate expert judgments efficiently.
