ARTICLE↑ trending42

Built a normalizer so WER stops penalizing formatting differences in STT evals! [P]

Reddit r/MachineLearning·April 23, 2026

This content addresses the issue of Word Error Rate (WER) penalizing formatting differences in STT evaluations, leading to inaccurate scores. To solve this, the open-source `gladia-normalization` library was released, which normalizes transcripts before WER calculation, ensuring a fairer assessment of recognition quality.

Open Source evaluation NLP Speech-to-Text

Read original ↗