← heapsort
RESEARCH27

Cross-Lingual Response Consistency in Large Language Models: An ILR-Informed Evaluation of Claude Across Six Languages

arXiv CS.CLΒ·May 1, 2026

This paper introduces an ILR-informed framework to evaluate Claude (Sonnet 4.6) for cross-lingual response consistency across six languages. It analyzes responses to semantically equivalent prompts using quantitative metrics and expert ILR qualitative assessment, revealing language-specific variations like response length differences and surface divergence in creative clusters.

Read original β†—