RESEARCHarXiv CS.CL·29d ago
IntentGrasp: A Comprehensive Benchmark for Intent Understanding
IntentGrasp is a new comprehensive benchmark for evaluating the intent understanding capability of Large Language Models, derived from 49 high-quality corpora. Extensive evaluations on 20 LLMs showed unsatisfactory performance, with scores below 60% on the All Set and 25% on the Gem Set.
27