← heapsort
RESEARCH27

Memory Architectures for Multi-Turn Text-to-SQL: A Benchmark and Empirical Study

arXiv CS.CLΒ·May 27, 2026

This research introduces EnterpriseMem-Bench, a novel multi-turn Text-to-SQL benchmark with 300 sessions and 1,400 turns from enterprise domains. It empirically evaluates five frontier models, including GPT and Claude variants, revealing that stateless multi-turn Text-to-SQL models achieve zero execution accuracy by Turn 3.

Read original β†—