RESEARCH27
Latent Cache Flow: Model-to-Model Communication Without Text
arXiv CS.LGΒ·May 25, 2026
Latent Cache Flow (LCF) is introduced as a new method for efficient model-to-model communication, addressing the latency and information loss of text-based LLM agent communication. LCF jointly translates and compresses keys and values, significantly reducing adapter size and transmitting a summary of new information for differing contexts.
Read original β