Towards Scalable Lightweight GUI Agents via Multi-role Orchestration
This paper proposes the LAMO framework to address the challenge of deploying autonomous GUI agents powered by lightweight multimodal large language models (MLLMs) on resource-constrained devices. LAMO equips lightweight MLLMs with GUI-specific knowledge and improves their task scalability through multi-role orchestration.