← heapsort-ai

MLLMs

7 items

RESEARCHarXiv CS.LG·4/21/2026

SaFeR-Steer: Evolving Multi-Turn MLLMs via Synthetic Bootstrapping and Feedback Dynamics

SaFeR-Steer is a novel framework designed to improve the safety alignment of Multi-modal Large Language Models (MLLMs) in multi-turn dialogues, addressing challenges like escalating unsafe intent and long-context safety decay. It employs synthetic bootstrapping and feedback dynamics, while also releasing the STEER dataset for training and evaluation.

27
RESEARCHarXiv CS.CL·12d ago

ICG: Improving Cover Image Generation via MLLM-based Prompting and Personalized Preference Alignment

The paper proposes ICG, a novel framework for personalized cover image generation that integrates MLLM-based prompting with preference alignment. It utilizes semantic features and user embeddings to contextualize the diffusion model and adopts a multi-reward learning strategy to address the lack of labeled supervision.

27