← heapsort-ai

image generation

60 items

ARTICLEDEV.to AI·17d ago

Why is Google's best image model called "Nano Banana"? And why are you using the wrong one?

This article delves into the confusing naming and usage of Google AI Studio's image models, highlighting six different models across two main families. It provides a practical guide on which model to use for different purposes and considers the price-vs-quality trade-off, including the "Nano Banana" model. The author aims to clarify why users might be using the wrong model due to hidden distinctions.

27
ARTICLEDEV.to AI·19d ago

Building an AI face-doppelganger prank with Flux Kontext Pro and aggressive image degradation

This article details the technical build of an AI "face-doppelganger" prank, utilizing Flux Kontext Pro and Replicate models to generate plausible lookalikes. It covers the challenges of crafting prompts, applying aggressive image degradation, and navigating Vercel-serverless pitfalls to make AI-generated output appear as real photos of strangers.

27
RESEARCHarXiv CS.LG·5/7/2026

Lookahead Drifting Model

This paper proposes a "lookahead drifting model" for distribution mapping, which enhances image generation performance via one-step neural functional evaluation. The model computes a set of drifting terms sequentially at each training iteration, utilizing positive samples and model outputs to capture higher-order gradient information.

27
RESEARCHarXiv CS.CL·12d ago

ICG: Improving Cover Image Generation via MLLM-based Prompting and Personalized Preference Alignment

The paper proposes ICG, a novel framework for personalized cover image generation that integrates MLLM-based prompting with preference alignment. It utilizes semantic features and user embeddings to contextualize the diffusion model and adopts a multi-reward learning strategy to address the lack of labeled supervision.

27