Autoregressive Models

5 items

RESEARCHarXiv CS.CL·12d ago

From AR to Diffusion: Efficiently Adapting Large Language Models with Strictly Causal and Elastic Horizons

FLUID is a new framework designed to efficiently adapt Autoregressive (AR) backbones to the diffusion paradigm for parallel text generation. It enables initialization from GPT-style models and introduces an entropy-driven mechanism called Elastic Horizons, achieving state-of-the-art performance with significantly reduced training costs.

Diffusion Models text generation large language models Autoregressive Models

RESEARCHarXiv CS.CL·26d ago

Differences in Text Generated by Diffusion and Autoregressive Language Models

This research explores the intrinsic differences in text generated by Diffusion Language Models (DLMs) and Autoregressive Language Models (ARMs), finding that DLMs show lower n-gram entropy but higher semantic coherence and diversity. Controlled experiments reveal that DLM training objectives enhance coherence and diversity through bidirectional context, while decoding algorithms are responsible for entropy reduction.

Diffusion Models language models NLP text generation

RESEARCHarXiv CS.AI·24d ago

Conditional Attribute Estimation with Autoregressive Sequence Models

This research introduces Conditional Attribute Transformers, a novel method for jointly estimating next-token probability and an attribute's value conditional on each potential next token selection. This framework enables critical capabilities like per-token credit assignment and counterfactual analysis within a single forward pass, overcoming limitations of traditional generative models.

deep learning generative models sequence models Conditional Attribute Estimation

RESEARCHarXiv CS.AI·21d ago

PRISMat: Policy-Driven, Permutation-Invariant Autoregressive Material Generation

This paper introduces PRISMat, a cost-effective, permutation-invariant model designed for the rapid identification of candidate materials. It addresses the inefficiencies of large language models in material generation by offering a faster and cheaper alternative for material filtering.

Materials Science AI models machine learning Computational Efficiency

RESEARCHYannic Kilcher (YouTube)·12/27/2025

TiDAR: Think in Diffusion, Talk in Autoregression (Paper Analysis)

This content provides an analysis of a research paper exploring the TiDAR model. The model integrates concepts of diffusion and autoregression for processing.

Diffusion Models AI models Paper analysis Machine learning research

TiDAR: Think in Diffusion, Talk in Autoregression (Paper Analysis)