information theory

15 items

ARTICLE3Blue1Brown (YouTube)·2d ago

Reinventing Entropy | Compression & Intelligence Part 1

This article explores the relationship between entropy, compression, and intelligence, serving as the first part of a series. It aims to redefine the understanding of these fundamental concepts.

information theory intelligence AI compression

Reinventing Entropy | Compression & Intelligence Part 1

RESEARCHarXiv CS.CL·18d ago

PromptNCE: Pointwise Mutual Information Predictions Using Only LLMs and Contrastive Estimation Prompts

This paper introduces PromptNCE, a method to estimate pointwise mutual information (PMI) using only LLMs and contrastive estimation prompts, circumventing the need for task-specific critics. It presents a benchmark with human-derived PMI and shows PromptNCE achieves Spearman correlation up to 0.82.

information theory LLMs prompt engineering machine learning

RESEARCHDEV.to AI·4d ago

Detection in the stochastic block model with multiple clusters: proof of theachievability conjectures, acyclic BP, and the infor

This paper explores detection within the stochastic block model with multiple clusters, providing proofs for achievability conjectures. It also discusses acyclic Belief Propagation and information-theoretic aspects of the model.

information theory stochastic block model machine learning graph theory

RESEARCHDEV.to AI·4/18/2026

Braille-D-FUMT8 vs CLIP / BERT / ImageBind: a Rigorous Information-Theoretic Comparison

This article, a re-publication of Rei-AIOS Paper 110, presents a rigorous information-theoretic comparison between the Braille-Unicode × D-FUMT8 encoding and multimodal embedding schemes like CLIP, BERT, and ImageBind. The research explores representing 256 philosophical states within a single 3-byte UTF-8 character.

information theory AI models multimodal AI NLP

RESEARCHarXiv CS.CL·4/9/2026

The Stepwise Informativeness Assumption: Why are Entropy Dynamics and Reasoning Correlated in LLMs?

Este artigo investiga a correlação entre a dinâmica interna de entropia e o raciocínio correto em Large Language Models (LLMs), um enigma ainda sem solução. Propõe a Hipótese de Informatividade Gradual (SIA), que afirma que os modelos raciocinam corretamente ao acumular informações relevantes sobre a resposta por meio de prefixos informativos, um processo reforçado por métodos de treinamento padrão.

information theory LLMs machine learning Reasoning

RESEARCHarXiv CS.AI·12d ago

On the Origin of Synthetic Information by Means of Steganographic Inheritance

This research paper posits the origin of synthetic information as a core mystery in information science, drawing an analogy to the origin of species. It introduces a steganographic inheritance mechanism to help trace the evolutionary lineage of AI-generated synthetic information, acknowledging the moral implications and technical challenges.

information theory synthetic data steganography AI ethics

RESEARCHarXiv CS.CL·4/16/2026

Bi-Predictability: A Real-Time Signal for Monitoring LLM Interaction Integrity

This paper introduces bi-predictability (P) and the Information Digital Twin (IDT) architecture for real-time monitoring of LLM interaction integrity. It aims to continuously ensure structural coupling in multi-turn workflows, addressing the shortcomings of current evaluation methods that fail to detect gradual degradation.

information theory monitoring evaluation real-time AI

RESEARCHDEV.to AI·4/26/2026

FIDT as a Domain-Specific Generator: A Honest Reframing of Fujimoto Infinite Dot Theory (Paper 140)

This article reframes the Fujimoto Infinite Dot Theory (FIDT) from a universal codec to a domain-specific generator for D-FUMT₈ theories. This new positioning, developed with Claude Opus 4.7's collaboration, achieves byte-exact reconstruction and high compression.

information theory research large language models compression

RESEARCHarXiv CS.LG·19d ago

Neural Estimation of Pairwise Mutual Information in Masked Discrete Sequence Models

The paper proposes a neural framework to estimate pairwise conditional mutual information (MI) directly from the hidden states of pretrained masked diffusion models (MDMs). This method captures dependency structures and enables MI-guided parallel decoding, showing utility in Sudoku and protein sequence generation by recovering known structural constraints.

neural networks information theory machine learning sequence models

RESEARCHarXiv CS.LG·5/4/2026

Information-Theoretic Generalization Bounds for Stochastic Gradient Descent with Predictable Virtual Noise

This paper introduces predictable history-adaptive virtual perturbations to enhance information-theoretic generalization bounds for Stochastic Gradient Descent. This new approach allows perturbation covariances to dynamically depend on past SGD history, addressing limitations of existing methods that require fixed covariances.

information theory Optimization Generalization machine learning

ARTICLEDEV.to AI·4/15/2026

Notes on Kullback-Leibler Divergence and Likelihood

This content explores notes on Kullback-Leibler Divergence and its relationship with the concept of Likelihood. It covers fundamental principles of information theory and statistical inference relevant to AI.

information theory Likelihood Machine Learning Theory Kullback-Leibler Divergence

RESEARCHarXiv CS.AI·4/21/2026

The Query Channel: Information-Theoretic Limits of Masking-Based Explanations

This paper formulates masking-based AI explanation methods as communication over a query channel, where explanations act as messages. It derives information-theoretic limits for the recovery of exact explanations, showing that reliable recovery is achievable below a certain capacity.

information theory AI models Explainability feature importance

DOCTowards Data Science·2/3/2025

Quantifying Uncertainty — A Data Scientist’s Intro To Information Theory — Part 2/5: Entropy

This content provides an intuitive understanding of Entropy and its applications in Machine Learning and Data Analysis. It also includes Python code examples to facilitate learning.

information theory learning machine learning Data Analysis

DOCTowards Data Science·2/3/2025

Quantifying Surprise — A Data Scientist’s Intro To Information Theory — Part 1/5: Foundations

This content provides an introduction to Information Theory, focusing on its applications in Machine Learning and Data Analysis. Python code is included to aid understanding.

information theory learning machine learning Data Analysis

ARTICLEDEV.to AI·4/11/2026

The Translation Loss

The text discusses the long history of indirect and distorted communication between the US and Iran through intermediaries. Current negotiations in Islamabad represent a pioneering attempt at direct dialogue to correct decades of 'translation loss'.

information theory diplomacy international relations Communication