representation learning

23 items

RESEARCHarXiv CS.LG·1d ago

Principles and Practice of Deep Representation Learning: or a Mathematical Theory of Memory

This book aims to demystify large deep networks and generative models, often perceived as "black boxes," by exploring their internal mechanisms through the lens of representation learning. It outlines how modern neural network architectures are designed, utilizing optimization and information theory.

neural networks deep learning learning generative models

RESEARCH↑ trendingReddit r/MachineLearning·4/30/2026

[R] Joint Embedding Variational Bayes (TMLR ’26)

This TMLR paper introduces operational variational semantics to joint-embedding architectures for non-contrastive representation learning. It does so by factorizing embedding likelihood, anchoring posterior uncertainty to likelihood scale, and employing a heavy-tailed Student-t likelihood for empirical benefits.

Variational Inference Deep Learning Models machine learning representation learning

RESEARCHDEV.to AI·2d ago

Subject-Aware Contrastive Learning for Biosignals

This research focuses on Subject-Aware Contrastive Learning, a novel AI technique developed for the effective processing and understanding of biosignals. It aims to improve the representation learning of complex biological data, offering advancements in the analysis of physiological measurements.

contrastive learning learning machine learning biosignals

ARTICLEDEV.to AI·4/22/2026

Autoencoders and Representation Learning in Vision

Autoencoders are neural networks that compress data into a lower-dimensional space and reconstruct the original input, learning non-linear structures unlike linear PCA. Their two-stage design features an encoder that projects input data into a latent space to extract informative features.

neural networks deep learning autoencoders machine learning

ARTICLEDEV.to AI·4/11/2026

Sparse Federated Representation Learning for deep-sea exploration habitat design for low-power autonomous deployments

The author explores federated learning to overcome latency challenges in voluminous sensor data from multi-robotic autonomous vehicles, optimizing processing in low-bandwidth environments. This approach seeks a distributed alternative to centralized data synchronization through distributed model updates.

Autonomous systems Distributed AI Deep-sea exploration federated learning

RESEARCHarXiv CS.AI·27d ago

Don't Look at the Numbers: Visual Anchoring Bias and Layer-wise Representation in VLMs

This research paper demonstrates that embedded numeric anchors on images systematically bias Vision-Language Model quality judgments across multiple VLMs. Layer-wise probing reveals that optimal layers for quality prediction are deeper than where anchor classification saturates, establishing a causal account of visual anchoring bias.

neural networks Vision-Language Models Model Evaluation representation learning

RESEARCHDEV.to AI·5/9/2026

Anticipating Visual Representations from Unlabeled Video

This content explores methods for anticipating visual representations from unlabeled video. The research investigates models' ability to learn visual features without explicit supervision, enhancing contextual understanding in video sequences.

computer vision representation learning video-analysis unsupervised learning

RESEARCHarXiv CS.LG·5d ago

Bayes-Sufficient Representations in Supervised Learning

This work defines Bayes-sufficient representations for supervised learning, focusing on information relevant for prediction based on a fixed decision problem and loss function. It introduces the concept of a Bayes quotient and connects the framework to property elicitation, showing how different loss functions require specific Bayes-optimal actions.

learning Bayesian theory supervised learning representation learning

RESEARCHarXiv CS.LG·4/21/2026

SetFlow: Generating Structured Sets of Representations for Multiple Instance Learning

This work introduces SetFlow, a generative architecture that models entire Multiple Instance Learning (MIL) bags directly in the representation space. It leverages the flow matching paradigm and a Set Transformer-inspired design to capture intra-bag dependencies and generate coherent, semantically consistent representations.

machine-learning-models Multiple Instance Learning representation learning Generative AI

ARTICLEDEV.to AI·5/1/2026

Universal representations:The missing link between faces, text, planktons, andcat breeds

This content delves into the concept of universal representations in AI, proposing them as the missing link to connect and process extremely diverse data types. The goal is to establish a unified framework capable of understanding everything from human faces and text to planktons and cat breeds.

AI models multimodal AI universal representations representation learning

RESEARCHarXiv CS.LG·4/13/2026

Silhouette Loss: Differentiable Global Structure Learning for Deep Representations

This paper introduces Soft Silhouette Loss, a novel differentiable objective for deep learning, inspired by the classical silhouette coefficient. It aims to learn discriminative representations by enforcing intra-class compactness and inter-class separation more efficiently than existing metric learning approaches.

Classification metric learning deep learning loss functions

RESEARCHarXiv CS.LG·5/8/2026

Data-Driven Variational Basis Learning Beyond Neural Networks: A Non-Neural Framework for Adaptive Basis Discovery

This manuscript introduces Data Driven Variational Basis Learning (DVBL), a novel non-neural framework for learning data-adaptive basis functions directly from high-dimensional data. It provides an explicit, interpretable, and mathematically transparent alternative to neural networks for representation learning, addressing their limitations in control and transparency.

variational methods Optimization machine learning data science

RESEARCHarXiv CS.LG·4/13/2026

Distilling Genomic Models for Efficient mRNA Representation Learning via Embedding Matching

This paper introduces a distillation framework to make large genomic foundation models for mRNA representation learning more efficient, reducing model size by 200-fold. By using embedding-level distillation, the smaller model achieves state-of-the-art performance on mRNA-related tasks, demonstrating an effective strategy for scalable biological AI.

mRNA Foundation Models Model Distillation representation learning

RESEARCHarXiv CS.CL·4/10/2026

Enabling Intrinsic Reasoning over Dense Geospatial Embeddings with DFR-Gemma

O conteúdo descreve o DFR-Gemma, um novo framework que permite que LLMs raciocinem diretamente sobre embeddings geoespaciais densos. Ele alinha embeddings de alta dimensão com o espaço latente de um LLM através de um projetor leve, injetando-os como tokens semânticos.

Geospatial AI LLMs Geospatial Embeddings Spatio-temporal Data

RESEARCHarXiv CS.CL·5/5/2026

H-Probes: Extracting Hierarchical Structures From Latent Representations of Language Models

This paper introduces H-probes, linear probes designed to extract hierarchical structure, specifically depth and pairwise distance, from the latent representations of large language models. The research shows these probes robustly find low-dimensional subspaces crucial for performance in synthetic tree traversal tasks, generalizing well both within and out-of-domain.

language models hierarchical reasoning representation learning AI Research

RESEARCHarXiv CS.LG·4/16/2026

The Long Delay to Arithmetic Generalization: When Learned Representations Outrun Behavior

This research investigates the 'grokking' phenomenon in transformers, finding that the long delay to generalization in arithmetic models stems from a decoder bottleneck. The encoder acquires relevant structural knowledge early, but the decoder struggles to access it, a hypothesis supported by causal interventions like transplanting encoders.

grokking machine learning representation learning Transformers

RESEARCHarXiv CS.LG·25d ago

A Unified Geometric Framework for Weighted Contrastive Learning

Contrastive learning aims to preserve relational structure in sample representations by reflecting a similarity graph. This paper interprets weighted InfoNCE objectives as Distance Geometry Problems, providing a unified geometric framework and exact characterizations of optimal embeddings, revealing how class imbalance affects inter-class similarities in SupCon.

neural networks contrastive learning machine learning geometry

RESEARCHarXiv CS.LG·5/7/2026

Transformation Categorization Based on Group Decomposition Theory Using Parameter Division

This research explores unsupervised categorization of transformations between input pairs using algebraic constraints, aiming for a principled understanding of good representations. It introduces parameter division to refine prior Galois-theoretic methods, addressing their reliance on auxiliary assumptions and improving the decomposition of groups.

neural networks algebra group theory representation learning

RESEARCHarXiv CS.LG·11d ago

Emergent Semantic Representations in World Models through Physical Interaction without Linguistic Supervision

This research explores how world models learn semantic representations from physical exploration without linguistic input. It finds that the latent space develops spatial semantic structures mirroring physical geometry, with semantic alignment improving alongside prediction performance.

machine learning World Models embodied AI representation learning

RESEARCHarXiv CS.LG·12d ago

Tackling Multimodal Learning Challenges with Mixture-of-Expert: A Survey

This paper presents a survey addressing multimodal learning challenges with the Mixture-of-Experts (MoE) architecture. The study explores how MoE functions as an efficient engine and a representation learner for integrating diverse data modalities. It fills a gap in the literature by offering a comprehensive and systematic review on the topic.

multimodal learning Survey Mixture of Experts AI