← heapsort-ai

self-supervised learning

10 items

RESEARCH↑ trendingReddit r/MachineLearning·4/17/2026

Low accuracy (~50%) with SSL (BYOL/MAE/VICReg) on hyperspectral crop stress data — what am I missing? [R]

The content details a persistent problem with achieving low accuracy (~50%) using self-supervised learning methods like BYOL, MAE, and VICReg for hyperspectral crop stress detection. Despite trying various techniques, performance remains barely better than random for three classes, leading to suspicions about data separability or SSL method suitability.

42
RESEARCHDEV.to AI·3d ago

Self-Supervised Temporal Pattern Mining for bio-inspired soft robotics maintenance with zero-trust governance guarantees

The article explores the maintenance of bio-inspired soft robotics, addressing the non-linear nature of material degradation that challenges traditional maintenance schedules. The research focuses on Self-Supervised Temporal Pattern Mining to identify precursor signals to failure and integrate zero-trust governance guarantees.

31
RESEARCHarXiv CS.CL·4d ago

Predict and Reconstruct: Joint Objectives for Self-Supervised Language Representation Learning

This paper introduces a hybrid pre-training objective for text encoders, combining a JEPA-style latent-space prediction loss with a standard Masked Language Modelling (MLM) objective. This new approach aims to encourage representations anchored to deeper semantic structure rather than just surface-form token identity, showing significantly more uniform embeddings.

30
RESEARCHarXiv CS.CL·4/10/2026

Lexical Tone is Hard to Quantize: Probing Discrete Speech Units in Mandarin and Yor\`ub\'a

O artigo investiga como as unidades discretas de fala (DSUs), derivadas de modelos SSL, codificam o tom lexical, descobrindo que elas o fazem de forma menos confiável do que a estrutura segmental. Embora as representações latentes do SSL codifiquem o tom, a quantização tende a priorizar a estrutura fonética, um problema demonstrado em mandarim e iorubá que persiste com vários métodos.

27
RESEARCHarXiv CS.AI·27d ago

RankQ: Offline-to-Online Reinforcement Learning via Self-Supervised Action Ranking

RankQ is an offline-to-online reinforcement learning objective designed to enhance sample efficiency by leveraging pre-collected datasets. It mitigates issues with inaccurate critics and limited data coverage by using a self-supervised multi-term ranking loss, which enforces structured action ordering and directs the Q-function towards higher-quality actions.

27
RESEARCHarXiv CS.LG·4/8/2026

PRIME: Prototype-Driven Multimodal Pretraining for Cancer Prognosis with Missing Modalities

PRIME é um novo framework de pré-treinamento multimodal auto-supervisionado projetado para prognóstico de câncer, que aborda o desafio de modalidades de dados ausentes em coortes clínicas. Ele integra imagens de histopatologia, expressão gênica e relatórios patológicos, aprendendo representações robustas por meio de imputação semântica no espaço latente e objetivos de alinhamento intermodal.

27
RESEARCHarXiv CS.LG·4/6/2026

UI-Oceanus: Scaling GUI Agents with Synthetic Environmental Dynamics

UI-Oceanus é uma estrutura que escala agentes GUI generalistas, focando em dominar a física da interação através de feedback ambiental em vez de imitar trajetórias. O sistema utiliza exploração autônoma e predição de dinâmicas futuras para construir um modelo de mundo interno robusto, superando limitações de dados e supervisão.

27