← heapsort-ai

machine learning

781 items

RESEARCHarXiv CS.LG·4/21/2026

UniMamba: A Unified Spatial-Temporal Modeling Framework with State-Space and Attention Integration

UniMamba is a new unified spatial-temporal forecasting framework that integrates efficient state-space dynamics with attention-based dependency learning to tackle multivariate time series challenges. It employs a Mamba Variate-Channel Encoding Layer and a Spatial Temporal Attention Layer to capture both global temporal dependencies and inter-variate correlations.

33
ARTICLEDEV.to AI·4/22/2026

DLSS 5 is not a failure. The Future of rendering: A deep technical look at new approaches after 15 years in Game Development

The article, written by an experienced technical director in ML and game development, provides a deep technical look into the future of rendering in the gamedev industry. It discusses recent shifts in rendering architecture, inspired by the announcement of Nvidia's DLSS 5, moving beyond traditional hardware improvements towards new technical approaches.

32
RESEARCHarXiv CS.LG·4/16/2026

Pareto-Optimal Offline Reinforcement Learning via Smooth Tchebysheff Scalarization

This paper introduces STOMP, a novel offline reinforcement learning algorithm for multi-objective optimization using smooth Tchebysheff scalarization. It addresses the limitation of linear scalarization in recovering non-convex Pareto fronts, crucial for aligning large language models and other real-world applications with conflicting rewards.

31
RESEARCHarXiv CS.CL·4/16/2026

A Multi-Model Approach to English-Bangla Sentiment Classification of Government Mobile Banking App Reviews

This study classifies sentiment in English and Bangla reviews of Bangladeshi government mobile banking apps, using a hybrid labeling approach for 5,652 reviews. It found that traditional machine learning models like Random Forest and Linear SVM significantly outperformed fine-tuned XLM-RoBERTa for this specific task.

31
ARTICLEDEV.to AI·4/18/2026

Part 2: The Data — Building the First Public Coffee Roasting Audio Dataset with Warp/Oz

This article describes the creation of the first public audio dataset for coffee roasting first crack detection, addressing a significant gap in available resources. The dataset, comprising 973 annotated 10-second segments, was meticulously built from scratch and led to a model achieving 100% precision thanks to careful data splitting and loss weighting.

31
RESEARCHarXiv CS.LG·4/22/2026

Discrete Tilt Matching

Discrete Tilt Matching (DTM) is a novel likelihood-free method for fine-tuning masked diffusion large language models (dLLMs), addressing the intractability of sequence-level marginal likelihoods in RL. It recasts fine-tuning as state-level matching, using a weighted cross-entropy objective with control variates for stability, and achieves strong results on various tasks like Sudoku and Countdown.

30
ARTICLEDEV.to AI·4/25/2026

My AI Agent Over-Corrected Itself — So I Built Metabolic Regulation

The author details how their AI agent, with an Active Inference perception pipeline, learned a correction rule that led to over-correction, causing it to misclassify human speech. This incident highlights the challenge of building robust regulation mechanisms in AI systems to prevent over-generalization and suggests a need for more metabolic control.

30