machine learning

790 items

DOC DEV.to AI·4/21/2026

MLOps in 2026: Production Machine Learning Best Practices

This article covers MLOps in 2026: core concepts, tools, implementation, and best practices for production machine learning. It includes industry growth projections, key statistics, and the expected technology stack.

future of AI MLOps machine learning AI best practices

ARTICLE Hugging Face Blog·22d ago

Introducing the Ettin Reranker Family

This article introduces the Ettin Reranker Family, a new set of models designed to enhance the relevance and quality of results in search and recommendation systems. The Ettin models aim to optimize document ordering, offering improved performance in information retrieval tasks.

AI models machine learning Reranking Information Retrieval

ARTICLE DEV.to AI·4/13/2026

Understanding Transformers Part 6: Calculating Similarity Between Queries and Keys

This article details the calculation of similarity between queries and keys in Transformers using the dot product, illustrating how a word's similarity to itself is higher than to other words. It explains that these scores are then transformed into meaningful weights via a softmax function.

machine learning Dot Product NLP AI
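
As a rough illustration of the query-key similarity step summarized in the item above, here is a minimal sketch in plain Python. The words, the 2-d vectors, and the helper names (`dot`, `softmax`, `attention_weights`) are made up for this example and are not taken from the article; real Transformers use learned, higher-dimensional vectors and typically scale the scores before the softmax.

```python
import math

def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical query and key vectors for two words (illustrative values only).
queries = {"lets": [1.0, 0.5], "go": [-0.5, 1.0]}
keys = {"lets": [1.2, 0.4], "go": [-0.3, 1.1]}

def attention_weights(word):
    """Score one word's query against every key, then normalize with softmax."""
    scores = [dot(queries[word], keys[k]) for k in keys]
    return dict(zip(keys, softmax(scores)))

weights = attention_weights("lets")
print(weights)
```

With these particular vectors, the query for "lets" scores higher against its own key than against the key for "go", so its softmax weight is the larger of the two. That outcome depends on the chosen values and is not guaranteed for arbitrary vectors.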

RESEARCH arXiv CS.AI·4/9/2026

Toward Reducing Unproductive Container Moves: Predicting Service Requirements and Dwell Times

This article presents a data science study at a container terminal aimed at reducing unproductive moves. It develops machine learning models to predict containers' service requirements and dwell times, outperforming existing heuristics.

logistics machine learning data science Container Terminal

RESEARCH arXiv CS.LG·4/6/2026

Modeling and Controlling Deployment Reliability under Temporal Distribution Shift

This article proposes a deployment-centric framework for modeling the reliability of machine learning models in non-stationary environments, where temporal distribution shift can degrade performance. The framework treats reliability as a dynamic state and frames deployment adaptation as a multi-objective control problem that balances stability against intervention cost.

deployment temporal distribution shift volatility intervention cost

RESEARCH arXiv CS.AI·4/30/2026

Distill-Belief: Closed-Loop Inverse Source Localization and Characterization in Physical Fields

The Distill-Belief framework addresses the challenge of efficient and accurate inverse source localization and characterization (ISLC) for mobile agents by balancing correctness and efficiency. It proposes a teacher-student model, where a Bayes-correct particle filter teacher guides a compact student for fast, uncertainty-aware decision-making in real-time.

machine learning AI Uncertainty Estimation robotics

RESEARCH arXiv CS.AI·4/30/2026

Hierarchical Multi-Persona Induction from User Behavioral Logs: Learning Evidence-Grounded and Truthful Personas

This paper proposes a hierarchical framework to induce multiple evidence-grounded user personas from behavioral logs by clustering intent memories and optimizing persona quality. The method utilizes a groupwise extension of Direct Preference Optimization (DPO) and demonstrates more coherent, truthful personas, also improving future interaction prediction.

Optimization LLMs machine learning persona generation

RESEARCH arXiv CS.AI·4/30/2026

Auto-Relational Reasoning

Researchers propose a novel theoretical framework for automated relational reasoning, integrating Machine Learning with rigid reasoning to surpass the limitations of current large models. The resulting system demonstrates high performance on IQ problems, achieving a 98.03% solving rate without prior knowledge.

neural networks machine learning Reasoning problem solving

RESEARCH arXiv CS.LG·4/30/2026

Rethinking KV Cache Eviction via a Unified Information-Theoretic Objective

This work rethinks KV cache eviction for LLMs using an information-theoretic objective derived from the Information Bottleneck principle. It introduces CapKV, a new capacity-aware method that preserves information, outperforming existing heuristic strategies.

Memory Optimization machine learning large language models AI inference

RESEARCH arXiv CS.LG·4/30/2026

A Randomized PDE Energy driven Iterative Framework for Efficient and Stable PDE Solutions

This work introduces a PDE energy-driven iterative framework for solving partial differential equations efficiently and stably, without relying on traditional matrix-based discretizations or costly data-driven neural network training. It evolves random initial fields through physically constrained diffusion iterations and Gaussian smoothing, strictly enforcing boundary conditions, and demonstrates stable convergence on Poisson, Heat, and viscous Burgers equations.

Numerical Methods machine learning Scientific Computing Algorithms

RESEARCH arXiv CS.AI·4/30/2026

OMEGA: Optimizing Machine Learning by Evaluating Generated Algorithms

OMEGA is an end-to-end framework automating AI research, from idea generation to executable code, using structured meta-prompt engineering and code generation. It has produced novel ML classifiers that outperform scikit-learn baselines across 20 benchmark datasets.

Meta-Learning machine learning code generation Algorithms

RESEARCH arXiv CS.AI·4/30/2026

DreamProver: Evolving Transferable Lemma Libraries via a Wake-Sleep Theorem-Proving Agent

DreamProver introduces an agentic framework using a wake-sleep program induction paradigm to discover reusable lemmas for formal theorem proving. It iteratively evolves a compact, transferable lemma library, significantly improving performance on unseen theorems.

Theorem Proving machine learning Automated reasoning artificial intelligence

RESEARCH arXiv CS.LG·4/30/2026

A Multimodal and Explainable Machine Learning Approach to Diagnosing Multi-Class Ejection Fraction from Electrocardiograms

This research developed a multimodal machine-learning framework combining ECG features and EHR data to diagnose multi-class left ventricular ejection fraction. The model achieved high AUROCs and used SHAP for explainability, outperforming baseline models.

machine learning Explainable AI medical AI

RESEARCH arXiv CS.AI·5/6/2026

Accelerating battery research with an AI interface between FINALES and Kadi4Mat

This study optimizes sodium-ion coin cell formation protocols for duration efficiency and end-of-life performance, utilizing an AI interface between FINALES and Kadi4Mat. The framework employs multi-objective batched Bayesian optimization to guide experiment selection, aiming to accelerate discovery and reduce resource consumption.

Materials Science Optimization machine learning AI

RESEARCH arXiv CS.LG·5/6/2026

On the Invariants of Softmax Attention

This research defines the "energy field" in softmax attention, revealing essential invariant properties. It distinguishes between mechanism-level invariants, derived from algebraic structure, and model-level regularities observed in autoregressive language models.

neural networks softmax machine learning NLP

RESEARCH arXiv CS.AI·5/6/2026

2026 Roadmap on Artificial Intelligence and Machine Learning for Smart Manufacturing

This roadmap provides a comprehensive perspective on the foundations, applications, and emerging directions of AI and ML in smart manufacturing. It addresses critical challenges such as industrial big data complexity and the demand for trustworthy operations in high-stakes industrial environments.

industrial AI data management smart manufacturing AI roadmap

RESEARCH arXiv CS.LG·5/6/2026

An End-to-End Framework for Building Large Language Models for Software Operations

This paper introduces OpsLLM, an end-to-end framework for building large language models (LLMs) specifically for software operations. It addresses challenges like low-quality data and fragmented knowledge, detailing a workflow that includes data curation, supervised fine-tuning, and a domain process reward model.

LLMs AI frameworks Domain-Specific AI machine learning

RESEARCH arXiv CS.LG·5/6/2026

Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning

This survey provides an optimizer-agnostic view of rollout strategies for RL-based post-training of reasoning LLMs. It formalizes rollout pipelines with a unified notation and introduces the Generate-Filter-Control-Replay (GFCR) lifecycle taxonomy, decomposing pipelines into four modular stages.

Rollout Strategies reinforcement learning machine learning AI research

NEWS ↑ trending Reddit r/LocalLLaMA·4/20/2026

Kimi K2.6 Released (huggingface)

Kimi K2.6 has been released on Hugging Face, making a new version of Kimi available to the community.

machine learning AI Model

ARTICLE Together AI Blog·4/24/2026

Accelerate RL rollouts by up to 50% with distribution-aware speculative decoding

DAS (distribution-aware speculative decoding) addresses the rollout bottleneck in RL post-training. It accelerates rollouts by up to 50% without compromising reward quality.

Optimization AI acceleration reinforcement learning machine learning