machine learning

790 items

ARTICLEDEV.to AI·21d ago

Building an Inference OS: deterministic-first router for prediction markets

This article details the construction of a deterministic-first inference router for prediction markets, designed to reduce reliance on expensive LLMs. It leverages a 6-hook system including market regime classification, anomaly detection, and confidence decay to efficiently process market questions.

Prediction Markets machine learning AI system architecture

RESEARCHarXiv CS.CL·4/16/2026

A Multi-Model Approach to English-Bangla Sentiment Classification of Government Mobile Banking App Reviews

This study classifies sentiment in English and Bangla reviews of Bangladeshi government mobile banking apps, using a hybrid labeling approach for 5,652 reviews. It found that traditional machine learning models like Random Forest and Linear SVM significantly outperformed fine-tuned XLM-RoBERTa for this specific task.

Multilingual AI machine learning Natural Language Processing sentiment analysis

ARTICLEDEV.to AI·4/18/2026

Part 2: The Data — Building the First Public Coffee Roasting Audio Dataset with Warp/Oz

This article describes the creation of the first public audio dataset for coffee roasting first crack detection, addressing a significant gap in available resources. The dataset, comprising 973 annotated 10-second segments, was meticulously built from scratch and led to a model achieving 100% precision thanks to careful data splitting and loss weighting.

Dataset audio processing data engineering machine learning

RESEARCHarXiv CS.LG·4/22/2026

Discrete Tilt Matching

Discrete Tilt Matching (DTM) is a novel likelihood-free method for fine-tuning masked diffusion large language models (dLLMs), addressing the intractability of sequence-level marginal likelihoods in RL. It recasts fine-tuning as state-level matching, using a weighted cross-entropy objective with control variates for stability, and achieves strong results on various tasks like Sudoku and Countdown.

Diffusion Models LLMs reinforcement learning machine learning

RESEARCHarXiv CS.CL·4/16/2026

Text-as-Signal: Quantitative Semantic Scoring with Embeddings, Logprobs, and Noise Reduction

This paper introduces a practical pipeline to convert text corpora into quantitative semantic signals, employing embeddings, logprob-based evaluation, and noise reduction. The case study applies six semantic dimensions to Portuguese news articles about AI, supporting AI engineering tasks such as corpus inspection and monitoring.

machine learning NLP embeddings semantic analysis

RESEARCHarXiv CS.LG·20d ago

GROW: Aligning GRPO with State-Action Modeling for Open-World VLM Agents

This paper introduces GROW, an RL framework for open-world VLM agents, addressing limitations of existing Supervised Fine-Tuning methods. It proposes a novel approach for Group Relative Policy Optimization (GRPO) by decomposing trajectories into state-action samples rather than full entities.

VLM Agents Policy optimization Open-world AI reinforcement learning

ARTICLEDEV.to AI·4/25/2026

My AI Agent Over-Corrected Itself — So I Built Metabolic Regulation

The author details how their AI agent, with an Active Inference perception pipeline, learned a correction rule that led to over-correction, causing it to misclassify human speech. This incident highlights the challenge of building robust regulation mechanisms in AI systems to prevent over-generalization and suggests a need for more metabolic control.

AI agent machine learning Active Inference feedback loop

DOCDEV.to AI·4/16/2026

Setting Up JupyterHub on a Cloud GPU Server

This guide details setting up JupyterHub on a cloud GPU server to enable collaborative, multi-user environments for AI projects. It explains how JupyterHub manages individual Jupyter notebook servers, providing shared access to significant computational power.

Cloud GPU machine learning data science AI

ARTICLEKDNuggets·4d ago

A Deep Dive into Calibration of Language Models: Platt Scaling, Isotonic Regression, Temperature Scaling

This content explores three post-hoc methods—Platt Scaling, Isotonic Regression, and Temperature Scaling—designed to enhance the calibration of language models. These techniques aim to reduce the disparity between a model's predicted confidence and its actual accuracy.

language models Calibration learning machine learning

A Deep Dive into Calibration of Language Models: Platt Scaling, Isotonic Regression, Temperature Scaling

RESEARCHDEV.to AI·4d ago

Detection in the stochastic block model with multiple clusters: proof of theachievability conjectures, acyclic BP, and the infor

This paper explores detection within the stochastic block model with multiple clusters, providing proofs for achievability conjectures. It also discusses acyclic Belief Propagation and information-theoretic aspects of the model.

information theory stochastic block model machine learning graph theory

DOCDEV.to AI·27d ago

How Neural Networks Work — From Perceptrons to Backpropagation

Neural networks transform input into output through layers, adjusting internal values like weights and biases to learn from mistakes. The fundamental learning process involves forward propagation, loss computation, and backpropagation to refine outputs.

neural networks deep learning learning machine learning

RESEARCHDEV.to AI·4/24/2026

subgraph2vec: Learning Distributed Representations of Rooted Sub-graphs fromLarge Graphs

This research introduces `subgraph2vec`, a novel method for learning distributed representations of rooted sub-graphs extracted from large graphs. It aims to embed complex graph structures into a lower-dimensional vector space, facilitating various downstream machine learning tasks on graph data.

Graph Neural Networks machine learning graph embeddings

ARTICLEDEV.to AI·4/22/2026

Architecting Predictive Intelligence for Smart Venues

The content details the technical architecture of predictive intelligence for smart venues, explaining high-frequency data collection from various sensors. It describes storage in time-series databases and data lakes, and the use of ML models to forecast events like crowd surges and equipment failures.

IoT machine learning predictive AI Data Architecture

RESEARCHDEV.to AI·4/10/2026

Cross-Modal Knowledge Distillation for planetary geology survey missions with ethical auditability baked in

O texto narra a jornada de pesquisa do autor em destilação de conhecimento cross-modal com auditabilidade ética, impulsionada pela observação de que IAs de classificação mineral podem tomar decisões tecnicamente corretas, mas eticamente ingênuas. O objetivo é desenvolver sistemas de IA que sejam precisos e eticamente robustos para missões de pesquisa geológica planetária.

Knowledge Distillation Autonomous systems machine learning Planetary Geology

RESEARCHarXiv CS.CL·5d ago

Predict and Reconstruct: Joint Objectives for Self-Supervised Language Representation Learning

This paper introduces a hybrid pre-training objective for text encoders, combining a JEPA-style latent-space prediction loss with a standard Masked Language Modelling (MLM) objective. This new approach aims to encourage representations anchored to deeper semantic structure rather than just surface-form token identity, showing significantly more uniform embeddings.

language models deep learning self-supervised learning machine learning

DOCDEV.to AI·4/15/2026

Clide

Clide is a tool featuring a core AI engine that provides command suggestions, code completion, and error detection in terminals. It leverages machine learning frameworks like TensorFlow/PyTorch and NLP libraries such as NLTK/spaCy to process and understand user interaction.

Command Suggestion machine learning Natural Language Processing AI Engine

ARTICLE↑ trendingReddit r/MachineLearning·27d ago

EEML Summer School (Eastern European ML) - Anyone here got accepted? [D]

An individual has been accepted into the EEML Summer School in Montenegro and is seeking to connect with other accepted participants to coordinate stay and post-school plans. They note that travel and accommodation logistics are proving difficult.

learning summer school machine learning

RESEARCHarXiv CS.LG·4/17/2026

Shapley Value-Guided Adaptive Ensemble Learning for Explainable Financial Fraud Detection with U.S. Regulatory Compliance Validation

This research addresses the challenge of explainability in AI for financial fraud detection, crucial for U.S. regulatory compliance. It introduces the SHAP-Guided Adaptive Ensemble (SGAE) method, which dynamically adjusts ensemble weights based on SHAP attribution agreement, achieving high performance and transparency.

regulatory compliance Financial services machine learning Explainable AI

ARTICLEDEV.to AI·4/10/2026

How We Built an AI That Explains Every Crypto Trade It Makes

Este artigo detalha a construção de uma plataforma de trading de criptomoedas onde uma IA explica cada operação em linguagem simples, abordando a falta de transparência dos bots tradicionais. A arquitetura técnica inclui pontuação de sinais com LightGBM, explicações geradas por Ollama/Llama 3.1 8B e um sistema de gerenciamento de portfólio.

cryptocurrency machine learning trading bots AI

RESEARCHarXiv CS.AI·5/9/2026

ZAYA1-8B Technical Report

ZAYA1-8B is a reasoning-focused mixture-of-experts (MoE) model with 700M active parameters, outperforming DeepSeek-R1-0528 on math and coding benchmarks. It was trained from scratch for reasoning on an AMD platform and uses a four-stage RL cascade for post-training.

AI models AI Training machine learning benchmarking