machine learning

781 items

RESEARCH · arXiv CS.LG · 20h ago

MedicalRec: Medical recommender system for image classification without retraining

This study introduces MedicalRec, a medical recommender system for image classification, designed to optimize model selection without the need for extensive retraining. It addresses the computational and energy challenges of model identification by leveraging a publicly available dataset, MedicalRec-Bench, compiled from 3,000 articles and over 5,000 tested model records.

recommender systems deep learning machine learning healthcare AI

RESEARCH · arXiv CS.LG · 20h ago

Emergence via Phase Transitions: Mechanism Landscapes and Universal Convergence Across Complex Systems

This research introduces the Hierarchical Emergence Framework (HEF) to explain universal convergence across complex systems in machine learning, biology, and physics. HEF models emergence as a phase transition that yields a unique minimum-cost mechanism, and it proves convergence toward a fixed point.

complex systems phase transitions machine learning emergence

RESEARCH · arXiv CS.LG · 20h ago

Boundary Variance Inflation Causes Acquisition Bias in Gaussian Processes

This paper investigates inflated posterior variance near the boundary in Gaussian processes, tracing the root cause to the truncation of the kernel correlation neighborhood. It shows how this geometric distortion creates acquisition bias, affecting selection patterns across different acquisition classes, independent of objective functions.

Bayesian Optimization Gaussian Processes Statistical Models machine learning
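
The boundary effect described in this summary can be illustrated with a toy one-dimensional example. This is a minimal sketch, not the paper's setup: the squared-exponential kernel, the lengthscale of 0.1, the training grid, and the function names `rbf` and `posterior_variance` are all assumptions chosen for illustration. It compares the posterior variance at a query point near the edge of the data, which has neighbors on one side only, with the variance at an interior point with neighbors on both sides.

```python
import numpy as np

def rbf(a, b, lengthscale=0.1):
    """Squared-exponential kernel with unit prior variance (an assumed choice)."""
    d = a[:, None] - b[None, :]
    return np.exp(-0.5 * (d / lengthscale) ** 2)

def posterior_variance(x_train, x_query, lengthscale=0.1, jitter=1e-8):
    """GP posterior variance at x_query; it does not depend on observed values."""
    K = rbf(x_train, x_train, lengthscale) + jitter * np.eye(len(x_train))
    k_star = rbf(x_train, x_query, lengthscale)  # shape (n_train, n_query)
    # var(x*) = k(x*, x*) - k*^T K^{-1} k*, and k(x*, x*) = 1 for this kernel
    return 1.0 - np.sum(k_star * np.linalg.solve(K, k_star), axis=0)

# Hypothetical training grid on [0.1, 0.9] with spacing 0.1.
x_train = np.linspace(0.1, 0.9, 9)
# 0.05 lies just outside the grid; 0.45 sits midway between two training points.
x_query = np.array([0.05, 0.45])
boundary_var, interior_var = posterior_variance(x_train, x_query)
print(f"near boundary: {boundary_var:.4f}, interior: {interior_var:.4f}")
```

Both query points are 0.05 away from their nearest training point, so the gap in variance comes only from how many neighbors fall inside the kernel's correlation range. Because the posterior variance never uses the observed function values, the effect is the same for any objective.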

RESEARCH · arXiv CS.LG · 20h ago

When Should an AI Scientist Stop? Verifiable Experiment Steering and Refusal for Autonomous Discovery

This paper introduces CARTOGRAPH, a verification layer for AI scientists that integrates experiment steering, ambiguity closure, and inadequacy detection. It demonstrates superior performance over raw projection methods and successfully identifies and revokes out-of-library pharmacokinetic mechanisms, enhancing autonomous discovery.

experiment steering machine learning autonomous discovery Verification

RESEARCH · arXiv CS.AI · 20h ago

Improving Multimodal Reasoning via Worst Dimension Optimization

Multimodal reasoning requires maintaining integrity across diverse constraints such as visual grounding and logical consistency. Current Process Reward Models often hide failures in individual dimensions by weighting all dimensions equally, which compromises the overall reasoning process.

Optimization multimodal AI machine learning AI Reasoning
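
To see how equal weighting can hide a failing dimension, consider the small sketch below. It is not the paper's method: the dimension names, the scores, and the helper functions `mean_score` and `worst_dimension` are hypothetical, and the code only contrasts an equal-weight average with the score of the weakest dimension.

```python
def mean_score(scores):
    """Equal-weight aggregate over all dimensions."""
    return sum(scores.values()) / len(scores)

def worst_dimension(scores):
    """Return the name and score of the weakest dimension."""
    name = min(scores, key=scores.get)
    return name, scores[name]

# Hypothetical per-dimension scores for one reasoning step.
step = {
    "visual_grounding": 0.2,
    "logical_consistency": 0.95,
    "factuality": 0.95,
}

print(mean_score(step))       # the average looks acceptable
print(worst_dimension(step))  # the weakest dimension exposes the failure
```

The average stays near 0.7 even though visual grounding has clearly failed, while the minimum points directly at the failing dimension.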

RESEARCH · arXiv CS.CL · 20h ago

Bidirectional Small-Granularity Search between Code and Text

This research introduces a novel task: bidirectional small-granularity search between code and text, aiming to link scientific publications with corresponding code segments. It proposes a large dataset, partially generated by GPT-4, and a modular approach that achieves good in-domain results.

machine learning natural language processing Code Analysis Information Retrieval

RESEARCH · arXiv CS.CL · 20h ago

BEACON: Behavioral Entropy Aggregation for Cross-Model Hallucination Detection in Large Language Models

The paper introduces BEACON, a black-box framework for detecting hallucination in large language models by analyzing model outputs without internal access or external knowledge. It extracts a 31-dimensional feature vector, on which a gradient-boosted classifier achieves 0.8123 AUROC, outperforming existing baselines.

LLMs hallucination machine learning detection

ARTICLE · Amazon Web Services (YouTube) · 1d ago

AI-DLC explained- how AWS can accelerate AI in the SDLC | Amazon Web Services

This content explains how Amazon Web Services (AWS) can be used to accelerate the integration of Artificial Intelligence (AI) into the Software Development Life Cycle (SDLC).

cloud computing machine learning AWS AI development

NEWS · ↑ trending · Reddit r/MachineLearning · 4/19/2026

1,200 ICLR 2026 Papers with Public Code or Data [R]

A list of approximately 1,200 ICLR 2026 accepted papers with associated public code, data, or a demo link has been released. This resource, representing about 22% of the accepted papers, is available via Paper Digest and links directly to codebases which may become fully public closer to the conference in Rio de Janeiro.

machine learning open-source-code AI Research papers

RESEARCH · ↑ trending · Reddit r/MachineLearning · 4/22/2026

Training-time intervention yields 63.4% blind-pair human preference at matched val-loss (1.2B params, 320 judgments, p = 1.98 × 10⁻⁵) [R]

A training-time intervention for 1.2B-parameter LMs, using a precision-weighted gain function and divergence-scaled gradients, was preferred over standard training in 63.4% of 320 blind-pair human judgments (p = 1.98 × 10⁻⁵). Notably, this preference shift occurred at matched aggregate validation loss, indicating that training interventions beyond RLHF can be effective.

LLMs machine learning Human Preference training methods

ARTICLE · ↑ trending · Hacker News (AI) · 4d ago

Sakana AI's Recursive Self-Improvement (RSI) Lab

Sakana AI has launched its Recursive Self-Improvement (RSI) Lab, aiming to develop AI models capable of enhancing their own performance. This initiative focuses on foundational research to create more robust and adaptable AI systems.

Self-improvement AI Sakana AI machine learning AI development

ARTICLE · AWS Machine Learning Blog · 1d ago

End-to-end encrypted ML inference with Amazon SageMaker AI and FHE

This blog details end-to-end encrypted ML inference using Amazon SageMaker AI and FHE. It introduces a more flexible, higher-level approach based on concrete-ml, supporting several common models and being API compatible with scikit-learn.

security machine learning Amazon SageMaker Encryption

ARTICLE · ↑ trending · Reddit r/MachineLearning · 4/18/2026

easyaligner: Forced alignment with GPU acceleration and flexible text normalization (compatible with all w2v2 models on HF Hub) [P]

easyaligner is a new, performant forced alignment library offering GPU acceleration and flexible text normalization, compatible with all w2v2 models on Hugging Face Hub. It addresses common challenges in speech-to-text preprocessing, such as handling partial transcripts, irrelevant audio, and long segments without chunking.

GPU Acceleration machine learning natural language processing Speech-to-Text

DOC · StatQuest (YouTube) · 1d ago

StatQuest: Random Forests Part 2: Missing data and clustering

This StatQuest episode, part two of a series on Random Forests, delves into how to handle missing data and perform clustering using this powerful machine learning algorithm. It provides an in-depth explanation of these advanced applications.

Random Forests learning Clustering machine learning

ARTICLE · ↑ trending · Hacker News (AI) · 4d ago

Ask HN: AI researchers – what's a recent paper that recently blew your mind?

A Hacker News user asks AI researchers to share recent machine learning papers that have impressed them. The goal is to find new and exciting publications in the ML space for those actively seeking new developments.

Academic Papers Research Recommendations machine learning AI

ARTICLE · DEV.to AI · 4/22/2026

No Free Lunch Theorem — Deep Dive + Problem: Reverse Bits

The No Free Lunch Theorem is a fundamental concept in machine learning that highlights the limitations of any learning algorithm. It states that, averaged over all possible problems, no algorithm outperforms any other, which is why algorithm selection has to be problem-specific.

machine learning Algorithms

ARTICLE · ↑ trending · Hacker News (AI) · 11d ago

Otari: Own Your AI Stack

Mozilla AI introduces Otari, an initiative aimed at empowering users to control their own AI infrastructure. It seeks to foster autonomy and customization, enabling individuals to "own their AI stack" rather than relying on large providers.

open-source AI Data Ownership machine learning Mozilla AI

RESEARCH · ↑ trending · Reddit r/MachineLearning · 4/23/2026

8 inputs → 58 body params: putting a body-model forward pass inside the training loss [P]

A small multi-layer perceptron (MLP) predicts 58 Anny body-shape parameters from 8 questionnaire inputs, outperforming existing photo-based and linear regression methods. The training loss, which includes a forward pass through the body model, is key to this accuracy and yields low mean absolute errors on critical body measurements.

neural networks body modeling Performance Metrics machine learning

ARTICLE · ↑ trending · Hacker News (AI) · 13d ago

Training our own AI models

This article discusses the process and considerations involved in training custom AI models. It covers the challenges and benefits of developing in-house artificial intelligence capabilities.

AI training machine learning data science custom models

ARTICLE · ↑ trending · Reddit r/MachineLearning · 4/23/2026

Isolation Forest + eBPF events to create a Linux based endpoint detection system [P]

The author is developing 'guardd', a Linux host-based anomaly detection system utilizing Isolation Forest with eBPF events. It groups exec and network events into 60-second windows to create feature vectors, trained unsupervised to detect anomalies, though it currently faces false positive issues.

security machine learning
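
The windowing step mentioned in this summary could look roughly like the sketch below. It is an illustration under stated assumptions, not guardd's actual code: the `(timestamp, kind)` event format, the `"exec"` and `"net"` labels, the per-window count features, and the function name `window_features` are all hypothetical.

```python
from collections import defaultdict

WINDOW_SECONDS = 60
KINDS = ("exec", "net")  # hypothetical event labels

def window_features(events):
    """Map window index -> [exec_count, net_count] for (timestamp, kind) events."""
    windows = defaultdict(lambda: [0] * len(KINDS))
    for timestamp, kind in events:
        index = int(timestamp // WINDOW_SECONDS)  # which 60-second window
        windows[index][KINDS.index(kind)] += 1
    return dict(windows)

# Hypothetical events: seconds since start, event kind.
events = [
    (0.0, "exec"),
    (10.5, "net"),
    (59.9, "exec"),
    (60.0, "net"),
    (125.0, "exec"),
]
features = window_features(events)
print(features)
```

Each resulting vector is one sample for an unsupervised detector such as an Isolation Forest. Real feature vectors would likely carry more than raw counts, which may matter for the false-positive rate the author reports.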