← heapsort-ai

machine learning

781 items

RESEARCHarXiv CS.LG·20h ago

MedicalRec: Medical recommender system for image classification without retraining

This study introduces MedicalRec, a medical recommender system for image classification, designed to optimize model selection without the need for extensive retraining. It addresses the computational and energy challenges of model identification by leveraging a publicly available dataset, MedicalRec-Bench, compiled from 3,000 articles and over 5,000 tested model records.

54
RESEARCHarXiv CS.LG·20h ago

Boundary Variance Inflation Causes Acquisition Bias in Gaussian Processes

This paper investigates inflated posterior variance near the boundary in Gaussian processes, tracing the root cause to the truncation of the kernel correlation neighborhood. It shows how this geometric distortion creates acquisition bias, affecting selection patterns across different acquisition classes, independent of objective functions.

54
RESEARCHarXiv CS.LG·20h ago

When Should an AI Scientist Stop? Verifiable Experiment Steering and Refusal for Autonomous Discovery

This paper introduces CARTOGRAPH, a verification layer for AI scientists that integrates experiment steering, ambiguity closure, and inadequacy detection. It demonstrates superior performance over raw projection methods and successfully identifies and revokes out-of-library pharmacokinetic mechanisms, enhancing autonomous discovery.

54
RESEARCH↑ trendingReddit r/MachineLearning·4/22/2026

Training-time intervention yields 63.4% blind-pair human preference at matched val-loss (1.2B params, 320 judgments, p = 1.98 × 10⁻⁵) [R]

A training-time intervention for 1.2B-parameter LMs, using a precision-weighted gain function and divergence-scaled gradients, resulted in significantly higher human preference (63.4%, p < 0.00002) compared to standard training. Notably, this preference shift occurred without altering the aggregate validation loss metric, indicating that training interventions beyond RLHF can be effective.

47
ARTICLE↑ trendingReddit r/MachineLearning·4/18/2026

easyaligner: Forced alignment with GPU acceleration and flexible text normalization (compatible with all w2v2 models on HF Hub) [P]

easyaligner is a new, performant forced alignment library offering GPU acceleration and flexible text normalization, compatible with all w2v2 models on Hugging Face Hub. It addresses common challenges in speech-to-text preprocessing, such as handling partial transcripts, irrelevant audio, and long segments without chunking.

easyaligner: Forced alignment with GPU acceleration and flexible text normalization (compatible with all w2v2 models on HF Hub) [P]
46
RESEARCH↑ trendingReddit r/MachineLearning·4/23/2026

8 inputs → 58 body params: putting a body-model forward pass inside the training loss [P]

A small Multi-Layer Perceptron (MLP) model accurately predicts 58 Anny body-shape parameters from 8 questionnaire inputs, outperforming existing photo-based and linear regression methods. The model's innovative training loss function is key to its superior accuracy, achieving low Mean Absolute Errors for critical body measurements.

43