machine learning

790 items

RESEARCHarXiv CS.CL·5/6/2026

When Should a Language Model Trust Itself? Same-Model Self-Verification as a Conditional Confidence Signal

This research evaluates same-model self-verification as a confidence signal for selective prediction, comparing it against likelihood-based baselines. The study reveals task- and model-dependent results, showing significant improvements for some models on ARC-Challenge but less reliability and occasional degradation on TruthfulQA-MC.

language models AI Confidence Selective Prediction machine learning

RESEARCHarXiv CS.AI·22d ago

TTE-Flash: Accelerating Reasoning-based Multimodal Representations via Think-Then-Embed Tokens

This work proposes TTE-Flash, a method to accelerate reasoning-based multimodal representations by replacing explicit Chain-of-Thought (CoT) with latent think tokens. It aims to achieve high-performance, reasoning-aware representations at a constant inference cost.

neural networks multimodal AI machine learning Computational Efficiency

RESEARCHarXiv CS.CL·29d ago

jina-embeddings-v5-omni: Geometry-preserving Embeddings via Locked Aligned Towers

This work introduces GELATO, a novel approach to multimodal embedding models that extends VLM-style architectures. It results in the jina-embeddings-v5-omni suite, which efficiently encodes text, image, audio, and video into a single semantic embedding space by freezing backbone text models and training only connecting components.

embedding models multimodal AI deep learning machine learning

RESEARCHarXiv CS.AI·20d ago

High Quality Embeddings for Horn Logic Reasoning

This paper introduces novel approaches for creating high-quality embeddings for logical statements, crucial for training neural networks to efficiently rank choices made by logical reasoners. These methods involve generating anchors with repeated terms, balancing easy, medium, and hard examples for triplet loss training, and periodically emphasizing the hardest examples.

neural networks Logic reasoning machine learning embeddings

RESEARCHarXiv CS.AI·22d ago

PRISMat: Policy-Driven, Permutation-Invariant Autoregressive Material Generation

This paper introduces PRISMat, a cost-effective, permutation-invariant model designed for the rapid identification of candidate materials. It addresses the inefficiencies of large language models in material generation by offering a faster and cheaper alternative for material filtering.

Materials Science AI models machine learning Computational Efficiency

DOCHugging Face Blog·4/23/2026

How to Use Transformers.js in a Chrome Extension

This content provides a guide on integrating the Transformers.js library into a Chrome Extension. It details the steps and considerations for running machine learning models directly within the browser.

web development machine learning NLP AI

ARTICLEHugging Face (YouTube)·20d ago

On the slow death of Scaling (birth of Adaption Labs) | Sara Hooker | HF ML Club India EP2

This content explores the evolution of AI methodologies, discussing the decline of traditional scaling approaches and the emergence of new strategies, exemplified by the birth of Adaption Labs. Presented by Sara Hooker, the HF ML Club India episode delves into significant shifts within the field of artificial intelligence.

Adaption Labs machine learning scaling AI research

On the slow death of Scaling (birth of Adaption Labs) | Sara Hooker | HF ML Club India EP2

CASEAmazon Web Services (YouTube)·5/4/2026

Accuracy Without Compromise: AI for Investment Management | Amazon Web Services

This content from Amazon Web Services explores the application of AI in investment management, emphasizing accuracy and reliability. It highlights how AI can enhance decision-making without compromising precision in financial operations.

Investment Management Financial services machine learning AWS

Accuracy Without Compromise: AI for Investment Management | Amazon Web Services

DOCAWS Machine Learning Blog·5/4/2026

Agent-guided workflows to accelerate model customization in Amazon SageMaker AI

Amazon SageMaker AI now provides an agentic experience, allowing developers to describe their use cases using natural language. An AI coding agent then streamlines the entire model customization lifecycle, from data preparation to deployment, enhancing efficiency.

Model customization workflow automation machine learning Amazon SageMaker AI

RESEARCHarXiv CS.LG·4/23/2026

Graph-Theoretic Models for the Prediction of Molecular Measurements

This document explores graph-theoretic models for predicting molecular measurements. The research focuses on applying mathematical structures to understand and anticipate chemical and biological properties.

machine learning cheminformatics graph theory molecular prediction

ARTICLEDEV.to AI·4/22/2026

Is the DOE Framework Still Relevant in the Age of Claude Skills?

This article explores the continued relevance of the DOE (Design of Experiments) framework in the age of advanced AI skills, such as those offered by Claude. It evaluates whether traditional methodologies are still applicable or if new approaches are needed to optimize AI-driven systems.

development frameworks Claude AI machine learning

ARTICLEDEV.to AI·4/22/2026

Blog 2: Momentum-Based Optimizers

This blog content discusses momentum-based optimizers, exploring their function and importance in accelerating the training of machine learning models. It details how these algorithms improve the convergence and efficiency of neural networks.

deep learning machine learning AI Algorithms

DOCStatQuest (YouTube)·23d ago

The Essence of Linear Regression

This content explains the essence of linear regression, a fundamental statistical method used to model the relationship between a dependent variable and one or more independent variables. It covers the basic principles and significance of this technique in data analysis.

linear regression machine learning data science Statistics

RESEARCHDEV.to AI·4/21/2026

Learning to be Safe: Deep RL with a Safety Critic

This content explores a novel approach to Deep Reinforcement Learning by integrating a "safety critic" to prevent unsafe actions. The methodology aims to enhance the reliability and robustness of AI agents, making them suitable for real-world deployment where safety is critical.

deep learning reinforcement learning security machine learning

DOCGoogle for Developers (YouTube)·4/30/2026

Unlocking Low-Level Control: Customizing Keras Training Loops with JAX

This content discusses how to gain low-level control and customize Keras training loops. It details the integration with JAX to allow for greater flexibility and performance in machine learning model development.

Training Loops Keras deep learning machine learning

Unlocking Low-Level Control: Customizing Keras Training Loops with JAX

DOCML Mastery·4/27/2026

Text Summarization with Scikit-LLM

This content focuses on text summarization using the Scikit-LLM library. It is likely a guide or tutorial demonstrating how to implement this functionality. The piece explores applying large language models to efficiently condense textual information.

learning machine learning NLP Scikit-LLM

ARTICLEDEV.to AI·4/18/2026

Hermes 4's Tool-Calling Is Trained as a Separate Skill. Here's Why Your Agent Cares.

This article discusses why Hermes 4's tool-calling is trained as a separate skill, explaining the underlying rationale and its critical implications for AI agents. It highlights how this distinct training approach enhances agent capabilities and overall performance.

AI models machine learning tool-calling Agentic AI

RESEARCHarXiv CS.LG·4/17/2026

The Devil Is in Gradient Entanglement: Energy-Aware Gradient Coordinator for Robust Generalized Category Discovery

This research paper introduces an Energy-Aware Gradient Coordinator to address "gradient entanglement," a key challenge in Robust Generalized Category Discovery. The proposed method aims to improve the robustness and performance of AI models in identifying new categories.

Gradient Descent category discovery deep learning machine learning

RESEARCHarXiv CS.AI·4/16/2026

WebXSkill: Skill Learning for Autonomous Web Agents

This content introduces WebXSkill, a system focused on skill learning for autonomous web agents. It aims to enable AI agents to perform complex tasks on the web independently.

machine learning autonomous agents AI Skill Learning

RESEARCHarXiv CS.CL·4/15/2026

Filtered Reasoning Score: Evaluating Reasoning Quality on a Model's Most-Confident Traces

This research introduces the "Filtered Reasoning Score," a novel metric designed to assess the quality of reasoning in AI models. It specifically focuses on evaluating the reasoning evident in a model's most confident outputs or traces.

AI metrics machine learning Reasoning AI evaluation