deep learning

263 items

RESEARCHDEV.to AI·4/21/2026

Multi-Objective Deep Reinforcement Learning

This content explores the field of Multi-Objective Deep Reinforcement Learning. It likely delves into techniques for training AI agents to optimize multiple performance criteria concurrently.

Optimization deep learning reinforcement learning

ARTICLEDEV.to AI·4/26/2026

Your Transformer is Secretly Linear

This article explores the idea that, despite their complexity, Transformer models might exhibit linear properties or be equivalent to them in certain aspects. The discussion delves into the fundamental nature of these AI models and their implications.

neural networks deep learning machine learning AI

DOCDEV.to AI·4/28/2026

Building a No-Install AI Upscaler: Leveraging Cloud GPUs for Seamless Image Processing

The GoHard AI Upscaler is a browser-based tool for professional-grade image enhancement, removing the need for high-end local rigs. It achieves zero installation and consistent performance by utilizing Python, optimized AI models, and Google Colab cloud GPUs.

Image processing deep learning cloud computing machine learning

DOCHugging Face Blog·12d ago

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

This article is a beginner's guide to using `torch.profiler` for performance analysis in PyTorch. It explains how to effectively profile deep learning models to identify bottlenecks and optimize execution.

deep learning learning profiling performance

RESEARCHarXiv CS.LG·4/30/2026

RaMP: Runtime-Aware Megakernel Polymorphism for Mixture-of-Experts

RaMP is a routing-aware dispatch framework designed to optimize Mixture-of-Experts (MoE) inference, addressing significant throughput loss from current batch-size-only configurations. It uses a performance-region analysis and a four-parameter wave cost model to select optimal kernel configurations, achieving up to 1.22x kernel speedup and 0.93% mean regret versus exhaustive search.

deep learning AI optimization performance

RESEARCHarXiv CS.AI·5/6/2026

Virtual Speech Therapist: A Clinician-in-the-Loop AI Speech Therapy Agent for Personalized and Supervised Therapy

This paper introduces the Virtual Speech Therapist (VST), an intelligent agent-based platform that streamlines stuttering assessment and delivers customized therapy through automated and adaptive AI-driven workflows. VST integrates deep learning for stuttering classification and multi-agent LLM reasoning to generate and refine individualized therapy plans, with a critic agent ensuring clinical safety and adherence to guidelines.

deep learning AI in healthcare speech therapy stuttering

RESEARCHarXiv CS.AI·4/8/2026

MedGemma 1.5 Technical Report

O MedGemma 1.5 4B é um novo modelo que expande as capacidades do MedGemma 1, integrando análise de imagens médicas de alta dimensão (CT/MRI, histopatologia), localização anatômica e compreensão de documentos médicos. Ele demonstra ganhos significativos em precisão de classificação de condições em MRI e CT, e um aumento de 47% no macro F1 para imagens de patologia de lâmina inteira.

deep learning AI healthcare AI Medical Imaging

RESEARCHarXiv CS.LG·4/6/2026

Convolutional Surrogate for 3D Discrete Fracture-Matrix Tensor Upscaling

Este estudo aborda o alto custo computacional da modelagem de fluxo de água subterrânea em meios fraturados usando simulações DFM. Para otimizar o processo, propõe-se um modelo substituto baseado em rede neural convolucional 3D para prever a condutividade hidráulica equivalente, permitindo um framework Monte Carlo multinível mais eficiente.

Simulação Numérica Modelos Substitutos Modelagem Hidrogeológica Monte Carlo

RESEARCHarXiv CS.CL·4/6/2026

CIPHER: Conformer-based Inference of Phonemes from High-density EEG

CIPHER é um modelo baseado em Conformer para inferência de fonemas a partir de EEG de alta densidade, visando decodificar informações de fala do cérebro. Embora alcance alta performance em tarefas binárias, mostra desempenho limitado na discriminação de fonemas de 11 classes, sendo posicionado como um estudo de benchmark e comparação de características.

deep learning speech decoding brain-computer interface machine learning

RESEARCHarXiv CS.CL·28d ago

jina-embeddings-v5-omni: Geometry-preserving Embeddings via Locked Aligned Towers

This work introduces GELATO, a novel approach to multimodal embedding models that extends VLM-style architectures. It results in the jina-embeddings-v5-omni suite, which efficiently encodes text, image, audio, and video into a single semantic embedding space by freezing backbone text models and training only connecting components.

embedding models multimodal AI deep learning machine learning

ARTICLEML Mastery·10d ago

Serving Multiple Users at Once: How Continuous Batching Keeps LLM Inference Efficient

This article explores how continuous batching improves LLM inference efficiency, addressing the issues of static batching. It details dynamic scheduling and ragged batching to process multiple requests simultaneously.

inference deep learning efficiency Batching

Serving Multiple Users at Once: How Continuous Batching Keeps LLM Inference Efficient

ARTICLELangChain Blog·20d ago

Interpreters in Deep Agents: Code Between Tool Calls and Sandboxes

Deep Agents now supports interpreters: small embedded runtimes where agents write code to coordinate tools, hold working state, and decide what enters model context.

deep learning Tool Coordination Runtime Environments Interpreters

Interpreters in Deep Agents: Code Between Tool Calls and Sandboxes

ARTICLEDEV.to AI·4/22/2026

Blog 2: Momentum-Based Optimizers

This blog content discusses momentum-based optimizers, exploring their function and importance in accelerating the training of machine learning models. It details how these algorithms improve the convergence and efficiency of neural networks.

deep learning machine learning AI Algorithms

RESEARCHDEV.to AI·4/21/2026

Learning to be Safe: Deep RL with a Safety Critic

This content explores a novel approach to Deep Reinforcement Learning by integrating a "safety critic" to prevent unsafe actions. The methodology aims to enhance the reliability and robustness of AI agents, making them suitable for real-world deployment where safety is critical.

deep learning reinforcement learning security machine learning

DOCGoogle for Developers (YouTube)·4/30/2026

Unlocking Low-Level Control: Customizing Keras Training Loops with JAX

This content discusses how to gain low-level control and customize Keras training loops. It details the integration with JAX to allow for greater flexibility and performance in machine learning model development.

Training Loops Keras deep learning machine learning

Unlocking Low-Level Control: Customizing Keras Training Loops with JAX

RESEARCHarXiv CS.LG·4/17/2026

The Devil Is in Gradient Entanglement: Energy-Aware Gradient Coordinator for Robust Generalized Category Discovery

This research paper introduces an Energy-Aware Gradient Coordinator to address "gradient entanglement," a key challenge in Robust Generalized Category Discovery. The proposed method aims to improve the robustness and performance of AI models in identifying new categories.

Gradient Descent category discovery deep learning machine learning

RESEARCHarXiv CS.AI·4/15/2026

Identity as Attractor: Geometric Evidence for Persistent Agent Architecture in LLM Activation Space

This study explores identity as an attractor in the persistent agent architecture within LLM activation spaces. It presents geometric evidence to understand the underlying structure and behavior of language models.

AI architecture LLMs deep learning computational geometry

RESEARCHarXiv CS.LG·4/13/2026

Ranked Activation Shift for Post-Hoc Out-of-Distribution Detection

This research introduces a method called Ranked Activation Shift for post-hoc out-of-distribution detection. It aims to improve the identification of data samples that deviate from the training distribution.

OOD Detection neural networks deep learning machine learning

ARTICLETwo Minute Papers (YouTube)·4/28/2026

Solved: The Bug That Haunted AI Video For Years

A persistent bug that has affected AI video technology for years has finally been solved. This fix represents a significant advancement for the quality and stability of artificial intelligence-based video systems.

AI video deep learning computer vision bug fix

Solved: The Bug That Haunted AI Video For Years

RESEARCHHugging Face Blog·3/9/2026

Ulysses Sequence Parallelism: Training with Million-Token Contexts

Este conteúdo aborda o paralelismo de sequência Ulysses, uma técnica inovadora para o treinamento de modelos de inteligência artificial. O foco está na capacitação de modelos para processar contextos de milhões de tokens de forma eficiente.

deep learning Long Contexts Training High-Performance Computing