deep learning

263 items

RESEARCH·arXiv CS.LG·8d ago

BitsMoE: Efficient Spectral Energy-Guided Bit Allocation for MoE LLM Quantization

BitsMoE proposes a spectral-energy-guided bit-allocation framework for quantizing Mixture-of-Experts (MoE) large language models. It addresses memory-intensive deployment by decomposing MoE layers and using expert-specific spectral factors for fine-grained, activation-aware mixed-precision quantization.

MoE models deep learning AI optimization quantization

ARTICLE·DEV.to AI·4/11/2026

CNN Layer Composition — A Practical Developer Guide to Activation, Pooling, and Fully Connected Layers

This practical guide details the composition of CNN layers, explaining how activation, pooling, and fully connected layers work together to transform feature maps into predictions. It emphasizes that non-linearity, introduced by functions like ReLU, is what lets the network learn complex features.

neural networks CNN deep learning Activation Functions
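As a rough illustration of the pipeline this guide describes, the sketch below pushes one feature map through ReLU, 2x2 max pooling, and a fully connected layer in plain Python. It is not taken from the article: the feature map, weights, and helper names (`relu`, `max_pool_2x2`, `dense`) are hypothetical, and a real network would learn its weights during training.

```python
def relu(feature_map):
    """Apply ReLU element-wise: negative activations become 0."""
    return [[max(0.0, v) for v in row] for row in feature_map]


def max_pool_2x2(feature_map):
    """Downsample with a 2x2 window and stride 2 (assumes even height and width)."""
    pooled = []
    for r in range(0, len(feature_map), 2):
        row = []
        for c in range(0, len(feature_map[0]), 2):
            window = [feature_map[r][c], feature_map[r][c + 1],
                      feature_map[r + 1][c], feature_map[r + 1][c + 1]]
            row.append(max(window))
        pooled.append(row)
    return pooled


def dense(inputs, weights, biases):
    """Fully connected layer: one weighted sum plus bias per output unit."""
    return [sum(w * x for w, x in zip(unit_weights, inputs)) + b
            for unit_weights, b in zip(weights, biases)]


# Hypothetical 4x4 feature map, as if produced by a convolution.
feature_map = [
    [1.0, -2.0, 3.0, 0.5],
    [-1.0, 4.0, -0.5, 2.0],
    [0.0, -3.0, 1.5, -1.0],
    [2.0, 1.0, -2.0, 0.0],
]

activated = relu(feature_map)       # non-linearity
pooled = max_pool_2x2(activated)    # 4x4 -> 2x2
flat = [v for row in pooled for v in row]

# Illustrative weights for two output classes (assumed, not learned).
weights = [[0.5, -0.25, 0.0, 1.0], [-0.5, 0.25, 1.0, 0.0]]
biases = [0.0, 0.5]
scores = dense(flat, weights, biases)
print(pooled, scores)
```

Without the `relu` step, stacking the remaining linear operations would still collapse to a single linear map, which is the point the guide makes about non-linearity.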

DOC·DEV.to AI·12d ago

Recurrent Neural Networks for Time Series Forecasting

Recurrent Neural Networks are explored for time series forecasting, highlighting their ability to model data sequences. This content details how these architectures function and their practical applications in the field of artificial intelligence.

neural networks forecasting deep learning machine learning

RESEARCH·DEV.to AI·17d ago

Visual Sentiment Prediction with Deep Convolutional Neural Networks

This paper focuses on visual sentiment prediction using deep convolutional neural networks. It explores advanced methods for analyzing and interpreting emotions in images through AI.

neural networks deep learning computer vision sentiment analysis

DOC·DEV.to AI·5/1/2026

🏈 TensorCraft Playbook: From Classroom CNNs to Cloud TPUs with Keras

This content describes the fundamental components of a Convolutional Neural Network (CNN) architecture, detailing feature extraction with Conv2D, spatial reduction with MaxPooling2D, regularization with Dropout, and classification using dense layers. It focuses on designing a balanced structure for hierarchical spatial pattern extraction in images.

neural networks CNN Keras deep learning
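To make the roles of those layers concrete, here is a small shape-tracing sketch in plain Python rather than Keras itself. It assumes stride-1 convolutions with "valid" padding and pooling with stride equal to the pool size; the 28x28x1 input, filter counts, and 10-class head are hypothetical choices, not the article's architecture.

```python
def conv2d_shape(shape, filters, kernel):
    """Output shape of a stride-1, 'valid'-padded convolution on (H, W, C)."""
    h, w, _ = shape
    return (h - kernel + 1, w - kernel + 1, filters)


def max_pool_shape(shape, pool):
    """Output shape of max pooling with stride equal to the pool size."""
    h, w, c = shape
    return (h // pool, w // pool, c)


def conv2d_params(in_channels, filters, kernel):
    """Trainable weights plus one bias per filter."""
    return kernel * kernel * in_channels * filters + filters


def dense_params(inputs, units):
    """Trainable weights plus one bias per unit."""
    return inputs * units + units


input_shape = (28, 28, 1)  # assumed grayscale input
trace = [("input", input_shape)]

shape = conv2d_shape(input_shape, filters=32, kernel=3)
trace.append(("conv 3x3, 32 filters", shape))
shape = max_pool_shape(shape, pool=2)
trace.append(("max pool 2x2", shape))
shape = conv2d_shape(shape, filters=64, kernel=3)
trace.append(("conv 3x3, 64 filters", shape))
shape = max_pool_shape(shape, pool=2)
trace.append(("max pool 2x2", shape))

# Dropout only zeroes activations during training, so the shape is unchanged.
flat_units = shape[0] * shape[1] * shape[2]
head_params = dense_params(flat_units, units=10)

for name, s in trace:
    print(f"{name:22s} -> {s}")
print("flattened units:", flat_units, "| dense head parameters:", head_params)
```

Tracing shapes this way shows how pooling shrinks the spatial grid while the filter count grows, which is the balance the summary refers to.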

RESEARCH·DEV.to AI·4/26/2026

Deep Generative Dual Memory Network for Continual Learning

This paper presents a deep generative dual memory network for continual learning. The dual memory approach aims to let the model acquire new information without forgetting previously learned knowledge.

neural networks deep learning Continual Learning Generative AI

RESEARCH·DEV.to AI·27d ago

WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent

WebWatcher introduces a novel vision-language deep research agent, pushing the boundaries of AI capabilities by integrating visual understanding with language processing. This research explores new frontiers in how AI systems can perceive and interact with complex information.

deep learning AI agent vision-language AI research

RESEARCH·DEV.to AI·4/26/2026

A Physics-Informed Deep Learning Paradigm for Car-Following Models

This research introduces a novel physics-informed deep learning paradigm designed for developing car-following models. The approach aims to integrate fundamental physical principles directly into deep neural networks to enhance the accuracy and interpretability of traffic simulations.

Traffic Modeling deep learning Autonomous Vehicles simulation

DOC·DEV.to AI·20d ago

AI Tesla FSD Waymo

This comprehensive guide explores the shift from modular to end-to-end autonomous driving, comparing different architectures like Tesla FSD V12 and Waymo. It details the pros and cons of each approach, including hybrid solutions and multimodal large models.

Waymo deep learning autonomous driving Tesla FSD

DOC·ML Mastery·13d ago

The Statistics of Token Selection: Logits, Temperature, and Top-P Walkthrough

This content explains the process of token selection in large language models (LLMs). It details how criteria such as logits, temperature, and top-p influence the coherence and creativity of the generated outputs.

LLMs Token Selection deep learning machine learning
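The three knobs named in this walkthrough can be sketched in a few lines of standard-library Python. This is a minimal illustration under stated assumptions, not the article's code: the function names and the four example logits are hypothetical, and `temperature` is assumed to be greater than zero.

```python
import math
import random


def softmax_with_temperature(logits, temperature=1.0):
    """Turn logits into probabilities; lower temperature sharpens the distribution."""
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def top_p_filter(probs, top_p=0.9):
    """Keep the smallest set of top tokens whose cumulative mass reaches top_p,
    then renormalize. Returns {token_id: probability}."""
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in ranked:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break
    mass = sum(probs[i] for i in kept)
    return {i: probs[i] / mass for i in kept}


def sample_token(logits, temperature=1.0, top_p=0.9, rng=random):
    """Sample one token id from the temperature-scaled, top-p-truncated distribution."""
    probs = softmax_with_temperature(logits, temperature)
    nucleus = top_p_filter(probs, top_p)
    ids = list(nucleus)
    return rng.choices(ids, weights=[nucleus[i] for i in ids], k=1)[0]


logits = [2.0, 1.0, 0.5, -1.0]  # hypothetical scores for a 4-token vocabulary
probs = softmax_with_temperature(logits)
nucleus = top_p_filter(probs, top_p=0.9)
token = sample_token(logits, temperature=0.7, top_p=0.9, rng=random.Random(0))
print(probs, nucleus, token)
```

With these logits, `top_p=0.9` drops only the least likely token, while a smaller `top_p` or a lower temperature concentrates sampling on the top candidates.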

ARTICLE·DEV.to AI·4/24/2026

Layer Normalization — Deep Dive + Problem: Largest Connected Region

This content provides a deep dive into Layer Normalization, a crucial component of the Transformer architecture introduced in the "Attention Is All You Need" paper. It details the technique's importance for stabilizing training and improving the performance of Large Language Models (LLMs).

Transformer Architecture LLMs deep learning NLP
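For readers who want the operation itself, the following is a minimal sketch of layer normalization over a single token vector in plain Python. The function name, the example vector, and the `eps` default are assumptions for illustration; frameworks apply the same idea to batched tensors with learned `gamma` and `beta`.

```python
import math


def layer_norm(x, gamma, beta, eps=1e-5):
    """Normalize one feature vector to zero mean and unit variance,
    then apply a per-feature scale (gamma) and shift (beta)."""
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)  # biased variance over features
    inv_std = 1.0 / math.sqrt(var + eps)
    return [g * (v - mean) * inv_std + b for v, g, b in zip(x, gamma, beta)]


token = [1.0, 2.0, 3.0, 4.0]  # hypothetical hidden state for one token
normalized = layer_norm(token, gamma=[1.0] * 4, beta=[0.0] * 4)
print(normalized)
```

Because the statistics are computed per token across its features, the result does not depend on batch size, which is one reason the technique suits sequence models.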

RESEARCH·DEV.to AI·15d ago

François Chollet on the Future of AGI

François Chollet discusses the future of AGI, predicting its arrival around 2030, and introduces NDI lab's mission to develop a new, "optimal" machine learning paradigm based on symbolic program synthesis. He critiques deep learning's limitations and outlines NDI's high-risk, high-reward strategy for foundational AI advancement.

AGI deep learning Symbolic AI Benchmarks

DOC·DEV.to AI·5/3/2026

DeepRobust: A PyTorch Library for Adversarial Attacks and Defenses

DeepRobust is a PyTorch library designed for research and development in adversarial attacks and defenses. It provides tools for testing the robustness of deep learning models against malicious manipulations.

deep learning security machine learning adversarial AI

RESEARCH·DEV.to AI·4/24/2026

Two-Stream 3D Convolutional Neural Network for Skeleton-Based Action Recognition

This content describes a two-stream 3D convolutional neural network designed for skeleton-based action recognition.

neural networks deep learning computer vision Action Recognition

RESEARCH·arXiv CS.CL·4/7/2026

MultiPress: A Multi-Agent Framework for Interpretable Multimodal News Classification

This paper proposes MultiPress, a novel three-stage multi-agent framework for multimodal news classification that aims to overcome the limitations of existing methods in understanding heterogeneous data such as text and images. The research integrates specialized agents for perception, retrieval-augmented reasoning, and fusion, demonstrating significant improvements on a new large-scale dataset.

news classification deep learning multimodal classification multi-agent systems

ARTICLE·DEV.to AI·5/1/2026

Gemini 3.1 Flash TTS: the next generation of expressive AI speech

Gemini 3.1 Flash TTS marks a significant leap in expressive, human-like speech synthesis, leveraging advanced prosody modeling and context awareness. The system also achieves lightning-fast, near real-time latency.

deep learning AI Text-to-Speech

RESEARCH·DEV.to AI·26d ago

Recent Advances in Object Detection in the Age of Deep Convolutional Neural Networks

This content discusses the recent advancements in object detection, specifically focusing on the role and impact of deep convolutional neural networks. It likely explores new techniques, models, and challenges within this rapidly evolving field of artificial intelligence.

deep learning object detection computer vision Convolutional Neural Networks

RESEARCH·DEV.to AI·5/4/2026

Tensor Programs II: Neural Tangent Kernel for Any Architecture

This research explores "Tensor Programs II", focusing on extending the Neural Tangent Kernel (NTK) to be applicable across any neural network architecture. It aims to provide a unified framework for understanding and analyzing the infinite-width limit of neural networks.

Neural Tangent Kernel deep learning Tensor Programs machine learning

ARTICLE·DEV.to AI·5/1/2026

I Built an AI That Detects Pneumonia From Chest X-Rays: Here's Exactly How I Did It

The author built and launched "PneumoScan AI," a deep learning model that detects pneumonia from chest X-rays with over 90% accuracy, aiming to address slow diagnosis in low-resource areas. This article details the development process, including the use of a Kaggle dataset and the discovery that the dataset was imbalanced.

deep learning pneumonia detection healthcare AI Medical Imaging

RESEARCH·DEV.to AI·24d ago

MobileVLM V2: Faster and Stronger Baseline for Vision Language Model

MobileVLM V2 introduces a new and enhanced baseline for vision language models, focusing on faster performance and stronger capabilities. This research aims to advance the efficiency and robustness of VLMs on mobile platforms.

AI models Vision-Language Models research deep learning