
deep learning

263 items

RESEARCH · arXiv CS.LG · 1d ago

Multi-Scale Feature Attention Network for Polymer Classification using THz Dual-Comb Spectroscopy

This research proposes the Multi-Scale Feature Attention Network (MSFAN), a novel deep learning architecture for classifying 12 types of polymers using Terahertz Dual-Comb Spectroscopy (THz-DCS). This framework addresses the challenges of reliable polymer identification for recycled plastics by integrating feature gating and multi-scale parallel convolutions.

60
ARTICLE · Hugging Face (YouTube) · 10h ago

What Is ONNX? (And Why Transformers.js Uses It)

ONNX (Open Neural Network Exchange) is an open standard that defines a common graph format for machine learning models, allowing interoperability between various ML frameworks. Transformers.js leverages ONNX to run pre-trained transformer models directly in the browser, providing efficient and platform-agnostic AI inference capabilities for web applications.

58
RESEARCH · arXiv CS.LG · 19h ago

MedicalRec: Medical recommender system for image classification without retraining

This study introduces MedicalRec, a medical recommender system for image classification, designed to optimize model selection without the need for extensive retraining. It addresses the computational and energy challenges of model identification by leveraging a publicly available dataset, MedicalRec-Bench, compiled from 3,000 articles and over 5,000 tested model records.

54
RESEARCH · arXiv CS.LG · 19h ago

TriHead-GAN: A Generative Adversarial Network with Triple-Head Discriminator for Carbon Emission Time Series Generation

TriHead-GAN is a Transformer-based Generative Adversarial Network with a triple-head discriminator, proposed to address the scarcity of city-level carbon emission data. Compared with existing models, it improves time series generation by better preserving cross-variable correlations and realistic step-wise variability.

54
RESEARCH · arXiv CS.LG · 19h ago

STARIXNet: Multivariate and Multi-attribute Deep Learning Approach to Real-Time Resource Allocation in Cloud Platforms

The paper introduces STARIXNet, a lightweight neural network for resource allocation in cloud platforms. It targets the limitations of current univariate solutions, which neglect the risks of underestimation and delays. The model captures spatio-temporal relationships and multiple attributes to guide microservice scaling decisions.

54
ARTICLE · ↑ trending · Reddit r/MachineLearning · 4/11/2026

FlashAttention (FA1–FA4) in PyTorch - educational implementations focused on algorithmic differences [P]

An updated PyTorch repository features educational implementations of FlashAttention versions FA1 through FA4. The focus is on demonstrating the algorithmic differences and evolution of the method, facilitating an understanding of its design ideas without delving into hardware specifics.
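As a rough illustration of the core algorithmic idea shared across FlashAttention versions, the sketch below computes attention for a single query by streaming over key/value blocks with an online softmax, so the full score vector is never materialized. It is plain Python rather than code from the linked repository, and the function names, the `block_size` parameter, and the single-query setup are assumptions made for illustration only.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def naive_attention(q, keys, values):
    """Reference: softmax over all scores at once, then a weighted sum."""
    scale = 1.0 / math.sqrt(len(q))
    scores = [dot(q, k) * scale for k in keys]
    m = max(scores)
    weights = [math.exp(s - m) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    return [sum(w * v[j] for w, v in zip(weights, values)) / total
            for j in range(dim)]


def tiled_attention(q, keys, values, block_size=2):
    """Same result, computed one key/value block at a time.

    Keeps three running quantities: the max score seen so far, the
    softmax normalizer, and an unnormalized output accumulator. When a
    new block raises the max, the old sums are rescaled to match.
    """
    scale = 1.0 / math.sqrt(len(q))
    dim = len(values[0])
    running_max = float("-inf")
    running_sum = 0.0
    acc = [0.0] * dim

    for start in range(0, len(keys), block_size):
        k_blk = keys[start:start + block_size]
        v_blk = values[start:start + block_size]
        scores = [dot(q, k) * scale for k in k_blk]

        new_max = max(running_max, max(scores))
        # Correction factor for everything accumulated under the old max
        # (exp(-inf) is 0.0, so the first block starts from zero).
        correction = math.exp(running_max - new_max)
        probs = [math.exp(s - new_max) for s in scores]

        running_sum = running_sum * correction + sum(probs)
        acc = [a * correction + sum(p * v[j] for p, v in zip(probs, v_blk))
               for j, a in enumerate(acc)]
        running_max = new_max

    return [a / running_sum for a in acc]
```

Both functions should agree up to floating-point error for any block size; real implementations apply the same rescaling to tiles of queries and keys held in fast on-chip memory, which this sketch does not model.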

45
ARTICLE · ↑ trending · Reddit r/MachineLearning · 4/12/2026

Educational PyTorch repo for distributed training from scratch: DP, FSDP, TP, FSDP+TP, and PP [P]

This educational PyTorch repository implements distributed training parallelism techniques from scratch: DP, FSDP, TP, FSDP+TP, and PP. The forward/backward logic and collectives are written out explicitly, so readers can follow the algorithms and communication patterns directly rather than through high-level abstractions.
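To make the simplest of these patterns concrete, here is a minimal single-process simulation of data parallelism (DP): each replica computes a gradient on its own data shard, an all-reduce averages the gradients, and every replica applies the same update. It is plain Python, not code from the linked repository; the one-parameter model `y = w * x` and the helper names `local_gradient`, `all_reduce_mean`, and `data_parallel_step` are hypothetical.

```python
def local_gradient(w, shard):
    """Gradient of mean squared error for y = w * x on one replica's shard."""
    return sum(2.0 * (w * x - y) * x for x, y in shard) / len(shard)


def all_reduce_mean(values):
    """Simulated collective: every replica receives the mean of all inputs."""
    mean = sum(values) / len(values)
    return [mean] * len(values)


def data_parallel_step(replica_weights, shards, lr):
    """One DP step: local backward pass, gradient all-reduce, identical update."""
    grads = [local_gradient(w, s) for w, s in zip(replica_weights, shards)]
    synced = all_reduce_mean(grads)
    return [w - lr * g for w, g in zip(replica_weights, synced)]


def train(shards, steps=200, lr=0.01):
    """Run several DP steps from identical initial weights on every replica."""
    weights = [0.0] * len(shards)
    for _ in range(steps):
        weights = data_parallel_step(weights, shards, lr)
    return weights
```

With equally sized shards, the averaged gradient equals the full-batch gradient, so the replicas stay in sync and follow the same trajectory as single-process training. In a real multi-process setup the averaging would be done by a communication collective rather than a local function.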

43