Knowledge Distillation

11 items

ARTICLE · DEV.to AI · 2d ago

Cross-Modal Knowledge Distillation for satellite anomaly response operations across multilingual stakeholder groups

The author realized that Cross-Modal Knowledge Distillation (CMKD) could bridge communication gaps between technical teams, operations centers, and insurance stakeholders during satellite anomaly responses. The approach helps translate technical jargon into information that the multilingual groups involved in critical operations can understand.

AI applications, Knowledge Distillation, Multilingual Communication, Satellite Operations

ARTICLE · DEV.to AI · 4d ago

Cross-Modal Knowledge Distillation for smart agriculture microgrid orchestration in carbon-negative infrastructure

The author encountered challenges building a multi-agent AI system for a carbon-negative smart agriculture microgrid due to conflicting data across different modalities. This led to the realization that cross-modal alignment, rather than individual agent intelligence, was the key problem for orchestrating the system effectively.

agriculture, Knowledge Distillation, microgrids, sustainability

RESEARCH · DEV.to AI · 4/10/2026

Cross-Modal Knowledge Distillation for planetary geology survey missions with ethical auditability baked in

The text recounts the author's research journey into cross-modal knowledge distillation with ethical auditability, prompted by the observation that mineral-classification AIs can make decisions that are technically correct but ethically naive. The goal is to develop AI systems that are both accurate and ethically robust for planetary geology survey missions.

Knowledge Distillation, Autonomous systems, machine learning, Planetary Geology

RESEARCH · arXiv CS.LG · 4/8/2026

Prune-Quantize-Distill: An Ordered Pipeline for Efficient Neural Network Compression

This paper proposes an ordered pipeline (pruning, INT8 quantization, and knowledge distillation) for neural network compression, targeting measured inference latency rather than proxy metrics. The study finds that INT8 quantization delivers the main runtime benefit, while pruning acts as a preconditioner and knowledge distillation recovers accuracy.

Pruning, Knowledge Distillation, model efficiency, Neural Network Compression

RESEARCH · arXiv CS.CL · 27d ago

ReAD: Reinforcement-Guided Capability Distillation for Large Language Models

ReAD proposes a Reinforcement-guided Capability Distillation framework for Large Language Models, aiming to compress LLMs while preserving essential abilities for downstream tasks. It explicitly accounts for the interdependence of capabilities, optimizing token budget usage and mitigating degradation of useful abilities.

Model Compression, Knowledge Distillation, LLMs, reinforcement learning

ARTICLE · DEV.to AI · 27d ago

Needle: Distilling Gemini Tool Calling into a 26M Model

Needle is a 26-million-parameter model distilled from Gemini's tool-calling capabilities, reported to achieve near-parity accuracy at a fraction of the compute cost. The result is relevant to developers building AI agents and edge deployments.

AI models, Knowledge Distillation, tool-calling, efficiency

RESEARCH · arXiv CS.CL · 4/13/2026

WAND: Windowed Attention and Knowledge Distillation for Efficient Autoregressive Text-to-Speech Models

WAND introduces a framework that adapts pretrained autoregressive text-to-speech (AR-TTS) models to run with constant computational and memory complexity. It does this by splitting attention into global and local sliding-window mechanisms, applying curriculum learning, and using knowledge distillation to maintain high-fidelity speech synthesis while significantly reducing KV cache memory.

Knowledge Distillation, Autoregressive Text-to-Speech, Attention Mechanism, Computational Efficiency

RESEARCH · arXiv CS.LG · 5/7/2026

Continual Distillation of Teachers from Different Domains

This research introduces Continual Distillation (CD), a new paradigm in which a student model learns sequentially from a stream of teacher models without retaining access to earlier teachers. It addresses the challenges of unseen knowledge transfer (UKT) and unseen knowledge forgetting (UKF) through Self External Data Distillation (SE2D), which uses external unlabeled data to stabilize learning across heterogeneous teachers.

Knowledge Distillation, deep learning, learning, Continual Learning

RESEARCH · arXiv CS.CL · 15d ago

Knowledge Distillation for Low-Resource Open-source Text-to-SQL Model

This paper proposes a knowledge-aware Text-to-SQL framework to convert natural language questions into executable SQL queries, even in low-resource settings. It addresses challenges like scarce annotated data and opaque schema definitions by injecting task-specific knowledge into both training and inference.

Knowledge Distillation, Text-to-SQL, Low-Resource AI, Natural Language Processing

RESEARCH · arXiv CS.CL · 4/6/2026

Reinforcement Learning-based Knowledge Distillation with LLM-as-a-Judge

This paper proposes a Reinforcement Learning (RL) framework that uses an LLM as a judge to generate rewards, enabling knowledge distillation without ground-truth labels. The approach shows substantial performance gains on mathematical reasoning benchmarks, suggesting that LLM-based evaluators can produce effective training signals.

language models, Unlabeled Data, Knowledge Distillation, Math Reasoning

ARTICLE · DEV.to AI · 4/26/2026

Cross-Modal Knowledge Distillation for deep-sea exploration habitat design under multi-jurisdictional compliance

This article explores applying Cross-Modal Knowledge Distillation (CMKD) to the design of deep-sea exploration habitats. The author posits that CMKD can integrate chaotic, multi-source data to meet complex environmental, structural, and legal compliance requirements across multiple jurisdictions.

multimodal AI, Knowledge Distillation, deep learning, Deep-sea exploration