LoRA

21 items

ARTICLE↑ trendingReddit r/MachineLearning·4/18/2026

Trials and tribulations fine-tuning & deploying Gemma-4 [P]

An ML team documented the technical challenges faced while fine-tuning and deploying Gemma-4. Key issues included PEFT's incompatibility with Gemma 4's custom layers, SFTTrainer silently breaking KV-sharing attention, and DeepSpeed ZeRO-3 saving half-empty LoRA adapters.

MLOps Gemma 4 Fine-tuning LoRA

ARTICLE↑ trendingReddit r/MachineLearning·4/15/2026

[P] Added 8 Indian languages to Chatterbox TTS via LoRA — 1.4% of parameters, no phoneme engineering [P]

A project successfully added eight Indian languages (Telugu, Kannada, Bengali, Tamil, Malayalam, Marathi, Gujarati, and Hindi) to the Chatterbox-Multilingual TTS model using LoRA adapters and tokenizer extension. This approach trained only 1.4% of the model's parameters, avoiding the complex phoneme engineering typically required for each language.

Multilingual AI Chatterbox TTS LoRA

ARTICLE↑ trendingReddit r/LocalLLaMA·4/10/2026

[Model Release] I trained a 9B model to be agentic Data Analyst (Qwen3.5-9B + LoRA). Base model failed 100%, this LoRA completes 89% of workflows without human intervention.

Um desenvolvedor treinou um modelo Qwen3.5-9B com LoRA para atuar como analista de dados agente, focando em autonomia através de pesos. O modelo alcançou 89% de conclusão de fluxos de trabalho de ponta a ponta sem intervenção humana, superando a falha total do modelo base.

Data Analysis Agentic AI Fine-tuning LoRA

RESEARCHarXiv CS.LG·4/20/2026

Aletheia: Gradient-Guided Layer Selection for Efficient LoRA Fine-Tuning Across Architectures

Aletheia introduces a gradient-guided layer selection method for LoRA fine-tuning, identifying the most task-relevant layers and applying adapters selectively with asymmetric rank. This approach achieves a significant 15-28% training speedup across diverse large language models and architectures while broadly matching downstream behavior.

Parameter-efficient fine-tuning efficiency large language models Fine-tuning

RESEARCHarXiv CS.LG·4/9/2026

FLeX: Fourier-based Low-rank EXpansion for multilingual transfer

Este artigo investiga a geração de código cross-lingual, focando em métodos de fine-tuning paramétrico-eficiente (PEFT) e otimizadores para LLMs. Os autores demonstram que o fine-tuning LoRA no Code Llama 7B, com um dataset pequeno de alta qualidade, pode superar o desempenho de modelos mais amplamente fine-tuned, e que otimizadores como Sophia oferecem convergência mais rápida com resultados finais comparáveis.

Cross-lingual code generation PEFT LoRA LLM Fine-tuning

RESEARCHarXiv CS.CL·4d ago

PEFT of SLM for Telecommunications Customer Support: A Comparative Study of LoRA Configurations with Energy Consumption Analysis

This study systematically applies parameter-efficient fine-tuning (PEFT) using Low-Rank Adaptation (LoRA) to Qwen2.5-3B for a telecommunications customer support conversational assistant. It evaluates 16 LoRA configurations, varying hyperparameters and target modules, using a combinatorial synthetic data generation approach.

Telecommunications LLMs customer support PEFT

DOCDEV.to AI·16d ago

96. LoRA: Fine-Tune a Billion-Parameter Model on a Laptop

This article explains how the LoRA (Low-Rank Adaptation) technique enables fine-tuning billion-parameter language models on consumer hardware like laptops. Instead of updating all parameters, LoRA adds tiny trainable modules, drastically reducing GPU memory requirements.

GPU memory Fine-tuning LoRA HuggingFace

ARTICLEDEV.to AI·4/22/2026

Why LoRA? Understanding the representative PEFT

LoRA (Low-Rank Adaptation) is introduced as the leading PEFT method, enabling efficient adaptation of massive LLMs like Llama 3 without requiring extensive hardware resources. The post promises to delve into LoRA's mathematical intuition, the concept of "intrinsic dimension," and its game-changing impact for AI engineers.

LLMs deep learning Fine-tuning PEFT

RESEARCHarXiv CS.LG·4/9/2026

TalkLoRA: Communication-Aware Mixture of Low-Rank Adaptation for Large Language Models

TalkLoRA propõe um framework MoELoRA que aborda a instabilidade de roteamento e a dominância de especialistas em métodos existentes, permitindo a comunicação entre especialistas antes do roteamento. Isso é feito através de um Módulo de Conversação leve, que facilita a troca de informações, gerando um sinal de roteamento mais robusto para Large Language Models (LLMs).

LLMs MoE Communication Fine-tuning

RESEARCHarXiv CS.LG·4/21/2026

Annotation Entropy Predicts Per-Example Learning Dynamics in LoRA Fine-Tuning

This research discovers that LoRA fine-tuning leads to 'un-learning' on contested examples, where high annotator disagreement correlates with increased loss during training. This pattern is distinct from full fine-tuning and consistently observed across multiple encoder and decoder-only models and datasets.

model training machine learning NLP Fine-tuning

RESEARCHarXiv CS.LG·6d ago

ReLoRA: Knowledge-Reusing Adaptation for Fast Rollout of Evolving LLM Services

This paper introduces ReLoRA, a knowledge-reusing re-adaptation framework that efficiently restores service-ready LoRA adapters for evolving LLM services. It addresses the computational cost of retraining and quality degradation from naive application to updated base models.

AI models machine learning Fine-tuning LoRA

RESEARCHarXiv CS.LG·20d ago

HELLoRA: Hot Experts Layer-Level Low-Rank Adaptation for Mixture-of-Experts Models

HELLoRA proposes a novel method for fine-tuning Mixture-of-Experts (MoE) models by applying Low-Rank Adaptation (LoRA) modules only to the most frequently activated experts at each layer. This technique significantly reduces trainable parameters and improves downstream performance, attributing its success to structured regularization that maintains expert specialization.

LLMs MoE AI Fine-tuning

RESEARCHarXiv CS.CL·4/27/2026

Where Should LoRA Go? Component-Type Placement in Hybrid Language Models

This research systematically investigates LoRA placement in hybrid language models, which combine attention and recurrent components. It finds that adapting the attention pathway consistently outperforms full-model adaptation with significantly fewer parameters, while the effect of adapting the recurrent backbone varies drastically depending on the hybrid architecture (sequential vs. parallel).

hybrid language models model adaptation attention mechanisms Recurrent Neural Networks

RESEARCHarXiv CS.LG·4/21/2026

Matched-Learning-Rate Analysis of Attention Drift and Transfer Retention in Fine-Tuned CLIP

This paper investigates how adaptation methods (Full FT vs. LoRA) and optimization scale jointly shape attention drift and transfer retention in fine-tuned CLIP models. A controlled matched-learning-rate comparison reveals that the learning rate strongly modulates structural change, with Full FT showing marked contraction at higher rates while LoRA remains entropy-positive.

CLIP Optimization attention Fine-tuning

RESEARCHarXiv CS.LG·28d ago

BaLoRA: Bayesian Low-Rank Adaptation of Large Scale Models

BaLoRA is a Bayesian extension of LoRA that enhances the accuracy of large-scale model adaptation. This novel approach not only quantifies uncertainty but also significantly narrows the performance gap with full fine-tuning.

Bayesian Methods machine learning large language models Fine-tuning

RESEARCHarXiv CS.CL·27d ago

Decomposing Evolutionary Mixture-of-LoRA Architectures: The Routing Lever, the Lifecycle Penalty, and a Substrate-Conditional Boundary

This paper decomposes an evolutionary Mixture-of-LoRA system, examining factors such as router rewrite, per-domain evaluation, and an adaptation lifecycle. Results indicate that the router rewrite is solely responsible for the balanced log-PPL improvement observed.

neural networks machine learning large language models LoRA

DOCHugging Face Blog·22d ago

Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation

This content details the fine-tuning process of the NVIDIA Cosmos Predict 2.5 model. It leverages LoRA/DoRA techniques for robot video generation applications.

NVIDIA Cosmos Predict 2.5 DoRA Robot Video Generation Fine-tuning

DOCDEV.to AI·4/25/2026

IP-Adapter + LoRA for product catalog rendering — putting shop items on AI characters

This content presents a runnable ComfyUI workflow for rendering AI characters with shop items, combining LoRA for character stability and IP-Adapter for reference image features. It details how to balance these techniques, recommending moderate IP-Adapter weight and early handoff to avoid distorting the character's face.

IP-Adapter image generation LoRA Generative AI

RESEARCHarXiv CS.CL·5/6/2026

Sparse Memory Finetuning as a Low-Forgetting Alternative to LoRA and Full Finetuning

Sparse Memory Finetuning (SMF) addresses catastrophic forgetting in pretrained language models by updating only a small subset of memory rows. Experiments show SMF improves performance on a medical exam task while substantially mitigating forgetting compared to LoRA and full finetuning.

Finetuning language models Sparse Memory Finetuning Catastrophic Forgetting

ARTICLEDEV.to AI·5/5/2026

[Day 2] I Trained an AI on 22 Photos of My Cat — Now It Draws Her in Any Scene

The author trained an AI model using 22 photos of their cat to enable it to generate images of the pet in various scenes, employing the LoRA technique. This article details the second day of the experiment, focusing on photo preparation and selection criteria to teach the AI the cat's distinctive features.

AI training personal-project image generation LoRA