← heapsort-ai

LLMs

722 items

RESEARCHarXiv CS.CL·27d ago

Domain Adaptation of Large Language Models for Polymer-Composite Additive Manufacturing Using Retrieval-Augmented Generation and Fine-Tuning

This study explores strategies for adapting general-purpose large language models (LLMs) to specialized engineering domains, specifically additive manufacturing, to enhance answer accuracy and relevance. It investigates the use of domain-specific fine-tuning and retrieval-augmented generation (RAG) by constructing a curated corpus for evaluation.

27
RESEARCHarXiv CS.LG·23d ago

Quantization Undoes Alignment: Bias Emergence in Compressed LLMs Across Models and Precision Levels

This study investigates the impact of post-training quantization on Large Language Models (LLMs) quality, revealing that compression can lead to bias emergence. 3-bit quantization caused 6-21% of previously unbiased items to develop new stereotypical behaviors in models like Qwen2.5-7B, Mistral-7B, and Phi-3.5-mini. This follows a clear dose-response pattern across various precision levels.

27
RESEARCHarXiv CS.CL·21d ago

Diagnosing Multi-step Reasoning Failures in Black-box LLMs via Stepwise Confidence Attribution

This paper introduces Stepwise Confidence Attribution (SCA), a framework for closed-source LLMs that diagnoses multi-step reasoning failures by assigning step-level confidence. SCA applies the Information Bottleneck principle, flagging deviations from consensus structures as potential errors, and proposes two complementary methods: NIBS and GIBS.

27
RESEARCHarXiv CS.LG·12d ago

Mechanistic origins of catastrophic forgetting: why RL preserves circuits better than SFT?

This paper investigates the mechanistic origins of catastrophic forgetting in Large Language Models (LLMs), comparing Reinforcement Learning (RL) with Supervised Fine-Tuning (SFT). It reveals that RL preserves internal computational circuits more effectively, mitigating the forgetting of prior capabilities, unlike SFT which causes greater circuit disruption.

27
RESEARCHarXiv CS.CL·9d ago

Can LLM Teams Play What? Where? When?

This research explores how team-based interactions improve Large Language Model (LLM) performance on complex reasoning tasks, specifically in the quiz game What? Where? When?. It demonstrates that team strategies yield significant accuracy gains, with the best teams approaching human performance.

27