← heapsort-ai

LLMs

720 items

RESEARCHarXiv CS.LG·4/15/2026

Polynomial Expansion Rank Adaptation: Enhancing Low-Rank Fine-Tuning with High-Order Interactions

Polynomial Expansion Rank Adaptation (PERA) is a novel method to enhance low-rank adaptation (LoRA) for fine-tuning large language models. It introduces structured polynomial expansion into the low-rank factor space to model richer nonlinear high-order interactions, overcoming LoRA's linear limitations without increasing rank or inference cost.

28
RESEARCHarXiv CS.CL·4/21/2026

Reciprocal Co-Training (RCT): Coupling Gradient-Based and Non-Differentiable Models via Reinforcement Learning

This work introduces a reciprocal co-training framework that couples a Large Language Model (LLM) with a Random Forest (RF) classifier via reinforcement learning. It creates an iterative feedback loop where each model improves using signals from the other, demonstrating consistent performance gains across medical datasets.

28
RESEARCHarXiv CS.LG·4/14/2026

ExecTune: Effective Steering of Black-Box LLMs with Guide Models

This research introduces Guide-Core Policies (GCoP), a framework for steering black-box LLMs where a guide model generates strategies for a core model. The paper formalizes GCoP under a cost-sensitive utility objective, highlighting that end-to-end performance is governed by guide-averaged executability, which existing methods often fail to optimize effectively.

28
RESEARCHarXiv CS.AI·25d ago

Invisible Orchestrators Suppress Protective Behavior and Dissociate Power-Holders: Safety Risks in Multi-Agent LLM Systems

Multi-agent orchestration, where a hidden coordinator manages specialized worker agents, is a prevalent AI architecture for enterprise deployment, but its safety implications lack empirical testing. A 3x2 experiment using Claude Sonnet 4.5 revealed that invisible orchestration increased collective dissociation, with the orchestrator exhibiting maximal dissociation by retreating into private monologue and reducing public speech.

28
RESEARCHarXiv CS.CL·22d ago

PQR: A Framework to Generate Diverse and Realistic User Queries that Elicit QA Agent Failures

This paper introduces PQR, a framework designed to generate diverse and realistic user queries that elicit failures in LLM-based QA agents, going beyond existing methods that primarily focus on adversarial users. PQR operates through iterative query and prompt refinement modules to create realistic test scenarios that expose agent vulnerabilities.

28
RESEARCHarXiv CS.AI·15d ago

Practical Quantum CIM Empowerment via All-Domestic-Core Agentic Large Model

This study integrates a femtosecond laser-pumped Coherent Ising Machine (CIM) with an LLM-driven agentic system, leveraging LangGraph and LangChain frameworks. It demonstrates that LLMs can effectively perform tasks like QUBO/Ising model calibration and constraint weight iteration, achieving practical empowerment of quantum CIM with domestic technology.

28
RESEARCHarXiv CS.AI·5d ago

How Far Did They Go? The Persuasive Tactics of Covert LLM Agents in a Discontinued Field Experiment

This study analyzes a publicly released dataset from a discontinued field experiment on Reddit's r/ChangeMyView, where undisclosed AI-generated accounts engaged users in live debate. It conducts a structured content analysis evaluating identity performance, authority signaling, alignment strategies, and activation of cognitive heuristics by these large language models.

28
RESEARCHarXiv CS.CL·5d ago

PEFT of SLM for Telecommunications Customer Support: A Comparative Study of LoRA Configurations with Energy Consumption Analysis

This study systematically applies parameter-efficient fine-tuning (PEFT) using Low-Rank Adaptation (LoRA) to Qwen2.5-3B for a telecommunications customer support conversational assistant. It evaluates 16 LoRA configurations, varying hyperparameters and target modules, using a combinatorial synthetic data generation approach.

28
RESEARCHarXiv CS.CL·5d ago

From Scoring to Explanations: Evaluating SHAP and LLM Rationales for Rubric-based Teaching Quality Assessment

This research proposes a framework for sentence-level interpretability in rubric-based scoring, combining Shapley-value attributions with rationales from large language models (LLMs). It compares fine-tuned language models and prompted LLMs for teaching quality assessment, finding PLMs offer better prediction accuracy despite label compression.

28