← heapsort-ai

LLMs

715 items

RESEARCHarXiv CS.CL·4/13/2026

Temperature-Dependent Performance of Prompting Strategies in Extended Reasoning Large Language Models

This study evaluates the performance of prompting strategies (chain-of-thought and zero-shot) in extended reasoning LLMs like Grok-4.1, varying the sampling temperature across 39 challenging mathematical problems. It found that zero-shot prompting peaks at moderate temperatures, while chain-of-thought performs best at temperature extremes, significantly increasing the benefit of extended reasoning.

30
RESEARCHarXiv CS.CL·4/16/2026

Mathematical Reasoning Enhanced LLM for Formula Derivation: A Case Study on Fiber NLI Modellin

This research introduces a mathematical reasoning-enhanced generative AI approach for deriving optical communication formulas, specifically for fiber nonlinear interference modelling. By guiding an LLM with structured prompts, the study successfully reconstructed known expressions and derived a novel approximation, demonstrating both physical consistency and practical accuracy.

29
RESEARCHarXiv CS.CL·13d ago

Pretraining Data Exposure in Large Language Models: A Survey of Membership Inference, Data Contamination, and Security Implications

This paper offers the first unified survey of Pretraining Data Exposure (PDE) in Large Language Models (LLMs), covering data contamination and membership inference. It formalizes PDE, reviews attack and defense methods, and highlights open challenges to ensure evaluation integrity and protect privacy.

29
ARTICLEDEV.to AI·4/21/2026

How we handle LLM context window limits without losing conversation quality

This article addresses the critical challenge of LLM context window limits, which causes chatbots to forget information and agents to lose track of goals, despite models offering larger windows. It highlights that simply expanding context windows is insufficient due to prohibitive costs and increased latency, promising to share production strategies and trade-offs.

29
RESEARCHarXiv CS.AI·5d ago

StepPRM-RTL: Stepwise Process-Reward Guided LLM Fine-Tuning for Enhanced RTL Synthesis

StepPRM-RTL is a novel framework that enhances LLM-based RTL code generation by combining stepwise trajectory modeling, process-reward modeling (PRM), and retrieval-augmented fine-tuning (RAFT). It uses dense feedback from a PRM to guide reinforcement-style updates and Monte Carlo Tree Search (MCTS) to enrich the training dataset.

29
ARTICLEDEV.to AI·4d ago

<think>

This article delves into cost-effective alternatives to GPT-4o, revealing how other AI models can offer significant savings for developers. It provides direct cost comparisons, highlighting options like DeepSeek V4 Flash and Qwen3-32B.

29
DOCML Mastery·5d ago

Using Scikit-LLM with Open-Source LLMs

This article provides a tutorial on integrating locally hosted open-source large language models such as Mistral, Gemma, and Llama 3 for language tasks like text classification. It demonstrates how to achieve this for free using Ollama and the Scikit-LLM Python library.

Using Scikit-LLM with Open-Source LLMs
29