AI Research

146 items

RESEARCH · DEV.to AI · 10h ago

Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators

This content explores the critical role of pairwise preference in evaluating Large Language Models (LLMs). It discusses how this method can help align LLM performance more effectively with human judgment.

Human Alignment Pairwise Preference natural language processing AI Research

RESEARCH · arXiv CS.AI · 19h ago

Why Limit the Residual Stream to Layers and Not Tokens? Persistent Memory for Continuous Latent Reasoning

Large language models (LLMs) face a limitation called the 'concept bottleneck,' where they lose critical facts in deep latent reasoning. This paper proposes AGCLR (Adaptive Gated Continuous Latent Reasoning) to address this by augmenting CoCoNuT with a Gated Concept Stream for persistent memory.

machine learning Latent Reasoning Reasoning AI Research

RESEARCH · arXiv CS.CL · 19h ago

Bidirectional Small-Granularity Search between Code and Text

This research introduces a novel task: bidirectional small-granularity search between code and text, aiming to link scientific publications with corresponding code segments. It proposes a large dataset, partially generated by GPT-4, and a modular approach that achieves good in-domain results.

machine learning natural language processing Code Analysis Information Retrieval

RESEARCH · arXiv CS.CL · 19h ago

GraphLoRA: Structure-Aware Low-Rank Adaptation for Large Language Model Recommendation

GraphLoRA proposes a novel framework for Large Language Model Recommendation (LLMRec) that integrates structural information with textual semantics. It achieves this by embedding a trainable graph message-passing network within the low-rank adaptation pathway, allowing collaborative topology to explicitly guide parameter updates.

Low-Rank Adaptation Graph Neural Networks Recommendation Systems AI Research

ARTICLE · ↑ trending · Hacker News (AI) · 4d ago

Sakana AI's Recursive Self-Improvement (RSI) Lab

Sakana AI has launched its Recursive Self-Improvement (RSI) Lab, aiming to develop AI models capable of enhancing their own performance. This initiative focuses on foundational research to create more robust and adaptable AI systems.

Self-improvement AI Sakana AI machine learning AI development

ARTICLE · ↑ trending · Hacker News (AI) · 4d ago

Ask HN: AI researchers – what's a recent paper that recently blew your mind?

A Hacker News user asks AI researchers to share recent machine learning papers that have impressed them. The goal is to find new and exciting publications in the ML space for those actively seeking new developments.

Academic Papers Research Recommendations machine learning AI

RESEARCH · ↑ trending · Reddit r/MachineLearning · 27d ago

Learning, Fast and Slow: Towards LLMs That Adapt Continually [R]

Large language models (LLMs) face catastrophic forgetting and plasticity loss when updating parameters for downstream tasks. This work introduces a fast-slow learning framework for LLMs, utilizing model parameters as "slow" weights and optimized context as "fast" weights to adapt efficiently without compromising general reasoning.

LLMs learning Catastrophic Forgetting AI Research

ARTICLE · ↑ trending · Reddit r/MachineLearning · 4/20/2026

SGOCR: A Spatially-Grounded OCR-focused Pipeline & V1 Dataset [P]

An independent researcher created SGOCR, an open-source dataset pipeline for spatially-grounded, OCR-focused VQA, to fill a gap in visual datasets for text grounding in imagery. This pipeline generates VQA tuples with rich metadata, supporting diverse VLM training strategies.

Open Source Vision-Language Models datasets OCR

ARTICLE · ↑ trending · Reddit r/MachineLearning · 4/12/2026

LLMs learn backwards, and the scaling hypothesis is bounded. [D]

This post argues that Large Language Models (LLMs) learn backwards and that the scaling hypothesis is bounded.

LLMs deep learning scaling hypothesis language models

RESEARCH · ↑ trending · Reddit r/MachineLearning · 26d ago

Follow the Mean: Reference-Guided Flow Matching [R]

This post shares the research paper "Follow the Mean: Reference-Guided Flow Matching", which proposes a reference-guided approach to flow matching for generative models.

deep learning generative models machine learning Flow Matching

ARTICLE · ↑ trending · Reddit r/MachineLearning · 4/30/2026

Seems ICML is rejecting MANY unanimous positively rated papers [D]

The post describes a perceived flaw in the ICML review process: reviewers feel pressured to homogenize scores to avoid prolonged discussions, which may lead to positively rated papers being rejected. It also notes that reviewers are reluctant to update scores even after their concerns are addressed, which distorts the review dynamics.

Peer review academic conference AI Research

RESEARCH · ↑ trending · Reddit r/MachineLearning · 19d ago

Do VLMs in production still use fixed-patch ViTs for their vision capabilities? [D]

This discussion questions whether production Vision-Language Models (VLMs) still rely on fixed-patch Vision Transformers (ViTs) for their vision capabilities, despite the existence of more efficient tokenization methods. It explores potential reasons for this, such as marginal gains, pipeline limitations, or unclear scaling laws for adaptive patching.

VLMs deep learning Vision Transformers Tokenization

ARTICLE · ↑ trending · Reddit r/MachineLearning · 4/20/2026

Are we optimizing AI research for acceptance rather than lasting value? [D]

The title questions whether AI research is being optimized for immediate acceptance rather than lasting value. This prompts a critical discussion about the direction and priorities of innovation in artificial intelligence.

Innovation Research methodology long-term value AI Research

NEWS · ↑ trending · Reddit r/MachineLearning · 4/19/2026

KDD 2026 Cycle 2 reviews seem to have vanished from author view [D]

A KDD 2026 submitter noticed that their paper's reviews and discussions have disappeared from their author view, while discussions for other papers are still visible in their reviewer view. They are inquiring if other users are experiencing the same technical glitch with the review platform.

KDD Peer review academic conference AI Research

RESEARCH · arXiv CS.CL · 1d ago

The Piggyback Hypothesis of Generalization: Explaining and Mitigating Emergent Misalignment

The Piggyback Hypothesis explains how chat-template tokens can cause emergent misalignment in LLMs by generalizing finetuned behavior to out-of-domain queries. Token-Regularized Finetuning (TReFT) is proposed to mitigate this issue, preserving in-domain learning while reducing misalignment across models and datasets.

Finetuning Emergent Misalignment LLMs Generalization

ARTICLE · ↑ trending · Reddit r/MachineLearning · 26d ago

Would a 2000-2021 ML paper even get accepted today? [D]

The content discusses whether machine learning papers accepted between 2000 and 2021 would still get approved today, suggesting that the bar for publication has significantly risen. There's a debate on whether research standards have genuinely increased or if the field has simply become more crowded and competitive.

machine learning competition Peer review academic research

ARTICLE · ↑ trending · Reddit r/MachineLearning · 4/18/2026

ICML 2026 - Heavy score variance among various batches? [D]

A Reddit post discusses significant score variance among paper batches for ICML 2026, with some batches having few high scores while others report higher averages. The user questions the reasons for this disparity, such as domain differences or harsher reviewers, and whether ICML accounts for it.

academic conferences Peer review AI Research

ARTICLE · ↑ trending · Reddit r/MachineLearning · 4/27/2026

What do reviewers actually mean when they say the paper sound more like a technical report? [D]

An author's paper was rejected from a workshop for sounding more like a technical report than a research paper, despite following the usual computer vision format. They are seeking community opinion to understand common faux pas that lead to such an assessment.

academic publishing computer vision Peer review AI Research

ARTICLE · ↑ trending · Reddit r/MachineLearning · 4/19/2026

What are the future prospects of Spiking Neural Networks (and particularly, neuromorphics computing) and Liquid Neural Networks? [D]

An undergraduate student asks about the future prospects and mainstream adoption of Spiking Neural Networks and Liquid Neural Networks, wondering if they are promising areas for learning and projects. The user seeks to discuss the potential of these neuromorphic computing technologies.

Spiking Neural Networks deep learning Liquid Neural Networks Neuromorphic Computing

NEWS · ↑ trending · Reddit r/MachineLearning · 4/23/2026

UAI 2026 Reviews Waiting Place [D]

This is a place for UAI 2026 participants to share their reactions, whether rants or relief, once the conference reviews are released, which is expected soon. Good luck to everyone with their results.

conferences Peer review AI Research