scientific research

12 items

RESEARCH · arXiv CS.AI · 23h ago

A case study of evaluating AI agents on a neuroscience data-to-discovery pipeline

This research empirically evaluates general-purpose AI coding agents on a neuroscience data-to-discovery pipeline, assessing their ability to automate complex scientific tasks. It finds agents can solve individual pipeline stages but struggle with scientific judgment in the absence of predefined iteration criteria.

Benchmarking Neuroscience automation AI agents

RESEARCH · trending · Reddit r/LocalLLaMA · 25d ago

internlm/Intern-S2-Preview · Hugging Face

Intern-S2-Preview is an efficient 35B scientific multimodal foundation model that achieves performance comparable to trillion-scale models by exploring task scaling and full-chain training. It excels in hundreds of professional scientific tasks while maintaining strong general reasoning, multimodal understanding, and agent capabilities.

AI models multimodal AI model training Foundation Models

RESEARCH · arXiv CS.AI · 4/7/2026

Toward Full Autonomous Laboratory Instrumentation Control with Large Language Models

This work explores the potential of large language models (LLMs), such as ChatGPT, and AI agents for automating and controlling laboratory instrumentation. It shows how these tools lower programming barriers and can evolve into autonomous agents capable of operating scientific equipment and refining control strategies.

LLMs ChatGPT Instrumentation Control large language models

ARTICLE · DEV.to AI · 19d ago

Towards an AI co-scientist

This article explores how artificial intelligence is evolving into an "AI co-scientist" that assists researchers at various stages of the scientific process. It discusses AI's potential to accelerate discoveries and transform research methodology.

future-of-AI Scientific Discovery human-AI collaboration AI in science

RESEARCH · arXiv CS.AI · 4/15/2026

GoodPoint: Learning Constructive Scientific Paper Feedback from Author Responses

This research introduces GoodPoint, a method leveraging LLMs and author responses to generate constructive feedback for scientific papers. It develops GoodPoint-ICLR, a dataset of ICLR papers, and a training recipe using fine-tuning and preference optimization for valid and actionable feedback.

LLMs Feedback Generation machine learning NLP

RESEARCH · arXiv CS.AI · 4/16/2026

SciFi: A Safe, Lightweight, User-Friendly, and Fully Autonomous Agentic AI Workflow for Scientific Applications

This work introduces SciFi, a safe, lightweight, and user-friendly agentic framework for the autonomous execution of scientific tasks. It combines an isolated environment, a three-layer agent loop, and a self-assessing mechanism to ensure reliable operation, leveraging LLMs to automate routine scientific workloads and free researchers for creative activities.

LLMs Workflow Agentic AI automation

RESEARCH · arXiv CS.AI · 4/22/2026

AI scientists produce results without reasoning scientifically

LLM-based systems conduct autonomous scientific research but often fail to adhere to epistemic norms, ignoring evidence in 68% of traces. A study across eight domains and over 25,000 runs found that base models primarily determine agent performance and behavior.

LLMs AI Reasoning AI agents scientific research

NEWS · Google DeepMind Blog · 4/27/2026

Announcing our partnership with the Republic of Korea

Google DeepMind and the Republic of Korea announce a partnership to accelerate scientific breakthroughs. The collaboration aims to use frontier AI models to drive significant advancements.

deep learning government-collaboration Partnerships artificial intelligence

DOC · DEV.to AI · 20d ago

35 ChatGPT Prompts for Environmental Scientists: Accelerate Research, Reporting, and Stakeholder Communication

This guide provides 35 ChatGPT prompts designed to help environmental scientists streamline their research, reporting, and stakeholder communication. It aims to reduce the time spent on documentation tasks so that they can focus on the science itself.

environmental science ChatGPT prompts workflow optimization

RESEARCH · arXiv CS.LG · 5/4/2026

Human-in-the-Loop Meta Bayesian Optimization for Fusion Energy and Scientific Applications

This paper introduces Human-in-the-Loop Meta Bayesian Optimization (HL-MBO), a framework combining expert knowledge with few-shot machine learning to accelerate discovery in data-scarce scientific domains. It outperforms current Bayesian Optimization methods in fusion energy yield optimization and other benchmarks.

Bayesian Optimization machine learning Fusion Energy scientific research

RESEARCH · arXiv CS.LG · 17d ago

Teaching Language Models to Forecast Research Success Through Comparative Idea Evaluation

This paper explores training language models to forecast the empirical success of research ideas by evaluating pairs of ideas against objective outcomes. Supervised fine-tuning (SFT) lifts performance significantly above that of GPT-5, and reinforcement learning with verifiable rewards (RLVR) can train models to discover interpretable reasoning paths for this forecasting task.

language models research evaluation machine learning AI forecasting

RESEARCH · arXiv CS.AI · 15d ago

SciAtlas: A Large-Scale Knowledge Graph for Automated Scientific Research

SciAtlas is a large-scale, multi-disciplinary knowledge graph designed to tackle the information explosion in academic output. Integrating millions of papers and billions of entities, it provides a structured network for automated scientific research and deep interdisciplinary integration.

Knowledge Graph information management research tools AI agents