← heapsort-ai

model training

16 items

RESEARCH↑ trendingReddit r/MachineLearning·4/24/2026

New project about llm hallucination [P]

This content introduces a new side project and its GitHub repository, focusing on mitigating LLM hallucination through a novel contrastive sampling and selective training method. The core idea treats hallucination as a preference problem, using self-generated negative samples and divergence-based, gated learning to push correct answers and suppress wrong ones.

New project about llm hallucination [P]
45
RESEARCH↑ trendingReddit r/LocalLLaMA·25d ago

internlm/Intern-S2-Preview · Hugging Face

Intern-S2-Preview is an efficient 35B scientific multimodal foundation model that achieves performance comparable to trillion-scale models by exploring task scaling and full-chain training. It excels in hundreds of professional scientific tasks while maintaining strong general reasoning, multimodal understanding, and agent capabilities.

internlm/Intern-S2-Preview · Hugging Face
42
RESEARCHarXiv CS.LG·20d ago

Simply Stabilizing the Loop via Fully Looped Transformer

Looped Transformers provide a way to improve model performance by iteratively reusing blocks without increasing parameter count, but they suffer from training instability at higher loop iterations. This instability is attributed to gradient oscillation and residual explosion, leading to the proposal of the Fully Looped Transformer, which introduces a Fully Looped Architecture and Attention Injection.

29
DOCAWS Machine Learning Blog·7d ago

The art and science of hyperparameter optimization on Amazon Nova Forge

This post explores the art and science of hyperparameter optimization on Amazon Nova Forge, detailing how to balance improving domain-specific performance without degrading a model's general capabilities. It covers customization strategies, configuring training parameters like learning rate and batch size, and avoiding common mistakes that lead to wasted training runs.

27