← heapsort-ai

LLMs

714 items

NEWS↑ trendingReddit r/MachineLearning·25d ago

arXiv implements 1-year ban for papers containing incontrovertible evidence of unchecked LLM-generated errors, such as hallucinated references or results. [N]

arXiv has announced a new policy imposing a 1-year ban for authors who submit papers containing incontrovertible evidence of unchecked LLM-generated errors, such as hallucinated references or results. This policy emphasizes that authors are fully responsible for all content, regardless of how it was generated by AI tools.

42
NEWS↑ trendingReddit r/LocalLLaMA·4/9/2026

Marco-Mini (17.3B, 0.86B active) and Marco-Nano (8B, 0.6B active) by Alibaba

A Alibaba lançou recentemente os modelos Marco-Mini e Marco-Nano, variantes instrucionadas de modelos de linguagem multilingues altamente esparsos baseados em Mixture-of-Experts (MoE). O Marco-Mini, com apenas 0.86B de 17.3B parâmetros ativos, destaca-se por superar outros modelos de até 12B de parâmetros ativados em benchmarks de desempenho.

42
NEWS↑ trendingReddit r/LocalLLaMA·4/27/2026

Skymizer Taiwan Inc. Unveils Breakthrough Architecture Enabling Ultra-Large LLM Inference on a Single Card

Skymizer Taiwan Inc. has unveiled a breakthrough architecture, the HTX301 card, that allows 700B-parameter LLM inference on a single PCIe card with 384GB memory and low power consumption (~240W). This approach offloads decoding to the HTX301 while GPUs handle prefill, enabling ultra-large LLM inference locally without massive GPU VRAM.

42
ARTICLE↑ trendingReddit r/MachineLearning·5/6/2026

Stop letting LLMs edit your .bib [D]

The author expresses shock at the frequent hallucinated citations by LLMs in academic papers, leading to incorrect author lists. They question the lack of respect for research and the need for harsher penalties, asking if others are experiencing the same issue.

42
ARTICLE↑ trendingReddit r/MachineLearning·27d ago

Sharing all KGC 2026 decks. More production-grade KG systems than I've seen at any conference. [D]

The Knowledge Graph Conference (KGC 2026) showcased a significant number of live production-grade Knowledge Graph systems from various enterprises, a departure from typical AI events often presenting only proofs of concept. Examples included Bloomberg's ontology governance, AbbVie's drug intelligence KG with an LLM interface, and Morgan Stanley's continuous SHACL drift detection for risk reporting.

42
CASE↑ trendingReddit r/LocalLLaMA·4/23/2026

Been using PI Coding Agent with local Qwen3.6 35b for a while now and its actually insane

The user reports an extremely positive and effective experience with the PI Coding Agent, utilizing a local Qwen3.6 35b model for production projects. Success was attributed to a custom "plan-first skill file" that enforces a structured planning workflow, ensuring step-by-step execution and plan approval before any coding.

42
ARTICLE↑ trendingReddit r/MachineLearning·4/26/2026

Going from 3B/7B dense to Nemotron 3 Nano (hybrid Mamba-MoE) for multi-task reasoning — what changes in the fine-tuning playbook? [D]

The author is transitioning from fine-tuning dense 3B/7B transformers to NVIDIA's Nemotron 3 Nano (a hybrid Mamba-Attention-MoE architecture) for multi-task reasoning. They are seeking guidance on how the hybrid architecture impacts standard LoRA fine-tuning, as their prior experience is limited to dense models.

42
ARTICLE↑ trendingReddit r/LocalLLaMA·4/16/2026

Gemma 4 31b 3D geometry

The author expresses strong satisfaction with Gemma 4's quality, highlighting its coding ability and adaptability in conversations and reasoning. A test involving 3D model generation from an F1 car image demonstrated that Gemma significantly outperformed models like Claude Sonnet, Gemini Pro, and ChatGPT, which exhibited notable flaws.

Gemma 4 31b 3D geometry
41