← heapsort-ai

LLM

609 items

ARTICLE↑ trendingReddit r/LocalLLaMA·4/13/2026

Experiment: Olmo 3 7B Instruct Q1_0

The author attempted to quantize OLMo-3 7B Instruct into a 1-bit format using quantization aware distillation, training the model for 12 hours on 4x B200 GPUs. Although the resulting model can produce basic English, it's generally unusable due to repetition loops and lack of context tracking, attributed to premature training cessation and an unsuitable dataset choice.

Experiment: Olmo 3 7B Instruct Q1_0
43
RESEARCH↑ trendingReddit r/LocalLLaMA·4/10/2026

Stanford: Self improving Meta-Harness

Meta-Harness é um novo sistema da Stanford que otimiza o "harness" de Large Language Models (LLMs), corrigindo autonomamente erros para melhorar o desempenho e reduzir o uso de contexto. Ele demonstra melhorias notáveis em classificação de texto, superando sistemas existentes e utilizando 4 vezes menos tokens.

43
NEWS↑ trendingReddit r/LocalLLaMA·4/16/2026

Qwen3.6-35B-A3B released!

The Qwen3.6-35B-A3B model has been released and open-sourced, featuring a sparse MoE architecture with 35B total parameters and 3B active, under an Apache 2.0 license. It excels in agentic coding, multimodal perception, and reasoning, touted as efficient, powerful, and versatile.

Qwen3.6-35B-A3B released!
42
RESEARCH↑ trendingReddit r/MachineLearning·4/14/2026

You can decompose models into a graph database [N]

This content introduces the LarQL project, which allows the decomposition of static LLM models into a graph database to perform KNN walks mathematically identical to matrix multiplication. This innovative approach enables updating a model's factual knowledge without retraining, simply by inserting information into the graph database, and it uses less memory.

42
RESEARCH↑ trendingReddit r/MachineLearning·4/16/2026

Training Qwen2.5-0.5B-Instruct on Reddit posts summarization tasks with length constraint on my 3xMac Minis with GRPO - evals update [P]

The author trained Qwen2.5-0.5B-Instruct for Reddit post summarization using two reward strategies, finding that a combination of quality and length penalties yielded significantly better results. Evaluation was conducted using LLM-As-A-Judge and DeepEval tools for metrics like conscientiousness and clarity.

42
NEWS↑ trendingReddit r/LocalLLaMA·5/7/2026

Qwen3.6 27B uncensored heretic v2 Native MTP Preserved is Out Now With KLD 0.0021, 6/100 Refusals and the Full 15 MTPs Preserved and Retained, Available in Safetensors, GGUFs and NVFP4s formats.

The Qwen3.6 27B uncensored heretic v2 Native MTP Preserved language model has been released, boasting a KLD of 0.0021 and only 6 refusals out of 100. It is available in various formats including Safetensors, GGUFs, and NVFP4s, with all 15 MTPs fully preserved and retained.

42