LLMs

714 items

RESEARCH↑ trendingReddit r/LocalLLaMA·4/17/2026

Qwen3.6 GGUF Benchmarks

This content presents KLD performance benchmarks for Unsloth's Qwen3.6-35B-A3B GGUF quants, highlighting their efficiency in terms of KLD versus disk space. It also clarifies that frequent GGUF updates are typically due to external bug fixes or official improvements, rather than Unsloth's internal errors.

LLMs quantization Benchmarks

DOCDEV.to AI·4/23/2026

How to Integrate Claude with n8n to Build AI Workflows

This guide details how to integrate Claude with n8n to build AI workflows that can interpret, decide, and act dynamically. The combination allows overcoming traditional automation limitations by processing unstructured inputs and generating structured outputs based on reasoning.

integration LLMs AI Workflows automation

ARTICLE↑ trendingReddit r/LocalLLaMA·5/7/2026

Need advice on hardware purchasing decision: RTX 5090 vs. M5 Max 128GB for agentic software development

The user is seeking advice on choosing between an RTX 5090 and an M5 Max 128GB for agentic software development using Qwen3.6 27B locally. The RTX 5090 offers 3x speed, while the M5 Max provides 4x memory, presenting a trade-off between rapid code generation and larger context capacity.

LLMs GPU hardware performance

ARTICLE↑ trendingReddit r/LocalLLaMA·4/9/2026

16 GB VRAM users, what model do we like best now?

Um usuário com 16 GB de VRAM compartilha sua experiência positiva com o modelo Qwen 3.5 27b em quants IQ3 em uma RTX 4080, alcançando boa velocidade e contexto. Ele discute os desafios de otimizar modelos de IA localmente com essa quantidade de VRAM, ponderando entre qualidade e velocidade ao lidar com diferentes níveis de quantização.

LLMs VRAM modelos de linguagem hardware

ARTICLE↑ trendingReddit r/LocalLLaMA·4/27/2026

Guys this is so fun!

A user expresses excitement about running various AI models like Qwen and Llama locally on their MacBook Air and an AI Workstation with an RTX Pro 6000 Blackwell, utilizing tools such as LM Studio and LM Link.

open source models LLMs Local AI hardware

CASE↑ trendingReddit r/LocalLLaMA·4/19/2026

"Browser OS" implemented by Qwen 3.6 35B: The best result I ever got from a local model

A user shares their experience implementing 'Browser OS' using the Qwen 3.6 35B local model, noting it achieved the best results they've personally obtained from a local AI model. The content likely points to a demonstration or detailed account of this impressive performance.

AI models LLMs demonstration Local AI

"Browser OS" implemented by Qwen 3.6 35B: The best result I ever got from a local model

ARTICLE↑ trendingReddit r/LocalLLaMA·4/21/2026

2x 512gb ram M3 Ultra mac studios

A user with two high-end M3 Ultra Mac Studios (512GB RAM each, $25k in hardware) is testing LLM models like Deepseek and GLM, and is asking the community for suggestions on what else to load. They are troubleshooting backend issues and awaiting optimizations for Kimi 2.6.

Apple AI models LLMs Mac Studio

RESEARCH↑ trendingReddit r/LocalLLaMA·4/23/2026

Qwen 3.6 27B Makes Huge Gains in Agency on Artificial Analysis - Ties with Sonnet 4.6

Qwen 3.6 27B has achieved significant gains, matching Sonnet 4.6 on Artificial Analysis's Agentic Index and surpassing several other prominent models. The model's training appears focused on agentic use, showing surprising performance for its size despite questionable Coding Index metrics.

model performance AI models LLMs Benchmarking

Qwen 3.6 27B Makes Huge Gains in Agency on Artificial Analysis - Ties with Sonnet 4.6

ARTICLEDEV.to AI·4/22/2026

We Built a 31-Agent AI Team That Hires Itself, Critiques Itself, and Dreams

This engineering writeup details a self-evolving, 31-agent AI team built on Claude Code, featuring a parallel cognitive layer, dynamic hiring pipeline, and robust verification. It critiques common agent frameworks, highlighting the need for specialization, cross-verification, memory calibration, and self-improvement in multi-agent systems.

Self-evolving AI AI architecture LLMs multi-agent systems

ARTICLE↑ trendingReddit r/LocalLLaMA·4/22/2026

Recent Open models from last 6 Months - Nov 2025 - Apr 2026

The user created a chart with recent open models released in the last six months (Nov 2025 - Apr 2026), focusing on the latest versions and noting the high volume of "Local LLMs." They invite the community to discuss the overall graph and underrated models.

LLMs open-source AI Model Releases Local LLMs

Recent Open models from last 6 Months - Nov 2025 - Apr 2026

ARTICLEKDNuggets·1d ago

Why Do LLMs Corrupt Your Documents When You Delegate?

This content analyzes several reasons why structural content decay may occur when delegating complex document editing tasks to Large Language Models (LLMs). It explores the inherent challenges and issues in such delegation.

content editing LLMs AI limitations AI delegation

Why Do LLMs Corrupt Your Documents When You Delegate?

ARTICLE↑ trendingReddit r/LocalLLaMA·4/19/2026

Is anyone getting real coding work done with Qwen3.6-35B-A3B-UD-Q4_K_M on a 32GB Mac in opencode, claude code or similar?

A user is attempting to perform real coding tasks with Qwen3.6-35B on a 32GB M2 Macbook Pro, encountering memory exhaustion and context window management issues. Despite the model identifying the essence of a bug, it struggles with implementation as critical information is lost during context compaction.

LLMs open-source AI local inference code generation

ARTICLE↑ trendingReddit r/LocalLLaMA·4/19/2026

Switching from Opus 4.7 to Qwen-35B-A3B

A user is considering switching from Opus 4.7 to Qwen-35B-A3B as their daily coding agent and is seeking community experiences. They question if Qwen-35B-A3B will suffice for most tasks, acknowledging Opus might have an edge in complex reasoning, running on an M5 Max 128GB.

AI models LLMs Coding Agent model comparison

ARTICLEDEV.to AI·4/23/2026

I Built a Local AI VRAM Calculator & GPU Planner (Beta)

The author has launched a new beta tool called "Local AI VRAM Calculator & GPU Planner" to help determine GPU and VRAM requirements for running local LLMs. This tool aims to make hardware tradeoffs visible for different workloads and quantization levels before committing to components.

LLMs GPU VRAM AI tools

ARTICLE↑ trendingReddit r/MachineLearning·4/23/2026

First time fine-tuning, need a sanity check — 3B or 7B for multi-task reasoning? [D]

A self-taught user new to fine-tuning seeks advice on choosing between 3B and 7B LLM models for a multi-task reasoning project. The project involves understanding underlying questions, maintaining multiple perspectives, and handling messy inputs.

LLMs model selection multi-task reasoning NLP

ARTICLE↑ trendingReddit r/LocalLLaMA·4/10/2026

gemma-4-26B-A4B with my coding agent Kon

O autor compartilha Kon, seu agente de codificação de IA, que funciona bem com modelos locais para tarefas simples. Ele é notável por seu prompt de sistema pequeno, ausência de telemetria, compatibilidade com os melhores modelos locais e provedores populares, além de uma base de código simples e recursos avançados.

Open Source LLMs Coding Agent local models

ARTICLE↑ trendingReddit r/MachineLearning·4/19/2026

Converting XQuery to SQL with Local LLMs: Do I Need Fine-Tuning or a Better Approach? [P]

The author details the challenge of converting XQuery to SQL using local LLMs in an enterprise setting, hampered by limited and diverse training data. Initial attempts with parsing-based methods and prompt engineering proved insufficient for varied or complex queries.

LLMs prompt-engineering SQL data conversion

ARTICLE↑ trendingReddit r/MachineLearning·4/9/2026

Studying Sutton and Barto's RL book and its connections to RL for LLMs (e.g., tool use, math reasoning, agents, and so on)? [D]

Um graduado em Matemática busca orientação para estudar Aprendizado por Reforço (RL) e suas conexões com LLMs, especialmente para aplicações em matemática. Ele questiona a relevância do livro 'Sutton e Barto' em um contexto moderno de LLMs e pede ajuda para focar em tópicos e algoritmos mais recentes como PPO e GRPO.

Sutton e Barto LLMs AI para Matemática reinforcement learning

NEWS↑ trendingReddit r/LocalLLaMA·4/9/2026

Local (small) LLMs found the same vulnerabilities as Mythos

Pequenos Modelos de Linguagem Grandes (LLMs) descobriram as mesmas vulnerabilidades que o sistema Mythos. Este achado sugere que modelos menores podem replicar descobertas críticas de segurança em sistemas de IA.

LLMs Mythos vulnerabilities AI security

ARTICLE↑ trendingReddit r/LocalLLaMA·4/30/2026

Open Models - April 2026 - One of the best months of all time for Local LLMs?

The content discusses open models, particularly Local LLMs, from April 2026, highlighting it as a potentially great month for them. It also notes a license change for MiniMax-M2.7 and asks for suggestions on underrated models.

Open Source AI models LLMs licensing

Open Models - April 2026 - One of the best months of all time for Local LLMs?