← heapsort-ai

large language models

262 items

RESEARCH↑ trendingReddit r/MachineLearning·5/7/2026

META Superintelligence Lab Presents: ProgramBench: Can SOTA AI Recreate Real Executable Programs(ffmpeg, SQLite, ripgrep) From Scratch Without The Internet?

Meta Superintelligence Lab introduces ProgramBench, an initiative testing the ability of advanced AIs to recreate executable programs like ffmpeg and SQLite from scratch, without internet access. This study aims to explore the limits of AI code generation. The research focuses on evaluating the autonomy and completeness of AI models in complex software synthesis.

42
ARTICLE↑ trendingReddit r/MachineLearning·4/26/2026

Why do only big ML labs dominate widely-used models despite many open-source pretrained models smaller labs could do RL on? [D]

The content questions why large AI labs dominate widely-used models like GPT and Claude, despite the existence of many open-source pretrained models of similar scale. The author suggests that Reinforcement Learning from Human Feedback (RLHF) is key to the superiority of these models and wonders why it wouldn't be more accessible for smaller labs.

42
RESEARCH↑ trendingReddit r/MachineLearning·4/13/2026

I scaled a pure Spiking Neural Network (SNN) to 1.088B parameters from scratch. Ran out of budget, but here is what I found [R]

An 18-year-old indie developer scaled a pure Spiking Neural Network (SNN) to 1.088 billion parameters from scratch for language modeling, achieving loss convergence despite common beliefs about vanishing gradients. Key findings include maintaining 93% sparsity and the unexpected emergence of structurally correct Russian text, though the experiment was halted due to budget constraints.

42
RESEARCHarXiv CS.LG·1d ago

FAIR-Calib: Frontier-Aware Instability-Reweighted Calibration for Post-Training Quantization of Diffusion Large Language Models

Diffusion Large Language Models (dLLMs) face a "stability lag" due to irreversible token commitment, a problem exacerbated by Post-Training Quantization (PTQ) errors. FAIR-Calib proposes a two-stage PTQ framework that uses a position prior and layer-wise calibration to protect fragile frontier states, enhancing quantization for dLLMs.

40
ARTICLE↑ trendingReddit r/LocalLLaMA·5/6/2026

Bad news: Apple drops high-memory Mac Studio configs

Apple has quietly discontinued high-memory configurations for the Mac Studio, leaving the M3 Ultra version with a maximum of 96GB RAM and the Mac mini at 48GB. This change is a significant setback for users wanting to run large AI models locally, as high-memory options were crucial for such tasks.

Bad news: Apple drops high-memory Mac Studio configs
36
ARTICLE↑ trendingReddit r/LocalLLaMA·4/27/2026

Anthropic's Claude remote uses GLM-4.7

A user observed that Anthropic's Claude code remote environment defaults to using the GLM-4.7 model, rather than a proprietary Anthropic model. This finding prompts questions about AI companies, known for their proprietary models, potentially serving open-weight models.

35
RESEARCHarXiv CS.LG·4/20/2026

Aletheia: Gradient-Guided Layer Selection for Efficient LoRA Fine-Tuning Across Architectures

Aletheia introduces a gradient-guided layer selection method for LoRA fine-tuning, identifying the most task-relevant layers and applying adapters selectively with asymmetric rank. This approach achieves a significant 15-28% training speedup across diverse large language models and architectures while broadly matching downstream behavior.

32
ARTICLEDEV.to AI·4/22/2026

AI এখন শুধু একটা টুল না থেকে ধীরে ধীরে intelligence এর দিকে যাচ্ছে

Recent whispers in Silicon Valley point to Anthropic's Mythos, an AI model rumored to be transcending the definition of a mere tool towards intelligence. Insiders suggest Mythos can deeply analyze complex systems, understand software structures, and detect hidden vulnerabilities, capabilities far beyond standard language models.

31