← heapsort
ARTICLE↑ trending42

Nanochat vs Llama for training from scratch? [P]

Reddit r/MachineLearningΒ·April 24, 2026

The user is training an AI model from scratch and seeks advice on the best architecture, considering switching from Nanochat (which lacks Transformers compatibility) to the Llama architecture. The goal is an open-source project with a new, larger dataset, despite Nanochat's advantages.

Read original β†—