← heapsort-ai

language model

5 items

NEWS↑ trendingReddit r/LocalLLaMA·18d ago

[NEW] Supra-50M Released!

SupraLabs has released Supra-50M, a compact 50M-parameter causal language model built with a Llama-style architecture. Trained on 20 billion high-quality tokens, it achieves competitive or superior results on several key benchmarks despite being significantly smaller than comparable open models.

[NEW] Supra-50M Released!
42
RESEARCHarXiv CS.CL·25d ago

VectraYX-Nano: A 42M-Parameter Spanish Cybersecurity Language Model with Curriculum Learning and Native Tool Use

VectraYX-Nano is a 42M-parameter Spanish language model specifically developed for cybersecurity with a Latin-American focus and native tool invocation. This research details its training from scratch, including a custom 170M-token Spanish corpus, a specific Transformer architecture, and a curriculum learning approach with replay.

27