Runtime error Featured 2.95k The Smol Training Playbook š 2.95k The secrets to building world-class LLMs
Breaking the Ceiling of the LLM Community by Treating Token Generation as a Classification for Ensembling Paper ⢠2406.12585 ⢠Published Jun 18, 2024 ⢠2
Llama-Primus-Nemotron-70B Collection Llama-Primus-Nemotron-70B š is obtained by continued pretraining Llama-3.1-Nemotron-70B-Instruct on over 10B tokens of cybersecurity texts. ⢠4 items ⢠Updated Aug 9, 2025 ⢠6
Llama-Primus-Nemotron-70B Collection Llama-Primus-Nemotron-70B šš½āāļø is obtained by continued pretraining Llama-3.1-Nemotron-70B-Instruct on over 10B tokens of cybersecurity texts. ⢠4 items ⢠Updated Aug 9, 2025 ⢠3
trend-cybertron/Llama-Primus-Nemotron-70B-Instruct Text Generation ⢠71B ⢠Updated Aug 9, 2025 ⢠146 ⢠13
trend-cybertron/Llama-Primus-Nemotron-70B-Base Text Generation ⢠71B ⢠Updated Aug 9, 2025 ⢠6 ⢠6
Llama-Primus-Nemotron-70B Collection Llama-Primus-Nemotron-70B šš½āāļø is obtained by continued pretraining Llama-3.1-Nemotron-70B-Instruct on over 10B tokens of cybersecurity texts. ⢠4 items ⢠Updated Aug 9, 2025 ⢠3
Llama-Primus-Nemotron-70B Collection Llama-Primus-Nemotron-70B š is obtained by continued pretraining Llama-3.1-Nemotron-70B-Instruct on over 10B tokens of cybersecurity texts. ⢠4 items ⢠Updated Aug 9, 2025 ⢠6
trendmicro-ailab/Llama-Primus-Reasoning Text Generation ⢠8B ⢠Updated Jun 2, 2025 ⢠220 ⢠⢠17