Octavio-Santana/deberta-v3-base-prompt-injection-detection Text Classification • 0.2B • Updated 20 days ago • 26 • 1
Octavio-Santana/distilbert-base-sentiment-analysis-pt Text Classification • 0.1B • Updated 20 days ago • 232 • 1
Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 5 items • Updated Jan 27 • 172
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning Paper • 2509.24372 • Published Sep 29, 2025 • 11
A Survey of Reinforcement Learning for Large Reasoning Models Paper • 2509.08827 • Published Sep 10, 2025 • 190
view article Article A Survey of Small Language Models in the Era of LLMs: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness Jul 16, 2025 • 4
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! +10 Aug 5, 2025 • 511