view article Article How We Built a Semantic Highlight Model To Save Token Cost for RAG 12 days ago • 59
view article Article I Built a RAG System That Listens to Live BBC News and Answers Questions About "What Happened 10 Minutes Ago" Dec 9, 2025 • 13
view article Article Deploy NVIDIA Riva ASR on Kubernetes GPU Cluster in Just 5 Minutes Sep 5, 2025 • 1
view article Article What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware Aug 8, 2025 • 31
view article Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code +2 May 23, 2025 • 171
SHAKTI: A 2.5 Billion Parameter Small Language Model Optimized for Edge AI and Low-Resource Environments Paper • 2410.11331 • Published Oct 15, 2024 • 8
Fine-Tuning Small Language Models for Domain-Specific AI: An Edge AI Perspective Paper • 2503.01933 • Published Mar 3, 2025 • 13