Veyra AI

community
Activity Feed

AI & ML interests

Building tiny English language models for practical local AI. Veyra AI focuses on CPU-friendly inference, function calling, tool use, Python-oriented small models, distillation, RLVR, and lightweight fine-tuning. The goal is to make compact models that are easy to run, inspect, adapt, and use in real workflows without large hardware.

Recent Activity

Jdudeo  updated a collection 7 days ago
Veyra2
Jdudeo  updated a collection 7 days ago
Veyra2
View all activity

Organization Card

VeyraAIBanner

Welcome to Veyra AI

Tiny English language models built for fast local inference. Veyra AI focuses on compact, CPU-friendly language models that are easy to run, fine-tune, and experiment with. Our work is centered on small English models, function calling, Python-oriented variants, distillation, RLVR, tool use, and local AI. The goal is simple: make capable small models that are practical for local workflows, research, and lightweight deployment.

Current Model Families:

  • Veyra2 30M/15M — Ultra-lightweight models built for highly resource-constrained environments. Ideal for edge devices and ultra-fast inference.
  • Veyra 30M (Legacy) — Proven 30M base model with strong instruction-following and balanced general capabilities. Still reliable for many use cases.

Planned Model Families:

  • Veyra3 30M/15M Next-generation models optimized for low-latency inference and on-device deployment. Delivers excellent speed and efficiency without sacrificing responsiveness.
  • Kairo 30M — Experimental architecture designed to validate next-generation design choices for the entire Veyra lineup. Some Kairo models are out now.

datasets 0

None public yet