--- language: - en license: apache-2.0 tags: - ssm - state-space-model - causal-lm - rabbit - rtaforge - india - sovereign-ai pipeline_tag: text-generation --- # Anvaya-Rabbit 2.7B **India's first sovereign SSM-based language model.** Non-transformer architecture. No attention mechanism. Constitutional training via Gurukul. 7 patents filed at IP India. --- ## ⚠️ Checkpoint Deprecation Notice | Checkpoint | Status | Notes | |---|---|---| | `Anvaya-Rabbit-2.7B-0.55-base.pt` | ✅ **CURRENT** | Wikipedia warmup complete, CE 0.993x | | Any prior checkpoint | ⚠️ **DEPRECATED** | Do not use for inference | Prior checkpoints are retained for research transparency. The current checkpoint reflects iterative refinement of the ANVAYA RtaSSM architecture and training pipeline. **Always use the latest `-base.pt` for any downstream work.** --- ## What's in this repo | Tier | File | Use this when… | |---|---|---| | **Base** | `base/Anvaya-Rabbit-2.7B-0.55-base.pt` | You want raw pretrained weights for your own fine-tuning | Instruct and Imprint tiers are in preparation (epoch 2 → SFT → imprint pipeline). --- ## Quickstart ```bash pip install rtaforge transformers ``` ```python from transformers import AutoModelForCausalLM, AutoTokenizer tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b") tokenizer.add_special_tokens({"additional_special_tokens": ["<|im_start|>", "<|im_end|>"]}) model = AutoModelForCausalLM.from_pretrained( "RtaForge/Anvaya-Rabbit-2.7B", trust_remote_code=True, torch_dtype="bfloat16", device_map="auto", ) prompt = "Rabbit is a helpful and honest assistant.\n\nUser: Who are you?\nRabbit:" inputs = tokenizer(prompt, return_tensors="pt").to(model.device) outputs = model.generate(**inputs, max_new_tokens=60, repetition_penalty=1.3) print(tokenizer.decode(outputs[0], skip_special_tokens=True)) ``` > The `rtaforge` runtime package provides the compiled architecture. Source is not distributed. --- ## Why SSM? > Transformers scale quadratically with context length because every token attends to every other token. SSMs replace attention with a fixed-size recurrent state: inference cost stays **constant per token** regardless of context length, VRAM footprint shrinks dramatically, and long-document throughput improves by orders of magnitude — all at the same parameter count. --- ## Architecture Rabbit is built on **RtaSSM v7.2.2-FU "Fortress Unbroken"**, a custom state-space model developed at RtaForge: - **No attention mechanism** — purely recurrent SSM layers with learned state dynamics - **64 layers, 2560 hidden dimensions**, 2.7B parameters, bfloat16 - **Constitutional training** — Gurukul curriculum with wiki pretraining → instruct SFT → persona imprint - **Vocabulary** 50,280 tokens (GPT-NeoX tokenizer) --- ## Training | Stage | Data | Notes | |---|---|---| | Wiki warmup (v0.55) | Wikipedia (en) | 700 constitutional proposals via Gurukul — **complete** | | Epoch 2 (planned) | RedPajama | Gate-only, ~3,350 proposals | | Instruct SFT (planned) | ChatML instruction pairs | `gate_only` trainable strategy | | Persona imprint (planned) | Rabbit constitutional corpus | Identity and value alignment | --- ## Evaluation Access Weights are publicly available. Runtime package is live: ```bash pip install rtaforge ``` To evaluate Rabbit or discuss deployment: 📧 guha@rtaforge.in 🌐 rtaforge.in Runtime documentation coming soon. --- ## Maturity and Roadmap **v0.55 is a base pretrained checkpoint** — Wikipedia warmup complete, CE ratio 0.993×. Usable conversational behaviour is targeted at **v0.8–v0.9**, currently in training. - Evaluating for deployment? Wait for v0.9. - Evaluating the architecture or training methodology? v0.55-base is exactly what you need. ## Limitations v0.55 has not been evaluated on standard benchmarks. She is small, she is new, and she is learning. Feedback welcome at guha@rtaforge.in. --- ## Citation ```bibtex @misc{anvaya-rabbit-2026, title = {Anvaya-Rabbit: A Sovereign SSM Language Model}, author = {RtaForge}, year = {2026}, url = {https://huggingface.co/RtaForge/Anvaya-Rabbit-2.7B} } ``` --- *Anvaya (अन्वय) — logical connection, coherence. Rabbit — the fast runner.*