gss1147's picture
Update README.md
0499ef7 verified
---
datasets:
- nvidia/Nemotron-CC-v2
- nvidia/Nemotron-Post-Training-Dataset-v2
- nvidia/Nemotron-Instruction-Following-Chat-v1
- nvidia/Nemotron-Science-v1
- nvidia/Nemotron-Agentic-v1
- nvidia/Nemotron-Competitive-Programming-v1
- nvidia/Nemotron-Math-Proofs-v1
- nvidia/Nemotron-RL-Agentic-Conversational-Tool-Use-Pivot-v1
- nvidia/Nemotron-RL-instruction_following
- nvidia/Nemotron-RL-agent-calendar_scheduling
- nvidia/Nemotron-RL-instruction_following-structured_outputs
---
Nvidia.Agentic.Coder-4B-GGUF
📌 Model Overview
Model Name: WithinUsAI/Nvidia.Agentic.Coder-4B-GGUF
Organization: Within Us AI
Model Type: Code LLM (Agentic, Instruction-Following)
Parameter Size: 4B
Format: GGUF (quantized for local inference)
Primary Use: Agentic coding, tool-using workflows, software engineering reasoning
This model is part of the Within Us AI ecosystem focused on building agentic, reasoning-driven coding systems designed to think, act, and verify like real engineers. 
🧬 Architecture & Lineage
* Base Family: NVIDIA Nemotron-style 4B class models (inferred lineage from naming + ecosystem alignment)
* Format Conversion: GGUF quantization for efficient local inference
* Training Approach:
* Instruction-tuned for coding tasks
* Agentic workflow emphasis (multi-step reasoning, tool usage)
* Likely merged / fine-tuned using Within Us AI proprietary pipelines
Related ecosystem models include:
* NVIDIA-Nemotron-3-Nano-4B
* Other 4B agentic coders and merges in the same class 
⚙️ Key Capabilities
🧑‍💻 Code Intelligence
* Multi-language code generation
* Bug fixing and refactoring
* Structured output generation
🤖 Agentic Behavior
* Step-by-step reasoning
* Task decomposition
* Tool-calling alignment (design goal)
🧠 Reasoning Focus
* Instruction-following with logical chaining
* Designed for evaluation-style datasets (tests-as-truth philosophy)
📦 GGUF Quantization
GGUF allows efficient local inference with tools like:
* llama.cpp
* LM Studio
* Ollama (GGUF-compatible builds)
Typical quantizations for 4B GGUF models include:
* Q2_K (~1.8GB)
* Q3_K (~2.0–2.3GB)
* Q4_K (~2.5GB, recommended balance) 
🚀 Intended Use
✅ Ideal Use Cases
* Local AI coding assistants
* Autonomous coding agents
* SWE-bench style evaluation
* Tool-augmented workflows
* Offline developer copilots
⚠️ Limitations
* Smaller 4B parameter size limits deep reasoning vs larger models
* Performance depends heavily on prompt structure
* Tool-use requires external orchestration (not built-in runtime)
🛠️ Usage Example (llama.cpp)
./main -m Nvidia.Agentic.Coder-4B.Q4_K.gguf \
-p "Write a Python function to parse JSON logs and extract errors." \
-n 512
🧪 Training Philosophy (Within Us AI)
Within Us AI focuses on:
* Agentic AI systems
* Test-driven training (tests-as-truth)
* Diff-first patching workflows
* Secure and auditable code generation
* Evaluation-first development pipelines 
📊 Evaluation
No formal benchmark results published yet.
Expected strengths:
* Strong instruction adherence
* Lightweight agentic reasoning
* Efficient local deployment
📚 Datasets & Training Sources
This model follows the Within Us AI methodology:
* Proprietary datasets created by Within Us AI
* May include third-party datasets for training (no ownership claimed)
* Emphasis on:
* Code reasoning traces
* Agentic workflows
* Evaluation-driven samples
📜 License
License Type: Custom / Other (Within Us AI License)
Terms:
* Within Us AI created the fine-tuning, merging, and training methodology
* Base model architecture originates from third-party LLM ecosystems (e.g., NVIDIA / Nemotron class)
* Third-party datasets may be used without claiming ownership
* Full credit and acknowledgment belong to original dataset and base model creators
🙏 Acknowledgements
Special thanks to:
* NVIDIA Nemotron ecosystem contributors
* Open-source GGUF tooling community
* Dataset creators across Hugging Face
* The broader open-source AI research community
🔗 Links
* Model: https://huggingface.co/WithinUsAI/Nvidia.Agentic.Coder-4B-GGUF
* Organization: https://huggingface.co/WithinUsAI