StabIeGenius committed on
Commit b82520a · verified · 1 Parent(s): 5e1650a

Update README.md

Files changed (1)
  1. README.md +121 -3
README.md CHANGED
@@ -1,3 +1,121 @@
1
- ---
2
- license: apache-2.0
3
- ---
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ tags:
6
+ - code
7
+ - assistant
8
+ - ollama
9
+ - openai-compatible
10
+ - streaming
11
+ - voice
12
+ pipeline_tag: text-generation
13
+ inference: false
14
+ ---
15
+
16
+ # GeniusPro Coder v1
17
+
18
+ **GeniusPro Coder v1** is a coding-focused AI assistant model built for the [GeniusPro](https://geniuspro.io) platform. It powers the chat dashboard at [chat.geniuspro.io](https://chat.geniuspro.io) and the GeniusPro Voice Assistant, providing code generation, code explanation, and general-purpose assistance.
19
+
20
+ ## Highlights
21
+
22
+ - Code generation, debugging, and explanation across multiple languages
23
+ - Natural conversational ability for non-code tasks
24
+ - OpenAI-compatible API (drop-in replacement for existing tooling)
25
+ - Streaming support for real-time token delivery
26
+ - Voice mode with concise, spoken-friendly responses
27
+ - Runs locally on consumer hardware via [Ollama](https://ollama.com)
28
+
29
+ ## Intended Use
30
+
31
+ GeniusPro Coder v1 is designed for:
32
+
33
+ - **Code assistance** — generating, reviewing, debugging, and explaining code
34
+ - **Chat** — general-purpose question answering and conversation
35
+ - **Voice interaction** — concise, natural-language responses optimized for text-to-speech
36
+
37
+ It powers the GeniusPro platform, which includes a web-based chat dashboard and a real-time voice assistant.
38
+
39
+ ## API Usage
40
+
41
+ GeniusPro Coder v1 is served through an OpenAI-compatible API at `api.geniuspro.io`.
42
+
43
+ ### Chat Completions
44
+
45
+ ```bash
46
+ curl https://api.geniuspro.io/v1/chat/completions \
47
+ -H "Authorization: Bearer YOUR_API_KEY" \
48
+ -H "Content-Type: application/json" \
49
+ -d '{
50
+ "model": "geniuspro-coder-v1",
51
+ "messages": [
52
+ {"role": "user", "content": "Write a Python function to merge two sorted lists"}
53
+ ],
54
+ "stream": true
55
+ }'
56
+ ```
57
+
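The curl call above can also be sketched in Python using only the standard library. This is a minimal sketch, not the platform's official client: the helper names (`build_request`, `iter_stream_text`, `stream_completion`) are hypothetical, and the parser assumes the stream follows the OpenAI chunk format (`data: {json}` lines carrying `choices[0].delta.content`, ending with `data: [DONE]`), which the README implies but does not spell out.

```python
import json
import urllib.request
from typing import Iterable, Iterator

API_URL = "https://api.geniuspro.io/v1/chat/completions"  # endpoint from the curl example


def build_request(api_key: str, prompt: str, stream: bool = True) -> urllib.request.Request:
    """Build the same POST request as the curl example (hypothetical helper)."""
    payload = {
        "model": "geniuspro-coder-v1",
        "messages": [{"role": "user", "content": prompt}],
        "stream": stream,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def iter_stream_text(lines: Iterable[bytes]) -> Iterator[str]:
    """Yield text fragments from SSE lines.

    Assumption: OpenAI-style chunks, terminated by `data: [DONE]`.
    """
    for raw in lines:
        line = raw.decode("utf-8").strip()
        if not line.startswith("data:"):
            continue  # skip blank separator lines and SSE comments
        data = line[len("data:"):].strip()
        if data == "[DONE]":
            break
        choices = json.loads(data).get("choices") or []
        if not choices:
            continue
        text = choices[0].get("delta", {}).get("content")
        if text:
            yield text


def stream_completion(api_key: str, prompt: str) -> None:
    """Print fragments as they arrive (performs a real network call)."""
    with urllib.request.urlopen(build_request(api_key, prompt)) as response:
        for text in iter_stream_text(response):
            print(text, end="", flush=True)
    print()
```

Calling `stream_completion("YOUR_API_KEY", "Write a Python function to merge two sorted lists")` would mirror the curl example. Keeping the parser separate from the network call makes it easy to check against recorded stream lines.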
58
+ ### Supported Parameters
59
+
60
+ | Parameter | Description |
61
+ |-----------|-------------|
62
+ | `temperature` | Controls sampling randomness (0.0 = near-deterministic; higher values produce more varied output) |
63
+ | `top_p` | Nucleus sampling threshold |
64
+ | `max_tokens` | Maximum tokens to generate |
65
+ | `stop` | Stop sequences |
66
+ | `stream` | Enable streaming responses (SSE) |
67
+
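As an illustration of how the parameters in the table fit into a request body, the sketch below builds a payload and rejects any option the table does not list. The helper `build_payload` and the strict rejection are assumptions made for this example; the README does not say how the API treats unlisted parameters.

```python
from typing import Any, Dict

# Parameters listed in the Supported Parameters table.
SUPPORTED_PARAMS = {"temperature", "top_p", "max_tokens", "stop", "stream"}


def build_payload(prompt: str, **params: Any) -> Dict[str, Any]:
    """Return a chat-completions request body (hypothetical helper).

    Raises ValueError for parameters outside the documented set, so that
    typos such as `max_token` surface before the request is sent.
    """
    unknown = set(params) - SUPPORTED_PARAMS
    if unknown:
        raise ValueError(f"unsupported parameters: {sorted(unknown)}")
    payload: Dict[str, Any] = {
        "model": "geniuspro-coder-v1",
        "messages": [{"role": "user", "content": prompt}],
    }
    payload.update(params)
    return payload
```

For example, `build_payload("Explain this regex", temperature=0.2, max_tokens=256, stop=["\n\n"])` yields a body that can be serialized with `json.dumps` and sent to `/v1/chat/completions`.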
68
+ ### Available Endpoints
69
+
70
+ | Endpoint | Method | Description |
71
+ |----------|--------|-------------|
72
+ | `/v1/models` | GET | List available models |
73
+ | `/v1/chat/completions` | POST | Chat completions (streaming + non-streaming) |
74
+ | `/v1/voice` | WebSocket | Real-time voice interaction |
75
+ | `/health` | GET | Health check (no auth required) |
76
+
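The two GET endpoints in the table can be prepared as follows. This is a sketch under assumptions: `build_get` is a hypothetical helper, and sending a Bearer token to `/v1/models` is inferred from the note that only `/health` needs no auth. Response bodies are not parsed here because the README does not document their shape.

```python
import urllib.request
from typing import Optional

BASE_URL = "https://api.geniuspro.io"  # host from the API Usage section


def build_get(path: str, api_key: Optional[str] = None) -> urllib.request.Request:
    """Build a GET request, adding the Authorization header only when a key is given."""
    headers = {"Authorization": f"Bearer {api_key}"} if api_key else {}
    return urllib.request.Request(BASE_URL + path, headers=headers, method="GET")


# Health check: no credentials attached.
health_request = build_get("/health")

# Model listing: assumed to need the same Bearer token as chat completions.
models_request = build_get("/v1/models", api_key="YOUR_API_KEY")
```

Either request can be sent with `urllib.request.urlopen(...)`. The `/v1/voice` WebSocket endpoint is left out because its message protocol is not described.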
77
+ ## Running Locally with Ollama
78
+
79
+ ```bash
80
+ # Pull the model
81
+ ollama pull geniuspro-coder-v1
82
+
83
+ # Run interactively
84
+ ollama run geniuspro-coder-v1
85
+
86
+ # Start the API server (skip if Ollama is already running in the background)
87
+ ollama serve
88
+ ```
89
+
90
+ Once the server is running, the model is available at `http://localhost:11434` with the same OpenAI-compatible API format (Ollama serves these routes under `/v1`).
91
+
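A non-streaming request against the local server might look like the sketch below. It assumes the model was pulled under the name `geniuspro-coder-v1` as in the commands above and that the server listens on Ollama's default port. The helpers `build_local_request` and `ask_local` are hypothetical, and no API key is sent because a default local Ollama server does not require one.

```python
import json
import urllib.request

LOCAL_BASE_URL = "http://localhost:11434"  # Ollama's default address


def build_local_request(
    prompt: str,
    model: str = "geniuspro-coder-v1",
    base_url: str = LOCAL_BASE_URL,
) -> urllib.request.Request:
    """Build a chat-completions POST for Ollama's OpenAI-compatible route."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,  # ask for one complete JSON response
    }
    return urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask_local(prompt: str) -> str:
    """Send the prompt to the local server and return the reply text (network call)."""
    with urllib.request.urlopen(build_local_request(prompt)) as response:
        body = json.load(response)
    # OpenAI-style non-streaming responses carry the text here.
    return body["choices"][0]["message"]["content"]
```

Since only the base URL and the auth header differ from the hosted API, tooling written against one should need little change to target the other.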
92
+ ## Infrastructure
93
+
94
+ GeniusPro Coder v1 runs on dedicated hardware for low-latency inference:
95
+
96
+ - **GPU**: NVIDIA RTX 5090 (32 GB VRAM)
97
+ - **Runtime**: [Ollama](https://ollama.com) for model serving
98
+ - **Gateway**: FastAPI reverse proxy with auth, rate limiting, and usage tracking
99
+ - **Deployment**: Ubuntu Server behind Nginx + Cloudflare Tunnel
100
+
101
+ ## Limitations
102
+
103
+ - Optimized for English. Other languages may work but are not officially supported.
104
+ - Code generation quality varies by language — strongest in Python, JavaScript/TypeScript, and common web technologies.
105
+ - Not suitable for safety-critical applications without human review.
106
+ - Context window and output length are bounded by the underlying architecture.
107
+
108
+ ## License
109
+
110
+ This model is released under the [Apache 2.0 License](https://www.apache.org/licenses/LICENSE-2.0).
111
+
112
+ ## Links
113
+
114
+ - **Platform**: [geniuspro.io](https://geniuspro.io)
115
+ - **Chat Dashboard**: [chat.geniuspro.io](https://chat.geniuspro.io)
116
+ - **API Docs**: [api.geniuspro.io/docs](https://api.geniuspro.io/docs)
117
+ - **GitHub**: [github.com/GeniusProAI](https://github.com/GeniusProAI)
118
+
119
+ ---
120
+
121
+ Built by [GeniusPro](https://geniuspro.io) — We Host Your AI.