docs: Add comprehensive interactive documentation with CDN images
Browse files
README.md
CHANGED
|
@@ -35,11 +35,11 @@ license_link: https://ameforge.tech
|
|
| 35 |
|
| 36 |
## What is GearCut?
|
| 37 |
|
| 38 |
-
GearCut is a **natural language video editing engine** developed by [AMFORGE](https://huggingface.co/AMFORGE). Instead of learning complex video editing software, you simply describe your edit in plain English — and GearCut's
|
| 39 |
|
| 40 |
-
The core model (`gc_editor`) contains **
|
| 41 |
|
| 42 |
-
> **"remove the first 3 seconds
|
| 43 |
|
| 44 |
---
|
| 45 |
|
|
@@ -225,11 +225,14 @@ The tokenizer uses a custom vocabulary (`gearcut_tok.vocab`) optimized for tempo
|
|
| 225 |
|
| 226 |
| Property | Value |
|
| 227 |
|---|---|
|
| 228 |
-
| **Architecture** |
|
| 229 |
-
| **Parameters** |
|
|
|
|
|
|
|
| 230 |
| **Vocabulary size** | 682 tokens |
|
| 231 |
-
| **
|
| 232 |
-
| **
|
|
|
|
| 233 |
| **Version** | v1-editor |
|
| 234 |
| **Developed by** | AMFORGE |
|
| 235 |
|
|
@@ -237,6 +240,18 @@ The core model files (`gearcut_compiler.py`, `gearcut_model.py`, `gearcut_infer.
|
|
| 237 |
|
| 238 |
---
|
| 239 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 240 |
## Requirements
|
| 241 |
|
| 242 |
```
|
|
|
|
| 35 |
|
| 36 |
## What is GearCut?
|
| 37 |
|
| 38 |
+
GearCut is a **natural language video editing engine** developed by [AMFORGE](https://huggingface.co/AMFORGE). Instead of learning complex video editing software, you simply describe your edit in plain English — and GearCut's model translates your instruction into a structured list of editing operations that the project compiler then executes.
|
| 39 |
|
| 40 |
+
The core model (`gc_editor`) is built on AMFORGE's in-house **SparseMind** architecture — sparse attention, sparse FFN, dynamic neuron typing, and episodic memory. It contains **28,759,300 parameters (~28.8M)** with a specialized vocabulary of **682 tokens** designed exclusively for video editing semantics. It understands temporal references, clip identifiers, and export configurations, then generates a structured operation plan with frame-accurate precision.
|
| 41 |
|
| 42 |
+
> **"remove the first 3 seconds"** → `[{"op":"trim","clip":"c1","in":3.0,"out":8.0}]` — done.
|
| 43 |
|
| 44 |
---
|
| 45 |
|
|
|
|
| 225 |
|
| 226 |
| Property | Value |
|
| 227 |
|---|---|
|
| 228 |
+
| **Architecture** | SparseMind (decoder-only, sparse) |
|
| 229 |
+
| **Parameters** | 28,759,300 (~28.8M) |
|
| 230 |
+
| **Hidden size / Layers** | 384 / 8 |
|
| 231 |
+
| **Context length** | 256 tokens |
|
| 232 |
| **Vocabulary size** | 682 tokens |
|
| 233 |
+
| **Tokenizer** | GearCut SentencePiece-BPE (`gearcut_tok.vocab` + `gearcut_tok.model`) |
|
| 234 |
+
| **Precision** | fp32 |
|
| 235 |
+
| **Model file** | `gc_editor.pt` |
|
| 236 |
| **Version** | v1-editor |
|
| 237 |
| **Developed by** | AMFORGE |
|
| 238 |
|
|
|
|
| 240 |
|
| 241 |
---
|
| 242 |
|
| 243 |
+
## Evaluation
|
| 244 |
+
|
| 245 |
+
Measured on a held-out synthetic validation split. The meaningful metrics are not perplexity but whether the generated operations are directly usable:
|
| 246 |
+
|
| 247 |
+
| Metric | Score |
|
| 248 |
+
|---|---|
|
| 249 |
+
| **Valid JSON** | 100.0% |
|
| 250 |
+
| **Exact match** (operations == reference) | 76.5% |
|
| 251 |
+
| **Best exact match during training** | 88.0% |
|
| 252 |
+
|
| 253 |
+
---
|
| 254 |
+
|
| 255 |
## Requirements
|
| 256 |
|
| 257 |
```
|