ameforge commited on
Commit
af03da2
·
verified ·
1 Parent(s): d0f1905

docs: Add comprehensive interactive documentation with CDN images

Browse files
Files changed (1) hide show
  1. README.md +22 -7
README.md CHANGED
@@ -35,11 +35,11 @@ license_link: https://ameforge.tech
35
 
36
  ## What is GearCut?
37
 
38
- GearCut is a **natural language video editing engine** developed by [AMFORGE](https://huggingface.co/AMFORGE). Instead of learning complex video editing software, you simply describe your edit in plain English — and GearCut's transformer-based model translates your instruction into precise ffmpeg operations.
39
 
40
- The core model (`gc_editor`) contains **9,721,219 parameters** with a specialized vocabulary of **682 tokens** designed exclusively for video editing semantics. It understands temporal references, clip identifiers, and export configurations, then generates a structured operation plan that ffmpeg executes with frame-accurate precision.
41
 
42
- > **"remove the first 3 seconds and export as out.mp4"** → GearCut trims from 3.0s to end, renders at youtube_1080p preset, done.
43
 
44
  ---
45
 
@@ -225,11 +225,14 @@ The tokenizer uses a custom vocabulary (`gearcut_tok.vocab`) optimized for tempo
225
 
226
  | Property | Value |
227
  |---|---|
228
- | **Architecture** | Transformer encoder-decoder |
229
- | **Parameters** | 9,721,219 |
 
 
230
  | **Vocabulary size** | 682 tokens |
231
- | **Model file** | `gc_editor.pt` (~38 MB) |
232
- | **Tokenizer** | Custom BPE (`gearcut_tok.vocab` + `gearcut_tok.model`) |
 
233
  | **Version** | v1-editor |
234
  | **Developed by** | AMFORGE |
235
 
@@ -237,6 +240,18 @@ The core model files (`gearcut_compiler.py`, `gearcut_model.py`, `gearcut_infer.
237
 
238
  ---
239
 
 
 
 
 
 
 
 
 
 
 
 
 
240
  ## Requirements
241
 
242
  ```
 
35
 
36
  ## What is GearCut?
37
 
38
+ GearCut is a **natural language video editing engine** developed by [AMFORGE](https://huggingface.co/AMFORGE). Instead of learning complex video editing software, you simply describe your edit in plain English — and GearCut's model translates your instruction into a structured list of editing operations that the project compiler then executes.
39
 
40
+ The core model (`gc_editor`) is built on AMFORGE's in-house **SparseMind** architecture — sparse attention, sparse FFN, dynamic neuron typing, and episodic memory. It contains **28,759,300 parameters (~28.8M)** with a specialized vocabulary of **682 tokens** designed exclusively for video editing semantics. It understands temporal references, clip identifiers, and export configurations, then generates a structured operation plan with frame-accurate precision.
41
 
42
+ > **"remove the first 3 seconds"** → `[{"op":"trim","clip":"c1","in":3.0,"out":8.0}]` done.
43
 
44
  ---
45
 
 
225
 
226
  | Property | Value |
227
  |---|---|
228
+ | **Architecture** | SparseMind (decoder-only, sparse) |
229
+ | **Parameters** | 28,759,300 (~28.8M) |
230
+ | **Hidden size / Layers** | 384 / 8 |
231
+ | **Context length** | 256 tokens |
232
  | **Vocabulary size** | 682 tokens |
233
+ | **Tokenizer** | GearCut SentencePiece-BPE (`gearcut_tok.vocab` + `gearcut_tok.model`) |
234
+ | **Precision** | fp32 |
235
+ | **Model file** | `gc_editor.pt` |
236
  | **Version** | v1-editor |
237
  | **Developed by** | AMFORGE |
238
 
 
240
 
241
  ---
242
 
243
+ ## Evaluation
244
+
245
+ Measured on a held-out synthetic validation split. The meaningful metrics are not perplexity but whether the generated operations are directly usable:
246
+
247
+ | Metric | Score |
248
+ |---|---|
249
+ | **Valid JSON** | 100.0% |
250
+ | **Exact match** (operations == reference) | 76.5% |
251
+ | **Best exact match during training** | 88.0% |
252
+
253
+ ---
254
+
255
  ## Requirements
256
 
257
  ```