# Training Data

### Dataset Sources

**The training dataset was constructed from two sources:**

Rock-Paper-Scissors dataset

* Source: Roboflow Universe
Custom gesture dataset

* Video parsed into frames at 10 frames per second
* Images manually selected and annotated
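The frame-parsing step above can be sketched in plain Python. This is an illustrative helper, not the authors' script: the function name and the 30 fps source rate are assumptions; sampling a video down to 10 frames per second amounts to keeping roughly every Nth frame.

```python
def sample_frame_indices(total_frames: int, src_fps: float,
                         target_fps: float = 10.0) -> list[int]:
    """Return indices of frames to keep so a video recorded at src_fps
    is sampled down to roughly target_fps frames per second."""
    if target_fps >= src_fps:
        return list(range(total_frames))  # nothing to drop
    step = src_fps / target_fps  # e.g. 30 fps -> 10 fps gives step 3.0
    indices, i = [], 0.0
    while round(i) < total_frames:
        indices.append(round(i))
        i += step
    return indices

# A 30 fps clip with 90 frames sampled at 10 fps keeps every 3rd frame.
print(sample_frame_indices(90, 30.0))  # → [0, 3, 6, ..., 87]
```

The kept frames would then be decoded and written out for manual selection and annotation.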
### Dataset Size

| Category         | Count     |
| ---------------- | --------- |
| Original Images  | 444       |
| Augmented Images | 1066      |
| Image Resolution | 512 × 512 |
### Class Distribution

| Class   | Gesture   | Annotation Count |
| ------- | --------- | ---------------- |
| Forward | Open Palm | 169              |
Dataset availability: https://universe.roboflow.com/b-data-497-ws/hand-gesture-c

* Limited diversity in backgrounds and lighting conditions
* Limited number of subjects (primarily one person)

*These factors may affect model generalization.*
# Training Procedure

### Framework

Training was performed using the Ultralytics YOLO framework.
### Model Architecture

Base model: YOLOv8n (Nano)

**Reasons for selection:**

* Lightweight architecture
* Low inference latency
* Lower hardware requirements
* Faster training times
* Suitable for real-time applications
### Training Configuration

| Parameter               | Value                        |
| ----------------------- | ---------------------------- |
| Epochs                  | 200 (training stopped early) |
| Early stopping patience | 10                           |
| Image size              | 512 × 512                    |
| Batch size              | 64                           |
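The configuration above maps directly onto the Ultralytics YOLO command-line interface. A sketch of what the training invocation could look like, assuming the `ultralytics` package is installed and a dataset definition at `data.yaml` (that path is an assumption, not from this README):

```shell
# Sketch only: the data.yaml path is assumed.
# epochs=200 is the upper bound; patience=10 enables early stopping.
yolo detect train model=yolov8n.pt data=data.yaml \
    epochs=200 patience=10 imgsz=512 batch=64
```

Starting from `yolov8n.pt` matches the fine-tuning setup described here: the pretrained Nano weights are loaded and updated on the custom gesture data.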
### Training Hardware

| Component     | Specification   |
| ------------- | --------------- |
| GPU           | A100 (High-RAM) |
| VRAM          | 80 GB           |
| Training Time | ~40 minutes     |
### Preprocessing Steps

* Images resized to 512 × 512
* Bounding box annotations normalized
* Augmented images generated before training