omsoul commited on
Commit
6fc7d9b
·
verified ·
1 Parent(s): 61798fb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +73 -2
README.md CHANGED
@@ -1,6 +1,6 @@
1
  # Model Description
2
  ### Overview
3
- This model detects hand gestures that can be used as input controls for video games. It uses object detection to recognize specific hand poses from a webcam or standard camera and translate them into game actions.
4
  The goal of the project is to explore whether computer vision–based gesture recognition can provide a low-cost and accessible alternative to traditional game controllers.
5
 
6
  ### Training Approach
@@ -13,4 +13,75 @@ The model was trained from pretrained YOLOv8n weights and fine-tuned on a custom
13
  * Interactive displays
14
  * Public kiosks
15
  * Smart home media controls
16
- * Desktop navigation
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  # Model Description
2
  ### Overview
3
+ This model detects hand gestures for use as input controls for video games. It uses object detection to recognize specific hand poses from a webcam or standard camera and translate them into game actions.
4
  The goal of the project is to explore whether computer vision–based gesture recognition can provide a low-cost and accessible alternative to traditional game controllers.
5
 
6
  ### Training Approach
 
13
  * Interactive displays
14
  * Public kiosks
15
  * Smart home media controls
16
+ * Desktop navigation
17
+
18
+ # Training Data
19
+ ### Dataset Sources
20
+ **The training dataset was constructed from two sources:**
21
+ Rock-Paper-Scissors dataset
22
+ * Source: Roboflow Universe
23
+ * Creator: Audrey
24
+ * Used for the first three gesture classes
25
+ * Dataset URL: https://universe.roboflow.com/audrey-x3i6m/rps-knmjj
26
+
27
+ Custom gesture dataset
28
+ * Created by recording a 30-second video of the author performing gestures
29
+ * Video parsed into frames at 10 frames per second
30
+ * Images manually selected and annotated
31
+
32
+ **Dataset Size**
33
+ | Category | Count |
34
+ | ---------------- | --------- |
35
+ | Original Images | 444 |
36
+ | Augmented Images | 1066 |
37
+ | Image Resolution | 512 × 512 |
38
+
39
+ **Class Distribution**
40
+ | Class | Gesture | Annotation Count |
41
+ | -------- | ----------- | ---------------- |
42
+ | Forward | Open Palm | 169 |
43
+ | Backward | Closed Fist | 210 |
44
+ | Jump | Peace Sign | 187 |
45
+ | Attack | Thumbs Up | 121 |
46
+
47
+ ### Data Collection Methodology
48
+ The dataset combines stock gesture images with a custom dataset created from recorded video frames.
49
+
50
+ **The custom dataset was generated by:**
51
+ * Recording a short gesture demonstration video
52
+ * Extracting frames at 10 FPS
53
+ * Selecting usable frames
54
+ * Annotating gesture bounding boxes
55
+ * This process produced 236 custom images that were merged with the stock dataset.
56
+
57
+ ### Annotation Process
58
+ All annotations were created manually using Roboflow.
59
+ Bounding boxes were drawn around the visible hand gesture in each image.
60
+ Due to missing annotation metadata from the original dataset, all 444 images were annotated manually.
61
+ Estimated annotation time: 2–3 hours
62
+
63
+ ### Train / Validation / Test Split
64
+ | Dataset Split | Image Count |
65
+ | ------------- | ----------- |
66
+ | Training | 933 |
67
+ | Validation | 88 |
68
+ | Test | 45 |
69
+
70
+ ### Data Augmentation
71
+ **The following augmentations were applied:**
72
+ * Rotation: ±15 degrees
73
+ * Saturation adjustment: ±30%
74
+
75
+ *These augmentations expanded the dataset from 444 to 1066 images.*
76
+
77
+ ### Dataset Availability
78
+ Dataset availability: https://universe.roboflow.com/b-data-497-ws/hand-gesture-controls
79
+
80
+ ### Known Dataset Biases and Limitations
81
+ * Small dataset size
82
+ * Class imbalance (thumbs-up has fewer examples)
83
+ * Mixed image quality between stock and custom images
84
+ * Limited diversity in backgrounds and lighting conditions
85
+ * Limited number of subjects (primarily one person)
86
+
87
+ *These factors may affect model generalization.*