AdrianLlopart commited on
Commit
d592f2b
Β·
verified Β·
1 Parent(s): f1e4d4f

chore: publish rSkill OpenRAL/rskill-3d-diffuser-actor-rlbench v0.1.0

Browse files
Files changed (4) hide show
  1. README.md +6 -6
  2. SKILL.md +2 -2
  3. eval/rlbench.json +1 -1
  4. rskill.yaml +2 -2
README.md CHANGED
@@ -16,13 +16,13 @@ language:
16
 
17
  <!--
18
  rSkill README β€” 3D Diffuser Actor (RLBench PerAct setup).
19
- Discovery + provenance card; mirrors rskill.yaml. ADR-0061.
20
  -->
21
 
22
  # rskill-3d-diffuser-actor-rlbench
23
 
24
  3D Diffuser Actor β€” a diffusion policy over end-effector **keyposes** for RLBench,
25
- running on the CoppeliaSim/PyRep RLBench benchmark backend (ADR-0061).
26
 
27
  ## What this skill does
28
 
@@ -108,7 +108,7 @@ supervision).
108
 
109
  ```bash
110
  # One-time: provision CoppeliaSim 4.1.0 + PyRep + RLBench@peract + the checkpoint
111
- # in the py3.10 sidecar venv (see docs/adr/0061-rlbench-benchmark-backend.md).
112
  openral benchmark scene \
113
  --config scenes/benchmark/rlbench_open_drawer.yaml \
114
  --rskill rskills/3d-diffuser-actor-rlbench
@@ -116,7 +116,7 @@ openral benchmark scene \
116
 
117
  Inference VRAM peaks ~0.43 GB; runs comfortably on an 8 GB GPU. CoppeliaSim is
118
  proprietary (free EDU license) and is **never** vendored β€” it is an
119
- externally-provisioned dependency (CLAUDE.md Β§1.9 / ADR-0061).
120
 
121
  ## Evaluation
122
 
@@ -142,7 +142,7 @@ openral benchmark run --suite rlbench --rskill rskills/3d-diffuser-actor-rlbench
142
  > **Note on variance.** RLBench's sampling-based `EndEffectorPoseViaPlanning`
143
  > mover is non-deterministic, so per-task rates vary run-to-run; 3 of the 75
144
  > episodes hit a planner path-failure and are counted as failed episodes (the
145
- > sidecar handles them gracefully rather than aborting the run β€” ADR-0061).
146
  > Per-task paper baselines (Ke et al., 2402.10885, Table 1) are intentionally
147
  > not transcribed into the artifact to avoid mis-citation.
148
 
@@ -159,4 +159,4 @@ uses `license: mit` for the consumer-visible weight/runtime posture.
159
  - `scenes/benchmark/rlbench_meat_off_grill.yaml`
160
  - `scenes/benchmark/rlbench_close_jar.yaml`
161
  - `benchmarks/rlbench.yaml`
162
- - `docs/adr/0061-rlbench-benchmark-backend.md`
 
16
 
17
  <!--
18
  rSkill README β€” 3D Diffuser Actor (RLBench PerAct setup).
19
+ Discovery + provenance card; mirrors rskill.yaml. ADR-0062.
20
  -->
21
 
22
  # rskill-3d-diffuser-actor-rlbench
23
 
24
  3D Diffuser Actor β€” a diffusion policy over end-effector **keyposes** for RLBench,
25
+ running on the CoppeliaSim/PyRep RLBench benchmark backend (ADR-0062).
26
 
27
  ## What this skill does
28
 
 
108
 
109
  ```bash
110
  # One-time: provision CoppeliaSim 4.1.0 + PyRep + RLBench@peract + the checkpoint
111
+ # in the py3.10 sidecar venv (see docs/adr/0062-rlbench-benchmark-backend.md).
112
  openral benchmark scene \
113
  --config scenes/benchmark/rlbench_open_drawer.yaml \
114
  --rskill rskills/3d-diffuser-actor-rlbench
 
116
 
117
  Inference VRAM peaks ~0.43 GB; runs comfortably on an 8 GB GPU. CoppeliaSim is
118
  proprietary (free EDU license) and is **never** vendored β€” it is an
119
+ externally-provisioned dependency (CLAUDE.md Β§1.9 / ADR-0062).
120
 
121
  ## Evaluation
122
 
 
142
  > **Note on variance.** RLBench's sampling-based `EndEffectorPoseViaPlanning`
143
  > mover is non-deterministic, so per-task rates vary run-to-run; 3 of the 75
144
  > episodes hit a planner path-failure and are counted as failed episodes (the
145
+ > sidecar handles them gracefully rather than aborting the run β€” ADR-0062).
146
  > Per-task paper baselines (Ke et al., 2402.10885, Table 1) are intentionally
147
  > not transcribed into the artifact to avoid mis-citation.
148
 
 
159
  - `scenes/benchmark/rlbench_meat_off_grill.yaml`
160
  - `scenes/benchmark/rlbench_close_jar.yaml`
161
  - `benchmarks/rlbench.yaml`
162
+ - `docs/adr/0062-rlbench-benchmark-backend.md`
SKILL.md CHANGED
@@ -1,7 +1,7 @@
1
  ---
2
  name: 3d-diffuser-actor-rlbench
3
  description: >-
4
- S1 Vision-Language-Action policy. Capabilities: generalist, open, close, pick, place. 3D Diffuser Actor (Ke et al., 2024) β€” a diffusion policy over end-effector keyposes fusing multi-view RGB-D into a 3D scene representation, on the RLBench PerAct 18-task benchmark. Shares the out-of-process CoppeliaSim/PyRep sidecar with the rlbench scene backend (ADR-0061). MIT code + checkpoints. The PerAct checkpoint is loaded verbatim; ships three live-verified starter tasks. Discovery view of an OpenRAL rSkill β€” NOT directly runnable by an agent harness; it runs via rSkill.from_pretrained + the robot HAL.
5
  metadata:
6
  openral_rskill: true # generated discovery view of an rSkill
7
  schema_version: 0.1
@@ -36,7 +36,7 @@ metadata:
36
 
37
  ## What it is
38
 
39
- An OpenRAL **Vision-Language-Action policy** (`role: s1`, `kind: vla`). 3D Diffuser Actor (Ke et al., 2024) β€” a diffusion policy over end-effector keyposes fusing multi-view RGB-D into a 3D scene representation, on the RLBench PerAct 18-task benchmark. Shares the out-of-process CoppeliaSim/PyRep sidecar with the rlbench scene backend (ADR-0061). MIT code + checkpoints. The PerAct checkpoint is loaded verbatim; ships three live-verified starter tasks.
40
 
41
  ## Capabilities
42
 
 
1
  ---
2
  name: 3d-diffuser-actor-rlbench
3
  description: >-
4
+ S1 Vision-Language-Action policy. Capabilities: generalist, open, close, pick, place. 3D Diffuser Actor (Ke et al., 2024) β€” a diffusion policy over end-effector keyposes fusing multi-view RGB-D into a 3D scene representation, on the RLBench PerAct 18-task benchmark. Shares the out-of-process CoppeliaSim/PyRep sidecar with the rlbench scene backend (ADR-0062). MIT code + checkpoints. The PerAct checkpoint is loaded verbatim; ships three live-verified starter tasks. Discovery view of an OpenRAL rSkill β€” NOT directly runnable by an agent harness; it runs via rSkill.from_pretrained + the robot HAL.
5
  metadata:
6
  openral_rskill: true # generated discovery view of an rSkill
7
  schema_version: 0.1
 
36
 
37
  ## What it is
38
 
39
+ An OpenRAL **Vision-Language-Action policy** (`role: s1`, `kind: vla`). 3D Diffuser Actor (Ke et al., 2024) β€” a diffusion policy over end-effector keyposes fusing multi-view RGB-D into a 3D scene representation, on the RLBench PerAct 18-task benchmark. Shares the out-of-process CoppeliaSim/PyRep sidecar with the rlbench scene backend (ADR-0062). MIT code + checkpoints. The PerAct checkpoint is loaded verbatim; ships three live-verified starter tasks.
40
 
41
  ## Capabilities
42
 
eval/rlbench.json CHANGED
@@ -64,4 +64,4 @@
64
  },
65
  "baselines": {},
66
  "trace_id": null
67
- }
 
64
  },
65
  "baselines": {},
66
  "trace_id": null
67
+ }
rskill.yaml CHANGED
@@ -6,7 +6,7 @@
6
  # LICENSE: MIT (code + released checkpoints) β€” commercially permissive. No
7
  # license guard needed (unlike RVT/RVT-2, which are NVIDIA non-commercial).
8
  #
9
- # RUNTIME: auto-managed out-of-process sidecar (ZMQ + msgpack), ADR-0061. The
10
  # policy AND the CoppeliaSim/PyRep RLBench scene run in their own externally-
11
  # provisioned py3.10 venv (CoppeliaSim is proprietary, free-EDU, NEVER vendored).
12
  # The openral adapter (openral_sim.policies.rlbench_3dda) forks
@@ -93,7 +93,7 @@ description: >
93
  3D Diffuser Actor (Ke et al., 2024) β€” a diffusion policy over end-effector
94
  keyposes fusing multi-view RGB-D into a 3D scene representation, on the RLBench
95
  PerAct 18-task benchmark. Shares the out-of-process CoppeliaSim/PyRep sidecar
96
- with the rlbench scene backend (ADR-0061). MIT code + checkpoints. The PerAct
97
  checkpoint is loaded verbatim; ships three live-verified starter tasks.
98
 
99
  # ADR-0022 β€” action vocabulary surfaced to the reasoner LLM tool palette.
 
6
  # LICENSE: MIT (code + released checkpoints) β€” commercially permissive. No
7
  # license guard needed (unlike RVT/RVT-2, which are NVIDIA non-commercial).
8
  #
9
+ # RUNTIME: auto-managed out-of-process sidecar (ZMQ + msgpack), ADR-0062. The
10
  # policy AND the CoppeliaSim/PyRep RLBench scene run in their own externally-
11
  # provisioned py3.10 venv (CoppeliaSim is proprietary, free-EDU, NEVER vendored).
12
  # The openral adapter (openral_sim.policies.rlbench_3dda) forks
 
93
  3D Diffuser Actor (Ke et al., 2024) β€” a diffusion policy over end-effector
94
  keyposes fusing multi-view RGB-D into a 3D scene representation, on the RLBench
95
  PerAct 18-task benchmark. Shares the out-of-process CoppeliaSim/PyRep sidecar
96
+ with the rlbench scene backend (ADR-0062). MIT code + checkpoints. The PerAct
97
  checkpoint is loaded verbatim; ships three live-verified starter tasks.
98
 
99
  # ADR-0022 β€” action vocabulary surfaced to the reasoner LLM tool palette.