chore: publish rSkill OpenRAL/rskill-3d-diffuser-actor-rlbench v0.1.0
Browse files- README.md +6 -6
- SKILL.md +2 -2
- eval/rlbench.json +1 -1
- rskill.yaml +2 -2
README.md
CHANGED
|
@@ -16,13 +16,13 @@ language:
|
|
| 16 |
|
| 17 |
<!--
|
| 18 |
rSkill README β 3D Diffuser Actor (RLBench PerAct setup).
|
| 19 |
-
Discovery + provenance card; mirrors rskill.yaml. ADR-
|
| 20 |
-->
|
| 21 |
|
| 22 |
# rskill-3d-diffuser-actor-rlbench
|
| 23 |
|
| 24 |
3D Diffuser Actor β a diffusion policy over end-effector **keyposes** for RLBench,
|
| 25 |
-
running on the CoppeliaSim/PyRep RLBench benchmark backend (ADR-
|
| 26 |
|
| 27 |
## What this skill does
|
| 28 |
|
|
@@ -108,7 +108,7 @@ supervision).
|
|
| 108 |
|
| 109 |
```bash
|
| 110 |
# One-time: provision CoppeliaSim 4.1.0 + PyRep + RLBench@peract + the checkpoint
|
| 111 |
-
# in the py3.10 sidecar venv (see docs/adr/
|
| 112 |
openral benchmark scene \
|
| 113 |
--config scenes/benchmark/rlbench_open_drawer.yaml \
|
| 114 |
--rskill rskills/3d-diffuser-actor-rlbench
|
|
@@ -116,7 +116,7 @@ openral benchmark scene \
|
|
| 116 |
|
| 117 |
Inference VRAM peaks ~0.43 GB; runs comfortably on an 8 GB GPU. CoppeliaSim is
|
| 118 |
proprietary (free EDU license) and is **never** vendored β it is an
|
| 119 |
-
externally-provisioned dependency (CLAUDE.md Β§1.9 / ADR-
|
| 120 |
|
| 121 |
## Evaluation
|
| 122 |
|
|
@@ -142,7 +142,7 @@ openral benchmark run --suite rlbench --rskill rskills/3d-diffuser-actor-rlbench
|
|
| 142 |
> **Note on variance.** RLBench's sampling-based `EndEffectorPoseViaPlanning`
|
| 143 |
> mover is non-deterministic, so per-task rates vary run-to-run; 3 of the 75
|
| 144 |
> episodes hit a planner path-failure and are counted as failed episodes (the
|
| 145 |
-
> sidecar handles them gracefully rather than aborting the run β ADR-
|
| 146 |
> Per-task paper baselines (Ke et al., 2402.10885, Table 1) are intentionally
|
| 147 |
> not transcribed into the artifact to avoid mis-citation.
|
| 148 |
|
|
@@ -159,4 +159,4 @@ uses `license: mit` for the consumer-visible weight/runtime posture.
|
|
| 159 |
- `scenes/benchmark/rlbench_meat_off_grill.yaml`
|
| 160 |
- `scenes/benchmark/rlbench_close_jar.yaml`
|
| 161 |
- `benchmarks/rlbench.yaml`
|
| 162 |
-
- `docs/adr/
|
|
|
|
| 16 |
|
| 17 |
<!--
|
| 18 |
rSkill README β 3D Diffuser Actor (RLBench PerAct setup).
|
| 19 |
+
Discovery + provenance card; mirrors rskill.yaml. ADR-0062.
|
| 20 |
-->
|
| 21 |
|
| 22 |
# rskill-3d-diffuser-actor-rlbench
|
| 23 |
|
| 24 |
3D Diffuser Actor β a diffusion policy over end-effector **keyposes** for RLBench,
|
| 25 |
+
running on the CoppeliaSim/PyRep RLBench benchmark backend (ADR-0062).
|
| 26 |
|
| 27 |
## What this skill does
|
| 28 |
|
|
|
|
| 108 |
|
| 109 |
```bash
|
| 110 |
# One-time: provision CoppeliaSim 4.1.0 + PyRep + RLBench@peract + the checkpoint
|
| 111 |
+
# in the py3.10 sidecar venv (see docs/adr/0062-rlbench-benchmark-backend.md).
|
| 112 |
openral benchmark scene \
|
| 113 |
--config scenes/benchmark/rlbench_open_drawer.yaml \
|
| 114 |
--rskill rskills/3d-diffuser-actor-rlbench
|
|
|
|
| 116 |
|
| 117 |
Inference VRAM peaks ~0.43 GB; runs comfortably on an 8 GB GPU. CoppeliaSim is
|
| 118 |
proprietary (free EDU license) and is **never** vendored β it is an
|
| 119 |
+
externally-provisioned dependency (CLAUDE.md Β§1.9 / ADR-0062).
|
| 120 |
|
| 121 |
## Evaluation
|
| 122 |
|
|
|
|
| 142 |
> **Note on variance.** RLBench's sampling-based `EndEffectorPoseViaPlanning`
|
| 143 |
> mover is non-deterministic, so per-task rates vary run-to-run; 3 of the 75
|
| 144 |
> episodes hit a planner path-failure and are counted as failed episodes (the
|
| 145 |
+
> sidecar handles them gracefully rather than aborting the run β ADR-0062).
|
| 146 |
> Per-task paper baselines (Ke et al., 2402.10885, Table 1) are intentionally
|
| 147 |
> not transcribed into the artifact to avoid mis-citation.
|
| 148 |
|
|
|
|
| 159 |
- `scenes/benchmark/rlbench_meat_off_grill.yaml`
|
| 160 |
- `scenes/benchmark/rlbench_close_jar.yaml`
|
| 161 |
- `benchmarks/rlbench.yaml`
|
| 162 |
+
- `docs/adr/0062-rlbench-benchmark-backend.md`
|
SKILL.md
CHANGED
|
@@ -1,7 +1,7 @@
|
|
| 1 |
---
|
| 2 |
name: 3d-diffuser-actor-rlbench
|
| 3 |
description: >-
|
| 4 |
-
S1 Vision-Language-Action policy. Capabilities: generalist, open, close, pick, place. 3D Diffuser Actor (Ke et al., 2024) β a diffusion policy over end-effector keyposes fusing multi-view RGB-D into a 3D scene representation, on the RLBench PerAct 18-task benchmark. Shares the out-of-process CoppeliaSim/PyRep sidecar with the rlbench scene backend (ADR-
|
| 5 |
metadata:
|
| 6 |
openral_rskill: true # generated discovery view of an rSkill
|
| 7 |
schema_version: 0.1
|
|
@@ -36,7 +36,7 @@ metadata:
|
|
| 36 |
|
| 37 |
## What it is
|
| 38 |
|
| 39 |
-
An OpenRAL **Vision-Language-Action policy** (`role: s1`, `kind: vla`). 3D Diffuser Actor (Ke et al., 2024) β a diffusion policy over end-effector keyposes fusing multi-view RGB-D into a 3D scene representation, on the RLBench PerAct 18-task benchmark. Shares the out-of-process CoppeliaSim/PyRep sidecar with the rlbench scene backend (ADR-
|
| 40 |
|
| 41 |
## Capabilities
|
| 42 |
|
|
|
|
| 1 |
---
|
| 2 |
name: 3d-diffuser-actor-rlbench
|
| 3 |
description: >-
|
| 4 |
+
S1 Vision-Language-Action policy. Capabilities: generalist, open, close, pick, place. 3D Diffuser Actor (Ke et al., 2024) β a diffusion policy over end-effector keyposes fusing multi-view RGB-D into a 3D scene representation, on the RLBench PerAct 18-task benchmark. Shares the out-of-process CoppeliaSim/PyRep sidecar with the rlbench scene backend (ADR-0062). MIT code + checkpoints. The PerAct checkpoint is loaded verbatim; ships three live-verified starter tasks. Discovery view of an OpenRAL rSkill β NOT directly runnable by an agent harness; it runs via rSkill.from_pretrained + the robot HAL.
|
| 5 |
metadata:
|
| 6 |
openral_rskill: true # generated discovery view of an rSkill
|
| 7 |
schema_version: 0.1
|
|
|
|
| 36 |
|
| 37 |
## What it is
|
| 38 |
|
| 39 |
+
An OpenRAL **Vision-Language-Action policy** (`role: s1`, `kind: vla`). 3D Diffuser Actor (Ke et al., 2024) β a diffusion policy over end-effector keyposes fusing multi-view RGB-D into a 3D scene representation, on the RLBench PerAct 18-task benchmark. Shares the out-of-process CoppeliaSim/PyRep sidecar with the rlbench scene backend (ADR-0062). MIT code + checkpoints. The PerAct checkpoint is loaded verbatim; ships three live-verified starter tasks.
|
| 40 |
|
| 41 |
## Capabilities
|
| 42 |
|
eval/rlbench.json
CHANGED
|
@@ -64,4 +64,4 @@
|
|
| 64 |
},
|
| 65 |
"baselines": {},
|
| 66 |
"trace_id": null
|
| 67 |
-
}
|
|
|
|
| 64 |
},
|
| 65 |
"baselines": {},
|
| 66 |
"trace_id": null
|
| 67 |
+
}
|
rskill.yaml
CHANGED
|
@@ -6,7 +6,7 @@
|
|
| 6 |
# LICENSE: MIT (code + released checkpoints) β commercially permissive. No
|
| 7 |
# license guard needed (unlike RVT/RVT-2, which are NVIDIA non-commercial).
|
| 8 |
#
|
| 9 |
-
# RUNTIME: auto-managed out-of-process sidecar (ZMQ + msgpack), ADR-
|
| 10 |
# policy AND the CoppeliaSim/PyRep RLBench scene run in their own externally-
|
| 11 |
# provisioned py3.10 venv (CoppeliaSim is proprietary, free-EDU, NEVER vendored).
|
| 12 |
# The openral adapter (openral_sim.policies.rlbench_3dda) forks
|
|
@@ -93,7 +93,7 @@ description: >
|
|
| 93 |
3D Diffuser Actor (Ke et al., 2024) β a diffusion policy over end-effector
|
| 94 |
keyposes fusing multi-view RGB-D into a 3D scene representation, on the RLBench
|
| 95 |
PerAct 18-task benchmark. Shares the out-of-process CoppeliaSim/PyRep sidecar
|
| 96 |
-
with the rlbench scene backend (ADR-
|
| 97 |
checkpoint is loaded verbatim; ships three live-verified starter tasks.
|
| 98 |
|
| 99 |
# ADR-0022 β action vocabulary surfaced to the reasoner LLM tool palette.
|
|
|
|
| 6 |
# LICENSE: MIT (code + released checkpoints) β commercially permissive. No
|
| 7 |
# license guard needed (unlike RVT/RVT-2, which are NVIDIA non-commercial).
|
| 8 |
#
|
| 9 |
+
# RUNTIME: auto-managed out-of-process sidecar (ZMQ + msgpack), ADR-0062. The
|
| 10 |
# policy AND the CoppeliaSim/PyRep RLBench scene run in their own externally-
|
| 11 |
# provisioned py3.10 venv (CoppeliaSim is proprietary, free-EDU, NEVER vendored).
|
| 12 |
# The openral adapter (openral_sim.policies.rlbench_3dda) forks
|
|
|
|
| 93 |
3D Diffuser Actor (Ke et al., 2024) β a diffusion policy over end-effector
|
| 94 |
keyposes fusing multi-view RGB-D into a 3D scene representation, on the RLBench
|
| 95 |
PerAct 18-task benchmark. Shares the out-of-process CoppeliaSim/PyRep sidecar
|
| 96 |
+
with the rlbench scene backend (ADR-0062). MIT code + checkpoints. The PerAct
|
| 97 |
checkpoint is loaded verbatim; ships three live-verified starter tasks.
|
| 98 |
|
| 99 |
# ADR-0022 β action vocabulary surfaced to the reasoner LLM tool palette.
|