--- title: MiniGridEnv Blog emoji: 🐠 colorFrom: green colorTo: pink sdk: static pinned: false license: apache-2.0 short_description: Blog for MiniGridEnv for OpenEnv Comp in AgentX --- # MiniGridEnv Blog Static blog post for the OpenEnv track of the AgentX competition (UC Berkeley RDI), covering: - An OpenEnv-native wrap of Farama's MiniGrid / BabyAI with text observations and NL actions. - GRPO post-training (`MiniGridPT`) with **cross-episodic, LLM-rewritten, line-budgeted markdown memory**. - **Branch-stable** memory-file naming so each GRPO chain keeps a stable file across optimizer steps. ## Files - `index.html` — main blog (self-contained: inline CSS, Mermaid via CDN). - `banner.png` — 3-panel hero image (Observe → Act → Remember). - `style.css` — legacy placeholder from the Spaces scaffold; `index.html` inlines all styling. ## Rebuild the banner The banner is generated from a matplotlib script kept with the other impl docs: ```bash # from the repo root python impl-context/build_blog_images.py # writes MiniGridEnv_Blog/banner.png at 200 DPI ``` Dependencies: `pip install matplotlib numpy`. ## Open locally ```bash open MiniGridEnv_Blog/index.html # or: python -m http.server --directory MiniGridEnv_Blog 8080 ``` ## `` placeholders The blog ships with a handful of `` placeholders that must be filled before publishing: - `` — repo URL (hero badges, buttons, quickstart `git clone`, footer). - `` — live environment Space (topnav, hero buttons, footer). - `` / `` / `` — arXiv links in the Foundations table (pre-filled paper IDs are in the surrounding text: `2305.16291`, `2303.11366`, `2304.03442`). - `` — sibling project Space in the Foundations table. - `` cells in the Results table — measured completion rates for GRPO and GRPO+Memory per level once converged checkpoints are available. - `` — optional: replace the illustrative memory-evolution cards with verbatim snapshots after a memory-mode training run. See the Spaces configuration reference at https://huggingface.co/docs/hub/spaces-config-reference.