Spaces:

yashu2000
/

MiniGridEnv_Blog

Running

App Files Files Community

MiniGridEnv_Blog / README.md

yashu2000

Blog Render issue Update

b997527 verified about 16 hours ago

preview code

raw

history blame contribute delete

2.32 kB

metadata

title: MiniGridEnv Blog
emoji: 🐠
colorFrom: green
colorTo: pink
sdk: static
pinned: false
license: apache-2.0
short_description: Blog for MiniGridEnv for OpenEnv Comp in AgentX

MiniGridEnv Blog

Static blog post for the OpenEnv track of the AgentX competition (UC Berkeley RDI), covering:

An OpenEnv-native wrap of Farama's MiniGrid / BabyAI with text observations and NL actions.
GRPO post-training (MiniGridPT) with cross-episodic, LLM-rewritten, line-budgeted markdown memory.
Branch-stable memory-file naming so each GRPO chain keeps a stable file across optimizer steps.

Files

index.html — main blog (self-contained: inline CSS, Mermaid via CDN).
banner.png — 3-panel hero image (Observe → Act → Remember).
style.css — legacy placeholder from the Spaces scaffold; index.html inlines all styling.

Rebuild the banner

The banner is generated from a matplotlib script kept with the other impl docs:

# from the repo root
python impl-context/build_blog_images.py
# writes MiniGridEnv_Blog/banner.png at 200 DPI

Dependencies: pip install matplotlib numpy.

Open locally

open MiniGridEnv_Blog/index.html
# or: python -m http.server --directory MiniGridEnv_Blog 8080

`<INSERT>` placeholders

The blog ships with a handful of <INSERT: ...> placeholders that must be filled before publishing:

<INSERT: GitHub URL> — repo URL (hero badges, buttons, quickstart git clone, footer).
<INSERT: HF Space URL> — live environment Space (topnav, hero buttons, footer).
<INSERT: Voyager arXiv URL> / <INSERT: Reflexion arXiv URL> / <INSERT: Generative Agents arXiv URL> — arXiv links in the Foundations table (pre-filled paper IDs are in the surrounding text: 2305.16291, 2303.11366, 2304.03442).
<INSERT: Lottery HF Space URL> — sibling project Space in the Foundations table.
<INSERT> cells in the Results table — measured completion rates for GRPO and GRPO+Memory per level once converged checkpoints are available.
<INSERT: verbatim memory snapshot per checkpoint> — optional: replace the illustrative memory-evolution cards with verbatim snapshots after a memory-mode training run.

See the Spaces configuration reference at https://huggingface.co/docs/hub/spaces-config-reference.

MiniGridEnv Blog

Files

Rebuild the banner

Open locally

<INSERT> placeholders

`<INSERT>` placeholders