| # SQLEnv Team Coordination |
|
|
| ## Team |
|
|
| | Person | Role | Contact | |
| |--------|------|---------| |
| | You (Hjerp) | Coordinator | - | |
| | Kevlar | Contributor | - | |
| | Jindal | Contributor | - | |
|
|
| --- |
|
|
| ## Kickoff Meeting - Feb 9 |
|
|
| ### 1. Personal Wins |
|
|
| > "What would make this project a win for you personally?" |
|
|
| | Person | Personal Win | |
| |--------|--------------| |
| | You | | |
| | Kevlar | | |
| | Jindal | | |
|
|
| ### 2. Availability |
|
|
| | Person | Hours/Week | Best Times | Known Conflicts | |
| |--------|------------|------------|-----------------| |
| | You | | | | |
| | Kevlar | | | | |
| | Jindal | | | | |
|
|
| **Total available:** _____ hours/week |
| **Estimated need:** ~40-60 hours total over 16 days (~15-20 hrs/week combined) |
| **Gap?** If yes, scope discussion needed. |
|
|
| ### 3. Ownership Areas |
|
|
| From project brief, 5 natural ownership areas. Let people **claim** - don't assign. |
|
|
| | Area | Owner | Notes | |
| |------|-------|-------| |
| | Environment Engineering | | OpenEnv integration, WebSocket, Docker, action handlers | |
| | Reward Design | | 3-layer rewards, progress metrics, anti-gaming | |
| | Dataset Curation | | Spider questions, answer verification, difficulty balance | |
| | Training Pipeline | | GRPO setup, prompts, evaluation, Green Agent | |
| | Storytelling/Blog | | Blog post, demos, results visualization | |
|
|
| **Note:** Some areas can be shared or split. Training Pipeline depends on Environment + Reward being done first. |
|
|
| --- |
|
|
| ## First Deliverables (Full Tickets) |
|
|
| Use **full format** for first deliverables (commitment test). |
|
|
| ### Ticket 1 |
|
|
| **WHO:** |
| **WHAT:** |
| **WHY:** |
| **CHALLENGE:** |
| **DUE:** |
| **DELIVERED:** |
|
|
| ### Ticket 2 |
|
|
| **WHO:** |
| **WHAT:** |
| **WHY:** |
| **CHALLENGE:** |
| **DUE:** |
| **DELIVERED:** |
|
|
| ### Ticket 3 |
|
|
| **WHO:** |
| **WHAT:** |
| **WHY:** |
| **CHALLENGE:** |
| **DUE:** |
| **DELIVERED:** |
|
|
| --- |
|
|
| ## Suggested First Deliverables (If Needed) |
|
|
| Based on Phase 1 requirements, here are ready-to-use tickets: |
|
|
| ### Environment Scaffold |
|
|
| **WHO:** [Claim during meeting] |
| **WHAT:** Run `openenv init sql_env`, customize Pydantic models (SQLAction, SQLObservation), get `openenv validate` passing |
| **WHY:** Unblocks all environment work; proves Docker/WebSocket setup works |
| **CHALLENGE:** Docker configuration; WebSocket timeout settings; understanding OpenEnv API |
| **DUE:** [Wednesday EOD?] |
| **DELIVERED:** |
|
|
| ### Initial Question Set |
|
|
| **WHO:** [Claim during meeting] |
| **WHAT:** Select 30 questions from Spider dev set (12 easy, 12 medium, 6 hard) with gold answers in JSON format |
| **WHY:** Enables manual testing of environment in Phase 2; needed for reward computation |
| **CHALLENGE:** Balancing difficulty; avoiding questions needing unsupported SQL features |
| **DUE:** [Wednesday EOD?] |
| **DELIVERED:** |
|
|
| ### OpenEnv Tutorial Review |
|
|
| **WHO:** [Claim during meeting] |
| **WHAT:** Complete OpenEnv tutorial notebook, document key learnings and gotchas for team |
| **WHY:** Reduces ramp-up time for others; surfaces unknowns early |
| **CHALLENGE:** Tutorial may have gaps; need to map to our SQL use case |
| **DUE:** [Tuesday EOD?] |
| **DELIVERED:** |
|
|
| --- |
|
|
| ## Coordination Rituals |
|
|
| ### Daily Async Standup |
|
|
| Post in shared doc/channel (30 seconds): |
|
|
| ``` |
| [Date] [Name] |
| Did: [What you accomplished] |
| Blocked: [Nothing / specific blocker] |
| ``` |
|
|
| **Where:** [TBD - decide in kickoff] |
|
|
| ### Weekly Sync |
|
|
| **When:** [TBD - decide in kickoff] |
| **Duration:** 30 min |
| **Agenda:** |
| 1. Blockers (10 min) - resolve or escalate |
| 2. Decisions (10 min) - use decision format below |
| 3. Next deliverables (10 min) - create tickets for next week |
|
|
| --- |
|
|
| ## Decisions Log |
|
|
| ### Decision: [Topic] |
|
|
| **Context:** [Why needed now] |
|
|
| **Options:** |
| 1. [Option A] |
| - Pro: |
| - Con: |
|
|
| 2. [Option B] |
| - Pro: |
| - Con: |
|
|
| **Recommendation:** |
|
|
| **Decided:** [Date] - [Choice] - [Who consulted] |
|
|
| --- |
|
|
| ## Open Questions for Kickoff |
|
|
| From project brief - need team input: |
|
|
| 1. **Reward components**: Expose as separate rewards to TRL, or sum into single scalar? |
| 2. **Question selection**: Hand-pick for diversity, or random sample by difficulty? |
| 3. **HINT action**: Add a hint mechanism, or keep it pure exploration? |
|
|
| --- |
|
|
| ## Timeline Summary |
|
|
| | Phase | Days | Key Milestone | Dependencies | |
| |-------|------|---------------|--------------| |
| | 1. Scaffold | 1-2 | `openenv validate` passes | None | |
| | 2. Core Loop | 3-5 | Full episode works manually | Phase 1 | |
| | 3. Dense Reward | 6-8 | Reward varies meaningfully | Phase 2 | |
| | 4. Training | 9-13 | Trained model beats random | Phases 1-3 | |
| | 5. Polish | 14-16 | All artifacts submitted | Phase 4 | |
|
|
| **Submission deadline:** ~16 days from kickoff |
|
|
| --- |
|
|
| ## Communication Channels |
|
|
| | Channel | Purpose | |
| |---------|---------| |
| | [TBD] | Daily standups | |
| | [TBD] | Quick questions / blockers | |
| | [TBD] | Code (GitHub repo) | |
| | Google Drive | Shared docs (this doc, project brief) | |
|
|
| --- |
|
|
| ## Next Sync |
|
|
| **When:** [Fill in after kickoff] |
| **Where:** [Fill in after kickoff] |
|
|
| --- |
|
|
| ## Completed Tickets Archive |
|
|
| Move completed tickets here with DELIVERED filled in. |
|
|
| *(None yet)* |
|
|