spec_version: 1
name: autodatalab_plus
type: space
runtime: fastapi
app: server.app:app
port: 7860

description: >
  Multi-agent company simulation where a trainable Chief of Staff routes work
  across Data Analyst, Finance, HR, and Strategy specialists to solve
  realistic business briefing tasks.

tags:
  - openenv
  - multi-agent
  - enterprise
  - reinforcement-learning
  - orchestration

tasks:
  - id: easy_brief
    difficulty: easy
    grader: ceo_brief_env.graders:grade_episode
    summary: >
      Revenue sanity check: trust the raw sales file, estimate next-quarter
      revenue, and send the team an internal update.
  - id: medium_brief
    difficulty: medium
    grader: ceo_brief_env.graders:grade_episode
    summary: >
      Identify the growth-driving category, forecast next-quarter revenue,
      recommend action, and communicate to department leads.
  - id: hard_brief
    difficulty: hard
    grader: ceo_brief_env.graders:grade_episode
    summary: >
      Produce a board-style summary with risk flags, forecast, strategy, and a
      CFO-facing memo.
  - id: expert_brief
    difficulty: hard
    grader: ceo_brief_env.graders:grade_episode
    summary: >
      Full specialist stack: analytics, finance forecast, NVDA/AAPL/JPM strategy
      line, and executive memo; use RAG and Stooq when advanced mode is on.
  - id: risk_brief
    difficulty: hard
    grader: ceo_brief_env.graders:grade_episode
    summary: >
      Board-level risk brief after an internal audit flag: quantify data
      issues, finance downside and break-even sensitivity, NVDA/AAPL/JPM
      stance under risk controls, and an HR/comms memo for the audit
      committee.
  - id: crisis_brief
    difficulty: hard
    grader: ceo_brief_env.graders:grade_episode
    summary: >
      Mid-quarter shock and hiring freeze: tight data-quality read, finance
      variance and break-even sensitivity, defensive NVDA/AAPL/JPM strategy,
      and an HR/comms memo to CEO/CFO/COO that names variance, projection,
      strategy, and break-even.