HumanCompatibleAI/ppo-CartPole-v1
Reinforcement Learning
• Updated • 706
HumanCompatibleAI/ppo-seals-HalfCheetah-v1
Reinforcement Learning
• Updated • 15
HumanCompatibleAI/sac-seals-Swimmer-v1
Reinforcement Learning
• Updated • 2
HumanCompatibleAI/sac-seals-Humanoid-v1
Reinforcement Learning
• Updated • 6
HumanCompatibleAI/sac-seals-Ant-v1
Reinforcement Learning
• Updated • 6
HumanCompatibleAI/sac-seals-HalfCheetah-v1
Reinforcement Learning
• Updated • 6
HumanCompatibleAI/sac-seals-Hopper-v1
Reinforcement Learning
• Updated • 13
HumanCompatibleAI/sac-seals-Walker2d-v1
Reinforcement Learning
• Updated • 5
HumanCompatibleAI/ppo-seals-Walker2d-v1
Reinforcement Learning
• Updated • 15
HumanCompatibleAI/ppo-seals-Humanoid-v1
Reinforcement Learning
• Updated • 23
HumanCompatibleAI/ppo-seals-Hopper-v1
Reinforcement Learning
• Updated • 22
HumanCompatibleAI/ppo-seals-Swimmer-v1
Reinforcement Learning
• Updated • 29
HumanCompatibleAI/ppo-seals-Ant-v1
Reinforcement Learning
• Updated • 20
HumanCompatibleAI/ppo-Pendulum-v1
Reinforcement Learning
• Updated • 17.4k
• 6
HumanCompatibleAI/ppo-seals-CartPole-v0
Reinforcement Learning
• Updated • 42.7k
• 17
HumanCompatibleAI/ppo-seals-MountainCar-v0
Reinforcement Learning
• Updated • 101
• 1
HumanCompatibleAI/sac-seals-Walker2d-v0
Reinforcement Learning
• Updated • 4
HumanCompatibleAI/ppo-seals-Walker2d-v0
Reinforcement Learning
• Updated • 6
HumanCompatibleAI/sac-seals-Humanoid-v0
Reinforcement Learning
• Updated • 1
HumanCompatibleAI/ppo-seals-Humanoid-v0
Reinforcement Learning
• Updated • 3
HumanCompatibleAI/sac-seals-Ant-v0
Reinforcement Learning
• Updated • 2
HumanCompatibleAI/ppo-seals-Hopper-v0
Reinforcement Learning
• Updated • 2
HumanCompatibleAI/sac-seals-Hopper-v0
Reinforcement Learning
• Updated HumanCompatibleAI/sac-seals-HalfCheetah-v0
Reinforcement Learning
• Updated • 6
HumanCompatibleAI/ppo-seals-HalfCheetah-v0
Reinforcement Learning
• Updated • 1
HumanCompatibleAI/sac-seals-Swimmer-v0
Reinforcement Learning
• Updated • 4
HumanCompatibleAI/ppo-seals-Swimmer-v0
Reinforcement Learning
• Updated HumanCompatibleAI/ppo-seals-Ant-v0
Reinforcement Learning
• Updated • 3
HumanCompatibleAI/ppo-AsteroidsNoFrameskip-v4
Reinforcement Learning
• Updated • 1