PIK GaNe
Pinned Loading
Repositories
- satisfia-marl Public Forked from paulvautravers/satisfia-marl
A repo to explore multi-agent reinforcement learning in the context of aspiration based, non-maximising agents. This project is part of the Supervised Program for Alignment Research.
pik-gane/satisfia-marl’s past year of commit activity - webppl-agents-satisfia Public Forked from agentmodels/webppl-agents
Webppl library for generating Gridworld MDPs. JS library for displaying Gridworld. Additional agents that satisfice.
pik-gane/webppl-agents-satisfia’s past year of commit activity - cleanrl-satisfia Public Forked from vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
pik-gane/cleanrl-satisfia’s past year of commit activity - stable-baselines3-contrib-satisfia Public Forked from Stable-Baselines-Team/stable-baselines3-contrib
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
pik-gane/stable-baselines3-contrib-satisfia’s past year of commit activity - alpaca_farm-collective Public Forked from tatsu-lab/alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
pik-gane/alpaca_farm-collective’s past year of commit activity - ai-safety-gridworlds-satisfia Public Forked from google-deepmind/ai-safety-gridworlds
This is a suite of reinforcement learning environments illustrating various safety properties of intelligent agents.
pik-gane/ai-safety-gridworlds-satisfia’s past year of commit activity