# mcts

Here are 379 public repositories matching this topic...

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

reinforcement-learning mathematics coding mcts strawberry llm chain-of-thought openai-o1

Updated Dec 5, 2024

suragnair / alpha-zero-general

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

reinforcement-learning deep-learning neural-network tensorflow keras pytorch mcts othello gomoku monte-carlo-tree-search gobang alphago tf alphago-zero alpha-zero alphazero self-play

Updated Jun 6, 2024
Jupyter Notebook

junxiaosong / AlphaZero_Gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

board-game reinforcement-learning tensorflow pytorch mcts gomoku rl monte-carlo-tree-search self-learning gobang alphago alphago-zero alphazero

Updated Apr 24, 2024
Python

werner-duvaud / muzero-general

MuZero

machine-learning reinforcement-learning deep-learning neural-network deep-reinforcement-learning python3 pytorch gym mcts rl tensorboard residual-network monte-carlo-tree-search self-learning alphago model-based-rl alphazero muzero muzero-general

Updated Sep 3, 2024
Python

opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Updated Dec 3, 2024
Python

chauvinSimon / My_Bibliography_for_Research_on_Autonomous_Driving

Personal notes about scientific and research works on "Decision-Making for Autonomous Driving"

Updated Dec 15, 2020

s-casci / tinyzero

Easily train AlphaZero-like agents on any environment you want!

reinforcement-learning mcts alphazero

Updated Jan 11, 2024
Python

hrpan / tetris_mcts

MCTS project for Tetris

game reinforcement-learning deep-learning tetris mcts tetris-bots

Updated Oct 9, 2024
Python

dylandjian / SuperGo

A student implementation of AlphaGo Zero

machine-learning reinforcement-learning python3 pytorch mcts alphago alphago-zero

Updated Aug 1, 2018
Python

DataCanvasIO / Hypernets

A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.

reinforcement-learning keras mcts hyperparameter-optimization evolutionary-algorithms nas monte-carlo-tree-search hyperparameter-tuning automl neural-architecture-search nasnet enas autodl

Updated Jul 19, 2024
Python

QueensGambit / CrazyAra

A Deep Learning UCI-Chess Variant Engine written in C++ & Python 🦜

python open-source machine-learning chess-engine deep-learning mxnet artificial-intelligence mcts gluon lichess convolutional-neural-network alphago python-chess alphazero crazyhouse mcgs

Updated Oct 23, 2024
Jupyter Notebook

sungyubkim / Deep_RL_with_pytorch

A PyTorch tutorial for DRL (Deep Reinforcement Learning)

deep-reinforcement-learning pytorch dqn mcts uct c51 iqn hedge ppo a2c gail counterfactual-regret-minimization qr-dqn random-network-distillation soft-actor-critic self-imitation-learning

Updated Apr 24, 2023
Jupyter Notebook

vgarciasc / mcts-viz

Visualization of MCTS algorithm applied to Tic-tac-toe.

visualization mcts tictactoe p5js

Updated Aug 25, 2021
JavaScript

initial-h / AlphaZero_Gomoku_MPI

An asynchronous/parallel implementation of the AlphaGo Zero algorithm for Gomoku

algorithm tensorflow parallel deep-reinforcement-learning mcts gomoku tree-search tensorlayer alphago mpi4py dirichlet-distribution alphazero alphazero-gomoku

Updated Jan 20, 2020
Python

thuxugang / doudizhu

AI for Dou Dizhu (斗地主, "Fight the Landlord")

reinforcement-learning ai card-game dqn mcts doudizhu

Updated Jun 13, 2018
Python

kaesve / muzero

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train both algorithms, pit them against each other, and investigate the reliability of learned MuZero MDP models.

reinforcement-learning deep-learning tensorflow deep-reinforcement-learning tf2 mcts alphazero tensorflow2 muzero

Updated Mar 28, 2021
Jupyter Notebook

akolishchak / doom-net-pytorch

Reinforcement learning models in the ViZDoom environment

agent learning reinforcement-learning pytorch doom behavior-tree mcts vizdoom reinforcement ppo doomnet-track1

Updated Mar 9, 2022
Python

manyoso / allie

Allie: A UCI compliant chess engine

chess-engine chess neural-network mcts deepmind alphabeta alphazero

Updated Apr 8, 2021
C++

zjeffer / chess-deep-rl

Research project: create a chess engine using Deep Reinforcement Learning

machine-learning chess-engine chess reinforcement-learning ai deep-learning neural-network deep-reinforcement-learning artificial-intelligence mcts neural-networks alphazero

Updated Jun 29, 2024
Jupyter Notebook

CGLemon / Sayuri

AlphaZero based engine for the game of Go (圍棋/围棋).

baduk weiqi mcts deeplearning alphago alphazero gumbel-alphazero sayuri

Updated Dec 5, 2024
C++
