lupuandr

Follow

Andrei Lupu lupuandr

Follow

Ph.D. at Oxford and Meta AI studying (multi-agent) reinforcement learning. Formerly at Mila / McGill University

13 followers · 3 following

FLAIR, University of Oxford / FAIR team at Meta AI
Oxford, UK

Achievements

Achievements

Highlights

Pro

Organizations

Pinned Loading

Target-UCB Public

Simple implementation of the Target-UCB algorithm.

Python 2
luchris429/purejaxrl Public

Really Fast End-to-End Jax RL Implementations

Python 858 69
FLAIROx/JaxMARL Public

Multi-Agent Reinforcement Learning with JAX

Python 559 112
montrealrobotics/DeepRLInTheWorld Public

From search engines, to science, to robotics, this reposity is meant to showcase the use of reinforcement learning in the world..

229 29
facebookresearch/off-belief-learning Public archive

Implementation of the Off Belief Learning algorithm.

Python 46 8
FLAIROx/behaviour-distillation Public

Code for Behaviour Distillation (ICML 2024)

Python 3