lupuandr
  • FLAIR, University of Oxford / FAIR team at Meta AI
  • Oxford, UK

Highlights

  • Pro

Organizations

@fairinternal

Pinned

  1. Target-UCB Public

    Simple implementation of the Target-UCB algorithm.

    Python 2

  2. luchris429/purejaxrl Public

    Really Fast End-to-End Jax RL Implementations

    Python 858 69

  3. FLAIROx/JaxMARL Public

    Multi-Agent Reinforcement Learning with JAX

    Python 559 112

  4. montrealrobotics/DeepRLInTheWorld Public

    From search engines, to science, to robotics, this repository is meant to showcase the use of reinforcement learning in the world.

    229 29

  5. facebookresearch/off-belief-learning Public archive

    Implementation of the Off Belief Learning algorithm.

    Python 46 8

  6. FLAIROx/behaviour-distillation Public

    Code for Behaviour Distillation (ICML 2024)

    Python 3