Skip to content

Latest commit

 

History

History
68 lines (52 loc) · 4.59 KB

web_resources.md

File metadata and controls

68 lines (52 loc) · 4.59 KB

Optimization

  • Notes on gradient descent, Toussaint 2012. pdf. Check Algorithm 2 and 3

Probability

  • Review of expectation, 2009. pdf

Topics

Overview

Tutorials

Repos

  • Minimal clean examples. Iteration methods, policy gradient, Grid world, CartPole, Atari, etc. repo
  • Many PyTorch tutorials, all levels, for image and text. repo
  • OpenAi Universe starter code, A3C algo. repo
  • Minimalist REINFORCE for discrete and continuous actions. repo
  • RLCode. Minimal example of DQN, DDQN, PG, A2C, A3C
  • ikostrikov: a2c, ppo, acktr

Tools

Course

  • Deep RL Bootcamp, site

Datasets

NLP

  • The Stanford Natural Language Inference (SNLI) Corpus. 570k human-written English sentences. Text entailment site

Environments

Simulators

Environments

  • VizDoom. Doom environment using only visual information. Visuals include: FPV game pixels, object labelling visual, depth map, 2D map. Should probably use with a gym wrapper, like this one. To understand how to setup the engine, checkout this minimalist example. Also, checkout this pytorch example.
  • MAME tookit, wrapper around the popular MAME arcade emulator
  • MiniWorl, 2D and 3D environments, minimial dependencies, gym friendly

DL

  • Unreasonable effectiveness of one neuron, blog

Math

  • Matrix albegra review, 24 pages, pdf
  • Variational Inference, slides
  • Maximum likelihood, blog

Coding

State Course

NLP