Module: prioritized_replay_buffer

An implementation of Prioritized Experience Replay (PER).

This implementation is based on the paper “Prioritized Experience Replay” by Tom
Schaul et al. (2015). Many thanks to Tom Schaul, John Quan, and Matteo Hessel
for providing useful pointers on the algorithm and its implementation.
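
As a rough illustration of the idea behind PER (not the code in this module), the sketch below samples stored transitions with probability proportional to priority raised to a power alpha, and returns importance-sampling weights to correct for the non-uniform sampling. The class name TinyPrioritizedBuffer, its methods, and the default values of alpha and beta are hypothetical and chosen only for this example. It uses a plain list rather than the sum-tree structure that efficient implementations typically rely on, and it normalizes weights by the largest weight in the sampled batch as a simplification.

```python
import random


class TinyPrioritizedBuffer:
    """Hypothetical minimal proportional-priority buffer (illustration only)."""

    def __init__(self, capacity, alpha=0.6, seed=None):
        self.capacity = capacity
        self.alpha = alpha          # 0 gives uniform sampling, 1 gives fully proportional
        self.items = []
        self.priorities = []        # assumed to be strictly positive
        self.cursor = 0             # next slot to write (circular)
        self.rng = random.Random(seed)

    def add(self, transition, priority=None):
        # By default a new transition gets the largest priority seen so far,
        # so it is likely to be replayed soon after being stored.
        if priority is None:
            priority = max(self.priorities, default=1.0)
        if len(self.items) < self.capacity:
            self.items.append(transition)
            self.priorities.append(priority)
        else:
            # Buffer is full: overwrite the oldest entry.
            self.items[self.cursor] = transition
            self.priorities[self.cursor] = priority
        self.cursor = (self.cursor + 1) % self.capacity

    def sample(self, batch_size, beta=0.4):
        # P(i) = p_i^alpha / sum_k p_k^alpha
        scaled = [p ** self.alpha for p in self.priorities]
        total = sum(scaled)
        probs = [s / total for s in scaled]
        n = len(self.items)
        indices = self.rng.choices(range(n), weights=probs, k=batch_size)
        # Importance-sampling weights w_i = (N * P(i))^(-beta),
        # rescaled here so the largest weight in the batch is 1.
        weights = [(n * probs[i]) ** -beta for i in indices]
        max_weight = max(weights)
        weights = [w / max_weight for w in weights]
        return indices, [self.items[i] for i in indices], weights

    def set_priority(self, index, priority):
        # Typically called after a learning step with the new absolute TD error.
        self.priorities[index] = priority
```

A training loop built on this sketch would call sample() to draw a batch, scale each transition's loss by its returned weight, and then call set_priority() with the updated error for each sampled index.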

Classes

class OutOfGraphPrioritizedReplayBuffer:
An out-of-graph replay buffer for Prioritized Experience Replay.

class WrappedPrioritizedReplayBuffer:
A wrapper around OutOfGraphPrioritizedReplayBuffer that provides in-graph sampling.