UniformReplayBuffer¶
-
class
maze.train.trainers.common.replay_buffer.uniform_replay_buffer.
UniformReplayBuffer
(buffer_size: int, seed: int)¶ Replay buffer for off policy learning.
- Parameters
buffer_size – The maximum buffer size.
seed – The random seed used for initializing the uniform random sampling in the buffer.
-
add_rollout
(rollout: Union[maze.core.trajectory_recording.records.trajectory_record.SpacesTrajectoryRecord, List[maze.core.trajectory_recording.records.structured_spaces_record.StructuredSpacesRecord]]) → None¶ (overrides
BaseReplayBuffer
)implementation of
BaseReplayBuffer
-
add_transition
(transition: Union[maze.core.trajectory_recording.records.structured_spaces_record.StructuredSpacesRecord, maze.core.trajectory_recording.records.trajectory_record.SpacesTrajectoryRecord]) → None¶ (overrides
BaseReplayBuffer
)implementation of
BaseReplayBuffer
-
sample_batch
(n_samples: int, learner_device: str) → List[Union[maze.core.trajectory_recording.records.structured_spaces_record.StructuredSpacesRecord, maze.core.trajectory_recording.records.trajectory_record.SpacesTrajectoryRecord]]¶ (overrides
BaseReplayBuffer
)implementation of
BaseReplayBuffer