MyCaffe  1.12.2.41
Deep learning software for Windows C# programmers.
MyCaffe.trainers.dqn.noisy.simple.TrainerNoisyDqn< T > Class Template Reference

The TrainerNoisyDqn implements the Noisy-DQN algorithm as described by Google Dopamine DNQAgent, Gheshlagi et al. and inspired by 'higgsfield' More...

Inheritance diagram for MyCaffe.trainers.dqn.noisy.simple.TrainerNoisyDqn< T >:
MyCaffe.trainers.IxTrainerRL MyCaffe.trainers.IxTrainer

Public Member Functions

 TrainerNoisyDqn (MyCaffeControl< T > mycaffe, PropertySet properties, CryptoRandom random, IxTrainerCallback icallback)
 The constructor. More...
 
void Dispose ()
 Release all resources used. More...
 
bool Initialize ()
 Initialize the trainer. More...
 
bool Shutdown (int nWait)
 Shutdown the trainer. More...
 
ResultCollection RunOne (int nDelay=1000)
 Run a single cycle on the environment after the delay. More...
 
byte[] Run (int nN, PropertySet runProp, out string type)
 Run a set of iterations and return the resuts. More...
 
bool Test (int nN, ITERATOR_TYPE type)
 Run the test cycle - currently this is not implemented. More...
 
bool Train (int nN, ITERATOR_TYPE type, TRAIN_STEP step)
 Train the network using a modified PG training algorithm optimized for GPU use. More...
 

Detailed Description

The TrainerNoisyDqn implements the Noisy-DQN algorithm as described by Google Dopamine DNQAgent, Gheshlagi et al. and inspired by 'higgsfield'

See also
Noisy Networks for Exploration, Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Ian Osband, Alex Graves, Vlad Mnih, Remi Munos, Demis Hassabis, Olivier Pietquin, Charles Blundell, Shane Legg, arXiv:1706.10295
Prioritized Experience Replay, Tom Schaul, John Quan, Ioannis Antonoglou, David Silver, 2016
Dopamine: A Research Framework for Deep Reinforcement Learning Pablo Samuel Castro, Subhodeep Moitra, Carles Gelada, Saurabh Kumar, and Marc G. Bellemare, 2018, Google Brain
Github:google/dopamine, Google, 2018, license Apache 2.0 (https://github.com/google/dopamine/blob/master/LICENSE)
GitHub:Kaixhin/NoisyNet-A3C, Kaixhin 2018, license MIT (https://github.com/Kaixhin/NoisyNet-A3C/blob/master/LICENSE.md)
Github:openai/baselines, OpenAI, 2018, license MIT (https://github.com/openai/baselines/blob/master/LICENSE)
GitHub:Kyushik/DRL, Kyushik, 2019
Github:higgsfield/RL-Adventure, higgsfield, 2018
Template Parameters
T

Definition at line 36 of file TrainerNoisyDqn.cs.

Constructor & Destructor Documentation

◆ TrainerNoisyDqn()

The constructor.

Parameters
mycaffeSpecifies the MyCaffeControl to use for learning and prediction.
propertiesSpecifies the property set containing the key/value pairs of property settings.
randomSpecifies a Random number generator used for random selection.
icallbackSpecifies the callback for parent notifications and queries.

Definition at line 50 of file TrainerNoisyDqn.cs.

Member Function Documentation

◆ Dispose()

Release all resources used.

Definition at line 61 of file TrainerNoisyDqn.cs.

◆ Initialize()

Initialize the trainer.

Returns
Returns true.

Implements MyCaffe.trainers.IxTrainer.

Definition at line 69 of file TrainerNoisyDqn.cs.

◆ Run()

byte[] MyCaffe.trainers.dqn.noisy.simple.TrainerNoisyDqn< T >.Run ( int  nN,
PropertySet  runProp,
out string  type 
)

Run a set of iterations and return the resuts.

Parameters
nNSpecifies the number of samples to run.
runPropOptionally specifies properties to use when running.
typeReturns the data type contained in the byte stream.
Returns
The results of the run containing the action are returned as a byte stream.

Implements MyCaffe.trainers.IxTrainerRL.

Definition at line 127 of file TrainerNoisyDqn.cs.

◆ RunOne()

Run a single cycle on the environment after the delay.

Parameters
nDelaySpecifies a delay to wait before running.
Returns
The results of the run containing the action are returned.

Implements MyCaffe.trainers.IxTrainerRL.

Definition at line 111 of file TrainerNoisyDqn.cs.

◆ Shutdown()

bool MyCaffe.trainers.dqn.noisy.simple.TrainerNoisyDqn< T >.Shutdown ( int  nWait)

Shutdown the trainer.

Parameters
nWaitSpecifies a wait in ms. for the shutdown to complete.
Returns
Returns true.

Implements MyCaffe.trainers.IxTrainer.

Definition at line 81 of file TrainerNoisyDqn.cs.

◆ Test()

Run the test cycle - currently this is not implemented.

Parameters
nNSpecifies the number of iterations (based on the ITERATION_TYPE) to run, or -1 to ignore.
typeSpecifies the iteration type (default = ITERATION).
Returns
A value of true is returned when handled, false otherwise.

Implements MyCaffe.trainers.IxTrainer.

Definition at line 143 of file TrainerNoisyDqn.cs.

◆ Train()

bool MyCaffe.trainers.dqn.noisy.simple.TrainerNoisyDqn< T >.Train ( int  nN,
ITERATOR_TYPE  type,
TRAIN_STEP  step 
)

Train the network using a modified PG training algorithm optimized for GPU use.

Parameters
nNSpecifies the number of iterations (based on the ITERATION_TYPE) to run, or -1 to ignore.
typeSpecifies the iteration type (default = ITERATION).
stepSpecifies the stepping mode to use (when debugging).
Returns
A value of true is returned when handled, false otherwise.

Implements MyCaffe.trainers.IxTrainer.

Definition at line 169 of file TrainerNoisyDqn.cs.


The documentation for this class was generated from the following file: