Network Interface

Mandatory Interface

AlphaZero.Network.AbstractNetwork (Type)
AbstractNetwork

Abstract base type for a neural network.

Constructor

Any subtype Network must implement Base.copy along with the following constructor:

Network(game_spec, hyperparams)

where the expected type of hyperparams is given by HyperParams(Network).

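For illustration, here is a minimal sketch of what satisfying this contract might look like, using a hypothetical DummyNetwork type with a hypothetical DummyHyperParams struct (neither is part of the library):

    import AlphaZero.Network: AbstractNetwork, HyperParams

    struct DummyHyperParams
      width :: Int
    end

    mutable struct DummyNetwork <: AbstractNetwork
      hyper :: DummyHyperParams
      # framework-specific layers would be stored here
    end

    # Tell callers which hyperparameter type this network expects.
    HyperParams(::Type{DummyNetwork}) = DummyHyperParams

    # The required constructor: build a network for a given game.
    DummyNetwork(game_spec, hyper::DummyHyperParams) = DummyNetwork(hyper)

    Base.copy(nn::DummyNetwork) = DummyNetwork(nn.hyper)
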
AlphaZero.Network.forward (Function)
forward(::AbstractNetwork, states)

Compute the forward pass of a network on a batch of inputs.

Expect a Float32 tensor states whose batch dimension is the last one.

Return a (P, V) pair where:

  • P is a matrix of size (num_actions, batch_size). It is allowed to put weight on invalid actions (see evaluate).
  • V is a row vector of size (1, batch_size).
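
To make the shape contract concrete, here is an illustrative check; it assumes nn is a concrete network for a game with 7 actions and 3×3 vectorized states (the numbers are made up):

    states = rand(Float32, 3, 3, 16)  # batch of 16 states, batch dimension last
    P, V = Network.forward(nn, states)
    @assert size(P) == (7, 16)        # one action distribution per sample
    @assert size(V) == (1, 16)        # one scalar value per sample
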
AlphaZero.Network.train! (Function)
train!(callback, ::AbstractNetwork, opt::OptimiserSpec, loss, data, n)

Update a given network to fit some data.

  • opt specifies which optimiser to use.
  • loss is a function that maps a batch of samples to a tracked real.
  • data is an iterator over minibatches.
  • n is the number of minibatches. If length is defined on data, then length(data) == n must hold. However, not all finite iterators implement length, which is why this argument is needed.
  • callback(i, loss) is called at each step with the batch number i and the loss on the last batch.
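
As an illustration, the calling convention (including the do-block form for the callback) might look as follows; nn, opt, loss and the hypothetical minibatch_iterator are assumed to be defined elsewhere:

    batches = collect(minibatch_iterator)  # any iterator over minibatches works
    Network.train!(nn, opt, loss, batches, length(batches)) do i, l
      println("batch $i: loss = $l")
    end
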
AlphaZero.Network.set_test_mode! (Function)
set_test_mode!(::AbstractNetwork, mode=true)

Put a network in test mode or in training mode. This is relevant for networks featuring layers, such as batch normalization, that behave differently during training and inference.


Conversion and Copy

AlphaZero.Network.to_gpu (Function)
to_gpu(::AbstractNetwork)

Return a copy of the given network that has been transferred to the GPU if one is available. Otherwise, return the given network untouched.

AlphaZero.Network.to_cpu (Function)
to_cpu(::AbstractNetwork)

Return a copy of the given network that has been transferred to the CPU, or return the given network untouched if it is already on the CPU.

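A typical transfer pattern, sketched under the assumption that nn is a network living on the CPU:

    nn_gpu = Network.to_gpu(nn)      # no-op if no GPU is available
    # ... run inference or training on nn_gpu ...
    nn_cpu = Network.to_cpu(nn_gpu)  # e.g. before serializing the weights
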
AlphaZero.Network.convert_input (Function)
convert_input(::AbstractNetwork, input)

Convert an array (or number) to the right format so that it can be used as an input by a given network.

AlphaZero.Network.convert_output (Function)
convert_output(::AbstractNetwork, output)

Convert an array (or number) produced by a neural network to a standard CPU array (or number) type.

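These two helpers are typically used to bracket a call to forward. The following sketch shows the pattern; it is not the library's actual implementation:

    x = Network.convert_input(nn, states)  # e.g. transfer the batch to the GPU
    p, v = Network.forward(nn, x)
    P = Network.convert_output(nn, p)      # back to standard CPU arrays
    V = Network.convert_output(nn, v)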

Derived Functions

Evaluation Functions

AlphaZero.Network.forward_normalized (Function)
forward_normalized(network::AbstractNetwork, states, actions_mask)

Evaluate a batch of vectorized states. This function is a wrapper around forward that puts a zero weight on invalid actions.

Arguments

  • states is a tensor whose last dimension has size batch_size.
  • actions_mask is a binary matrix of size (num_actions, batch_size).

Returned value

Return a (P, V, Pinv) triple where:

  • P is a matrix of size (num_actions, batch_size).
  • V is a row vector of size (1, batch_size).
  • Pinv is a row vector of size (1, batch_size) that indicates the total probability weight put by the network on invalid actions for each sample.

All tensors manipulated by this function have elements of type Float32.

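One way to realize this contract on top of forward is sketched below; the library's actual implementation may differ in its details:

    P, V = Network.forward(nn, states)
    Pm = P .* actions_mask               # zero the weight on invalid actions
    total = sum(Pm, dims=1)              # probability weight left on valid actions
    Pinv = 1f0 .- total                  # weight the network put on invalid actions
    P = Pm ./ max.(total, eps(Float32))  # renormalize over valid actions
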
AlphaZero.Network.evaluate (Function)
evaluate(::AbstractNetwork, state)

(nn::AbstractNetwork)(state) = evaluate(nn, state)

Evaluate the neural network as an MCTS oracle on a single state.

Note, however, that evaluating states one at a time is slow, so you may want to use a BatchedOracle along with an inference server that uses evaluate_batch.

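Both call forms below are equivalent; nn and state are assumed to be defined:

    P, V = Network.evaluate(nn, state)
    P, V = nn(state)   # functor syntax, as defined above
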
AlphaZero.Network.evaluate_batch (Function)
evaluate_batch(::AbstractNetwork, batch)

Evaluate the neural network as an MCTS oracle on a batch of states at once.

Take a list of states as input and return a list of (P, V) pairs as defined in the MCTS oracle interface.

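A sketch of the contract, with s1, s2 and s3 standing for arbitrary states:

    results = Network.evaluate_batch(nn, [s1, s2, s3])
    P1, V1 = results[1]   # one (P, V) pair per input state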

Misc

AlphaZero.Network.copy (Method)
copy(::AbstractNetwork; on_gpu, test_mode)

A copy function that also handles CPU/GPU transfers and test/train mode switches.

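For example, one might duplicate a network onto the GPU in test mode before running an evaluation (an illustrative call, not a prescribed recipe):

    nn_eval = copy(nn, on_gpu=true, test_mode=true)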

Optimiser Specification

AlphaZero.Network.CyclicNesterov (Type)
CyclicNesterov(; lr_base, lr_high, lr_low, momentum_low, momentum_high)

SGD optimiser with a cyclic learning rate and cyclic Nesterov momentum.

  • During an epoch, the learning rate goes from lr_low to lr_high and then back to lr_low.
  • The momentum evolves in the opposite way, from high values to low values and then back to high values.
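
An illustrative construction (the values are placeholders, not recommended settings):

    opt = Network.CyclicNesterov(
      lr_base=1e-3, lr_low=1e-4, lr_high=1e-2,
      momentum_low=0.85, momentum_high=0.95)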