Evaluation Metrics

struct AllRewards <: AbstractEvaluationMetrics
    rewards::Array{Float64, 1}

Records all rewards.

AllRewards()

Initializes with an empty rewards array.

EvaluationPerEpisode <: AbstractEvaluationMetrics
    values::Array{Float64, 1}
    metric::SimpleEvaluationMetric

Stores the value of the simple metric for each episode in values.

EvaluationPerEpisode(metric = MeanReward())

Initializes with an empty values array and the given simple metric (default MeanReward). Other options are TimeSteps (to measure the length of each episode) and TotalReward.
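
The per-episode bookkeeping described above can be sketched as follows. This is a minimal illustration, not the package's implementation: SketchPerEpisode, its steps field and record! are hypothetical names, and the inner metric is assumed to behave like TimeSteps and to reset at the end of each episode.

```julia
# Hypothetical stand-in for illustration; not the package's EvaluationPerEpisode.
mutable struct SketchPerEpisode
    values::Vector{Float64}  # one entry per finished episode
    steps::Int64             # assumed inner metric: time steps in the current episode
end

# Start with an empty values array and a zeroed inner metric.
SketchPerEpisode() = SketchPerEpisode(Float64[], 0)

# Count one time step; when the episode ends, store the count and reset it.
function record!(e::SketchPerEpisode, isterminal::Bool)
    e.steps += 1
    if isterminal
        push!(e.values, e.steps)
        e.steps = 0  # assumption: the inner metric restarts with each episode
    end
    return e
end

# Two episodes: the first lasts 3 steps, the second 2 steps.
e = SketchPerEpisode()
for done in (false, false, true, false, true)
    record!(e, done)
end
```

After the loop, e.values holds one episode length per completed episode.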

EvaluationPerT <: AbstractEvaluationMetrics
    T::Int64
    counter::Int64
    values::Array{Float64, 1}
    metric::SimpleEvaluationMetric

Stores the value of the simple metric after every T steps in values.

EvaluationPerT(T, metric = MeanReward())

Initializes with T, counter = 0, an empty values array and the given simple metric (default MeanReward). Another option is TotalReward.
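
One way to picture the interplay of T, counter and values is the sketch below. It is only an illustration under stated assumptions: SketchPerT, its total field and record! are hypothetical names, the inner metric is assumed to accumulate rewards like TotalReward, and both the counter and the metric are assumed to reset after every T steps.

```julia
# Hypothetical stand-in for illustration; not the package's EvaluationPerT.
mutable struct SketchPerT
    T::Int64
    counter::Int64
    values::Vector{Float64}  # one entry per completed window of T steps
    total::Float64           # assumed inner metric: sum of rewards in the window
end

# Start with counter = 0, an empty values array and a zeroed inner metric.
SketchPerT(T) = SketchPerT(T, 0, Float64[], 0.0)

# Add one reward; after every T steps, store the metric value and reset.
function record!(p::SketchPerT, r::Real)
    p.total += r
    p.counter += 1
    if p.counter == p.T
        push!(p.values, p.total)
        p.total = 0.0   # assumption: the metric restarts with each window
        p.counter = 0
    end
    return p
end

# Seven steps with T = 3: two full windows, one step still pending.
p = SketchPerT(3)
for r in 1:7
    record!(p, r)
end
```

With rewards 1 to 7, the two stored values are the window sums 6 and 15; the seventh reward stays in the unfinished window.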

mutable struct MeanReward <: TabularReinforcementLearning.SimpleEvaluationMetric
    meanreward::Float64
    counter::Int64

Computes the mean reward iteratively.

MeanReward()

Initializes counter and meanreward to 0.
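
An iterative mean needs only the two fields listed above, because each new reward r can be folded in as meanreward += (r - meanreward) / counter. The sketch below shows that update rule; SketchMeanReward and update! are hypothetical names chosen for illustration, and the package's own update code may differ.

```julia
# Hypothetical stand-in for illustration; not the package's MeanReward.
mutable struct SketchMeanReward
    meanreward::Float64
    counter::Int64
end

# Start with counter and meanreward at 0.
SketchMeanReward() = SketchMeanReward(0.0, 0)

# Fold one reward into the running mean without storing past rewards.
function update!(m::SketchMeanReward, r::Real)
    m.counter += 1
    m.meanreward += (r - m.meanreward) / m.counter
    return m
end

# Rewards 1, 0, 2, 1 have mean 1.
m = SketchMeanReward()
for r in (1.0, 0.0, 2.0, 1.0)
    update!(m, r)
end
```

The running value matches the ordinary mean of all rewards seen so far, here 1.0 after four updates, while memory use stays constant.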

struct RecordAll <: AbstractEvaluationMetrics
    r::Array{Float64, 1}
    a::Array{Int64, 1}
    s::Array{Int64, 1}
    isterminal::Array{Bool, 1}

Records everything: rewards in r, actions in a, states in s and terminal flags in isterminal.

RecordAll()

Initializes with empty arrays.

mutable struct TimeSteps <: SimpleEvaluationMetric
    counter::Int64

Counts the number of time steps the simulation has run.

TimeSteps()

Initializes counter to 0.

mutable struct TotalReward <: TabularReinforcementLearning.SimpleEvaluationMetric
    reward::Float64

Accumulates all rewards.

TotalReward()

Initializes reward to 0.
