AdverserialPolicy

class itergp.methods.policies.AdverserialPolicy(base_policy)

Bases: LinearSolverPolicy

Adversarial policy.

Policy returning actions that are orthogonal to the data \(y\), so that the posterior mean is not updated in any iteration except the last. The actions are given by \(s_i = (I - y(y^\top y)^{-1}y^\top)\tilde{s}_i\), where the \(\tilde{s}_i\) are arbitrary linearly independent vectors.
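The projection in the formula above can be illustrated with a minimal NumPy sketch. It does not use the itergp API: the helper name `project_orthogonal_to` and the random test vectors are assumptions made purely for illustration. It applies \(I - y(y^\top y)^{-1}y^\top\) to a vector without forming the projector matrix and checks that the result is orthogonal to \(y\).

```python
import numpy as np


def project_orthogonal_to(y, s_tilde):
    """Apply (I - y (y^T y)^{-1} y^T) to s_tilde.

    Hypothetical helper for illustration only; not part of itergp.
    The projector matrix is never formed explicitly.
    """
    y = np.asarray(y, dtype=float)
    s_tilde = np.asarray(s_tilde, dtype=float)
    # Subtract the component of s_tilde that lies along y.
    return s_tilde - y * (y @ s_tilde) / (y @ y)


rng = np.random.default_rng(0)
y = rng.standard_normal(5)        # stand-in for the data vector
s_tilde = rng.standard_normal(5)  # stand-in for a base-policy action

s = project_orthogonal_to(y, s_tilde)

# The projected action has (numerically) zero inner product with y.
print(np.isclose(y @ s, 0.0))
```

Because the projector is idempotent, applying the helper to an already projected action leaves it unchanged.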

Parameters

base_policy (policies.LinearSolverPolicy) – Policy that generates the vectors \(\tilde{s}_i\), which are then projected to be orthogonal to \(y\).

Return type

None

Attributes Summary

base_policy

Methods Summary

__call__(solver_state, rng)

Return an action for a given solver state.

Attributes Documentation

base_policy

Methods Documentation

__call__(solver_state, rng)[source]

Return an action for a given solver state.

Parameters

solver_state – State of the linear solver.
rng – Random number generator.

Returns

Next action to take.

Return type

action