Skip to content

Latest commit

 

History

History

controller

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 

Controllers

We mainly use the RBF controller for our experiments as it is universally applicable for all problems. Linear controllers are too limited for most applications, but can be sufficient for stabilization around an equilibrium. see Consequently, the CartpoleStabShort-v0 task could be solved with this, however in our experiments the RBF policy was also able to solve the environment after only 2 episodes and the inital random samples.
New controllers can be implemented by inheriting from the Controller class.

Linear Controller

The linear controller is computing the action by weighting the state with an NxN matrix. We found that the linear controller ist often causing numerical issues if not initalized perfectly. We recommend to use rbf policies for all tasks.

RBF Controller

We define our RBF controller as deterministic Gaussian Process based on the original implementation from Deisenroth. The signal variance is fixed to 1 and the noise variance to 0.01, which results in a signal-to-noise ration of 10. The lengthscales are initialized with 1. The majority of the parameters are pseudo samples with trainable inputs and targets for the GP fit. Further, we squash the action output of the RBF Policy with the sin transformation to an predefined symmetric action range.